Fast cross product is not that fast

Relevant code: https://github.com/nipy/PySurfer/blob/a5a019ec5c6d25cdef92aca84a5e1ce1f9ba6fef/surfer/utils.py#L179-L213

Hi,

I was browsing through the code and I accidentally stumbled upon the function `_fast_cross_3d` referenced in the URL above.
I don't think that this implementation is faster than `numpy`'s implementation of the cross product - for example, please take a look at the following code:
```python
import numpy as np

x = np.random.rand(1_000_000, 3)
y = np.random.rand(1, 3)

def cross_3d_pysurfer(x, y):
    return np.c_[x[:, 1] * y[:, 2] - x[:, 2] * y[:, 1],
                 x[:, 2] * y[:, 0] - x[:, 0] * y[:, 2],
                 x[:, 0] * y[:, 1] - x[:, 1] * y[:, 0]]

def cross_3d_numpy(x, y):
    return np.cross(x, y)
```
When I time it, I get the following results:
```bash
In [1]: %timeit cross_3d_pysurfer(x, y)
Out[1]: 34.7 ms ± 3.9 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

In [2]: %timeit cross_3d_numpy(x, y)
Out[2]: 29.4 ms ± 4.63 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
```

A potential, but very small speed up could be achieved by using `np.stack` instead of `np.c_` because it has some cool compiled `numpy`'s stuff which `np.c_` lacks. For example:

```python
def cross_3d_pysurfer_modified(x, y):
    return np.stack((x[:, 1] * y[:, 2] - x[:, 2] * y[:, 1],
                     x[:, 2] * y[:, 0] - x[:, 0] * y[:, 2],
                     x[:, 0] * y[:, 1] - x[:, 1] * y[:, 0])).T
```

```bash
In [3]: %timeit cross_3d_pysurfer_modified(x, y)
Out[3]: 28.4 ms ± 3.95 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
```

I didn't want to open a PR because I really don't know if this is even relevant anymore or if the difference in a few ms is that important, but still wanted to let you know :)

	def _fast_cross_3d(x, y):
	"""Compute cross product between list of 3D vectors

	Much faster than np.cross() when the number of cross products
	becomes large (>500). This is because np.cross() methods become
	less memory efficient at this stage.

	Parameters
	----------
	x : array
	Input array 1.
	y : array
	Input array 2.

	Returns
	-------
	z : array
	Cross product of x and y.

	Notes
	-----
	x and y must both be 2D row vectors. One must have length 1, or both
	lengths must match.
	"""
	assert x.ndim == 2
	assert y.ndim == 2
	assert x.shape[1] == 3
	assert y.shape[1] == 3
	assert (x.shape[0] == 1 or y.shape[0] == 1) or x.shape[0] == y.shape[0]
	if max([x.shape[0], y.shape[0]]) >= 500:
	return np.c_[x[:, 1] * y[:, 2] - x[:, 2] * y[:, 1],
	x[:, 2] * y[:, 0] - x[:, 0] * y[:, 2],
	x[:, 0] * y[:, 1] - x[:, 1] * y[:, 0]]
	else:
	return np.cross(x, y)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fast cross product is not that fast #315

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Fast cross product is not that fast #315

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions