Airspeed velocity benchmarks #1

hameerabbasi · 2019-04-15T15:55:09Z

No description provided.

skrah · 2019-04-16T10:42:42Z

Thanks, this takes a bit of time to comprehend. I have a couple of remarks:

I find asv unsuitable for interactive use, so I prefer to leave the original files. Perhaps put everything in an asv directory and the original .py files into a scripts directory.
I couldn't find the reason why the tuple access result in asv differs so much from the loop result:

Accessing an element in an array of tuples
------------------------------------------

   xnd:   0.2511618137359619
   numpy: 0.7269613742828369

Is there some sort of make clean for asv?

hameerabbasi · 2019-04-16T11:02:52Z

Thanks, this takes a bit of time to comprehend. I have a couple of remarks:

It is pretty thick code, I agree.

I find asv unsuitable for interactive use, so I prefer to leave the original files. Perhaps put everything in an asv directory and the original .py files into a scripts directory.

Done.

I couldn't find the reason why the tuple access result in asv differs so much from the loop result:

I'm pretty sure it's because the Python object has to be constructed each time you do something like a[0], whereas in the tuple access, everything can be done in the C level and then accessed from Python once. It may also be the overhead of the loop itself.

Is there some sort of make clean for asv?

There is asv rm bit it just cleans the results for previous commits' results, not the entire environment tree.

I would just use git clean -xfd ..

skrah · 2019-04-16T12:29:50Z

I'd like to go to the bottom of the issue of different results for tuple element access. I don't think it's a loop issue, because all access benchmarks have the same loop.

One reason might be that xnd generates views also for elements and numpy does not (by default). So that would be an advantage for xnd that should show up in the benchmarks.

Are you sure that single elements and not subtrees are accessed (I don't have time to really dig into the code)?

hameerabbasi · 2019-04-16T12:32:06Z

Oh. If dim[1] < dim[0], it's a subtree access. If dim[1] == dim[0], an element is accessed (In NumPy), and an element-view (In XND).

hameerabbasi · 2019-04-16T12:36:11Z

Actually, in the for loop version, the last iteration of the loop in NumPy accesses a scalar (for dim[0] == dim[1]).

dim[0] == array.ndim, dim[1] is the number of integer indices.

hameerabbasi

To expand on the previous issue.

hameerabbasi · 2019-04-16T12:38:53Z

asv/benchmarks/benchmarks.py

+        self.array = globals()[module].array(lst)
+
+    def time_access_tuple(self, size, dim, module):
+        self.array[(0,) * dim[1]]


Here's the access for tuple access. Equivalent to a[0, 0, ...] where the ellipsis is a continuation and not a Python ellipsis.

NumPy does not appear to be able to use multi-indexing on tuples:

>>> import numpy as np >>> lst = [('Rex', 9, 81.0)] * 100 >>> dt = np.dtype([('name', 'U10'), ('age', 'i4'), ('weight', 'f4')]) >>> x = np.array(lst, dtype=dt) >>> y = x[0][0] >>> type(y) <class 'numpy.str_'> >>> >>> y = x[0, 0] Traceback (most recent call last): File "<stdin>", line 1, in <module> IndexError: too many indices for array

I don't test tuples in the element access at all, I could add that.

The fast benchmark measures y = x[0][0], i.e. it accesses the 'Rex' tuple element, not the array element.

xnd is able to access through tuples:

>>> lst = [('Rex', 9, 81.0)] * 100 >>> x = xnd(lst) >>> x[0, 0] xnd('Rex', type='string')

So that should be even faster for xnd.

hameerabbasi · 2019-04-16T12:39:16Z

asv/benchmarks/benchmarks.py

+        self.array[(0,) * dim[1]]
+
+    def time_access_chained(self, size, dim, module):
+        a = self.array


Here, I cache a, then in a loop, access it. This is equivalent to a[0][0]...

skrah · 2019-04-16T14:16:00Z

In general I think people are likely to write benchmarks for small arrays. xnd is currently at a disadvantage because the python convenience classes in __init__.py are not optimized for small arrays.

Here is the construction benchmark when using the C Xnd constructor directly on a small one element list:

from xnd import Xnd
...
for i in range(repeat):
    x = Xnd(ndt("1 * int64"), lst)
...

Dtype provided
--------------

   xnd:   0.07504105567932129
   numpy: 0.08943438529968262

skrah · 2019-04-16T14:17:21Z

So I'll rewrite the class in C now before publishing the benchmarks.

hameerabbasi requested a review from skrah April 15, 2019 15:58

Add ASV benchmarks.

c0cd204

hameerabbasi force-pushed the asv-benchmarks branch from b0b95ce to c0cd204 Compare April 16, 2019 11:03

hameerabbasi commented Apr 16, 2019

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Airspeed velocity benchmarks #1

Airspeed velocity benchmarks #1

hameerabbasi commented Apr 15, 2019

skrah commented Apr 16, 2019

hameerabbasi commented Apr 16, 2019

skrah commented Apr 16, 2019

hameerabbasi commented Apr 16, 2019

hameerabbasi commented Apr 16, 2019

hameerabbasi left a comment

hameerabbasi Apr 16, 2019 •

edited

Loading

skrah Apr 16, 2019 •

edited

Loading

hameerabbasi Apr 16, 2019

skrah Apr 16, 2019 •

edited

Loading

skrah Apr 16, 2019 •

edited

Loading

hameerabbasi Apr 16, 2019

skrah commented Apr 16, 2019

skrah commented Apr 16, 2019

Airspeed velocity benchmarks #1

Are you sure you want to change the base?

Airspeed velocity benchmarks #1

Conversation

hameerabbasi commented Apr 15, 2019

skrah commented Apr 16, 2019

hameerabbasi commented Apr 16, 2019

skrah commented Apr 16, 2019

hameerabbasi commented Apr 16, 2019

hameerabbasi commented Apr 16, 2019

hameerabbasi left a comment

Choose a reason for hiding this comment

hameerabbasi Apr 16, 2019 • edited Loading

Choose a reason for hiding this comment

skrah Apr 16, 2019 • edited Loading

Choose a reason for hiding this comment

hameerabbasi Apr 16, 2019

Choose a reason for hiding this comment

skrah Apr 16, 2019 • edited Loading

Choose a reason for hiding this comment

skrah Apr 16, 2019 • edited Loading

Choose a reason for hiding this comment

hameerabbasi Apr 16, 2019

Choose a reason for hiding this comment

skrah commented Apr 16, 2019

skrah commented Apr 16, 2019

hameerabbasi Apr 16, 2019 •

edited

Loading

skrah Apr 16, 2019 •

edited

Loading

skrah Apr 16, 2019 •

edited

Loading

skrah Apr 16, 2019 •

edited

Loading