Improve performance of flatten in DenseMatrix #3400

dvd101x · 2025-02-21T23:48:22Z

Hi, this improves the performance of flattening matrices by 60%.

josdejong · 2025-02-27T11:18:59Z

Thanks David. I've run the benchmark comparing develop and this PR, but I get smaller performance gains than you, on the order of a 10-20% improvement. I'm a bit in doubt whether we should merge this PR: it adds quite some complexity to the code, while the performance improvement seems relatively small. It feels a bit like micro-optimization (a different Node.js version or browser may make a bigger performance difference than the code optimization here). What do you think?

# develop

flatten(array)             0.77 µs   ±0.39%
flatten(genericMatrix)     1.99 µs   ±0.44%
flatten(numberMatrix)      2.00 µs   ±0.57%

# PR 3400

flatten(array)             0.60 µs   ±0.19%
flatten(genericMatrix)     1.88 µs   ±0.63%
flatten(numberMatrix)      1.87 µs   ±0.54%

dvd101x · 2025-02-27T13:52:10Z

Thanks for reviewing this Jos, let me check what happened and report back.

dvd101x · 2025-03-04T02:55:43Z

Hi, I was missing a pair of parentheses: I was using Matrix.size instead of Matrix.size().
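
This slip is easy to miss because a method reference is itself an object with a `length` property, so reading the size without calling it fails silently instead of throwing. A minimal sketch, using a hypothetical stand-in class (`TinyMatrix`, not the real DenseMatrix) and assuming the size was used to derive a nesting depth, since the thread does not show the exact call site:

```javascript
// Hypothetical stand-in for a matrix class; not the mathjs DenseMatrix.
class TinyMatrix {
  constructor (data, size) {
    this._data = data
    this._size = size
  }

  // Returns a copy of the dimensions, e.g. [2, 3]
  size () {
    return this._size.slice()
  }
}

const m = new TinyMatrix([[1, 2, 3], [4, 5, 6]], [2, 3])

// Without parentheses we get the method itself. A function's `length`
// is its number of declared parameters, which is 0 for size().
const wrongDepth = m.size.length

// With parentheses we get the dimensions array; its length is the depth.
const rightDepth = m.size().length

console.log(wrongDepth, rightDepth) // 0 2
```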

These are the results:

# develop

flatten(array)             0.69 µs   ±0.03%
flatten(genericMatrix)     1.83 µs   ±0.17%
flatten(numberMatrix)      1.83 µs   ±0.19%

# PR 3400

flatten(array)             0.67 µs   ±0.15%
flatten(genericMatrix)     1.47 µs   ±0.25%
flatten(numberMatrix)      1.46 µs   ±0.19%

The matrix cases are faster by 25% on my end; could you please validate?

I think the initial improvement I reported was measured against develop before #3354.

josdejong · 2025-03-05T17:57:32Z

Thanks David, I get the same results as you when running the benchmark.

I have a couple of thoughts:

1. All the matrix versions of flatten do is run flatten and then convert the result into a DenseMatrix. So the difference between the first test of the benchmark and the other two should be caused only by the create function of DenseMatrix. Since this PR only changes the source code of the low-level array function, it may be good to test it only on Array (and then test the two variants: with and without the extra knowledge of maxDepth); otherwise it's a bit like comparing apples and oranges.
2. I'm still a bit in doubt whether a 25% performance gain is worth the additional code.
3. Instead of passing a maxDepth (which can be used wrongly), it may be enough to pass a boolean flag denoting that the function can be sure that the passed array is a matrix where every row has the same length. The function can then determine the matrix size cheaply using arraySize, or just look at array[0] to decide whether to recurse further.
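
The boolean-flag idea in the last point can be sketched as follows. This is only an illustration, not the code in this PR: `flattenRectangular` is a hypothetical name, and the sketch assumes the caller guarantees that every nested array at a given depth has the same length and nesting. It inspects only the first element of each nested array instead of testing every element.

```javascript
// Hypothetical helper; assumes `array` is rectangular (every nested array
// at the same depth has the same length and the same nesting level).
function flattenRectangular (array) {
  const flat = []

  function recurse (arr) {
    if (Array.isArray(arr[0])) {
      // The first element is an array, so by assumption all of them are
      for (let i = 0; i < arr.length; i++) {
        recurse(arr[i])
      }
    } else {
      // The first element is a value, so this is the innermost dimension
      for (let i = 0; i < arr.length; i++) {
        flat.push(arr[i])
      }
    }
  }

  recurse(array)
  return flat
}

console.log(flattenRectangular([[1, 2, 3], [4, 5, 6]]))
// [ 1, 2, 3, 4, 5, 6 ]
console.log(flattenRectangular([[[1, 2], [3, 4]], [[5, 6], [7, 8]]]))
// [ 1, 2, 3, 4, 5, 6, 7, 8 ]
```

Note the trade-off: on jagged input such as `[1, [2, 3]]` this sketch returns `[1, [2, 3]]`, a partially flattened result, without raising an error. The shortcut is therefore only safe when rectangularity is already known.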

I did some experimenting too 😁, it's too tempting. The following function improves the first benchmark test, flatten(array), from 0.69 µs to 0.41 µs on my machine without much extra code or complexity (it does not need two implementations or an extra parameter). Maybe that is good enough?

export function flatten (array) {
  if (!Array.isArray(array)) {
    // if not an array, return as is
    return array
  }

  const flat = []

  function _flatten (array) {
    for (let i = 0; i < array.length; i++) {
      const item = array[i]
      if (Array.isArray(item)) {
        _flatten(item)
      } else {
        flat.push(item)
      }
    }
  }

  _flatten(array)

  return flat
}

dvd101x · 2025-03-05T20:38:17Z

Hi Jos!

Thank you for reviewing this!

1. OK, I will include a specific test on Array. I was testing only high-level code, since at a high level math.flatten doesn't distinguish between different types of arrays.
2. OK, that's a fair concern.
3. Since maxDepth is only used internally, I thought it might be OK. Initially I was working with array[0], but in my opinion the algorithm was easier to understand with maxDepth. I think using array[0] allows for another optimization that I will test and show if possible.
4. That's very nice, I will review it in depth.

I will set up a new benchmark on arrays that includes your example, compare maxDepth against array[0], and report back.

I completely understand if in the end the added complexity of a second algorithm isn't worth it. I expect something positive from comparing apples to apples in the coming days.

dvd101x · 2025-03-06T04:05:57Z

Hi Jos,

I added a new commit with your code proposal for regular arrays, and modified the code for rectangular arrays to take a boolean as an argument instead of the maximum depth.

| Test | Time PR A | Error PR A | Time Dev | Error Dev | Improvement A |
| --- | --- | --- | --- | --- | --- |
| flatten(array) | 0.44 µs | ±0.19% | 0.69 µs | ±0.04% | 36.23% |
| flattenArray(array) | 0.45 µs | ±0.25% | 0.69 µs | ±0.09% | 34.78% |
| flattenArray(array, false) | 0.44 µs | ±0.19% | 0.69 µs | ±0.05% | 36.23% |
| flattenArray(array, true) | 0.33 µs | ±0.29% | 0.69 µs | ±0.05% | 52.17% |
| flatten(genericMatrix) | 1.51 µs | ±0.16% | 1.84 µs | ±0.18% | 17.93% |
| flatten(numberMatrix) | 1.51 µs | ±0.16% | 1.82 µs | ±0.16% | 17.03% |

In summary, your code proposal is 36% faster than dev for arrays in general, and for rectangular arrays the specialized algorithm shows a 52% improvement. As you mentioned, most of that gain is lost in the creation of a Matrix; in the end the matrix cases improve by 18%.
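
The improvement column appears to be the relative reduction in time per call, (dev − PR) / dev. That reading is an assumption, but it reproduces the values in the table above from the rounded timings, as this small sketch shows:

```javascript
// Relative reduction in run time, in percent (assumed definition)
function improvement (devTime, prTime) {
  return (devTime - prTime) / devTime * 100
}

console.log(improvement(0.69, 0.44).toFixed(2)) // 36.23, flatten(array)
console.log(improvement(0.69, 0.33).toFixed(2)) // 52.17, flattenArray(array, true)
console.log(improvement(1.84, 1.51).toFixed(2)) // 17.93, flatten(genericMatrix)
```

Under this definition, the 52.17% improvement corresponds to roughly a 2.1× speedup (0.69 / 0.33).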

I also tested using your general-purpose array algorithm for matrices, and there the improvement is 14% (instead of 18%).

For comparison, here are the results of the previous commit, which changes only the algorithm for rectangular arrays and still uses maxDepth:

| Test | Time PR B | Error PR B | Time Dev | Error Dev | Improvement B |
| --- | --- | --- | --- | --- | --- |
| flatten(array) | 0.67 µs | ±0.14% | 0.69 µs | ±0.05% | 2.89% |
| flattenArray(array) | 0.67 µs | ±0.23% | 0.69 µs | ±0.10% | 2.89% |
| flattenArray(array, -1) | 0.68 µs | ±0.18% | 0.69 µs | ±0.04% | 1.44% |
| flattenArray(array, 2) | 0.33 µs | ±0.30% | 0.69 µs | ±0.04% | 52.17% |
| flatten(genericMatrix) | 1.44 µs | ±0.20% | 1.83 µs | ±0.19% | 21.31% |
| flatten(numberMatrix) | 1.43 µs | ±0.20% | 1.83 µs | ±0.20% | 21.86% |

It had the same 52% improvement for rectangular arrays. I think it is just by luck that it shows a 21% improvement for matrices (vs. 18% for the new commit).

A nice advantage of using a boolean that indicates whether the array is rectangular is that you don't need to know the size of the array, only that it is rectangular. That is the case for the function reshape, whose algorithm assumes that the array is rectangular. It could be tested how much faster reshape would become.

My conclusions:

1. Your proposal is very good; applying its logic instead of array.forEach could bring significant improvements (in this case 36%).
2. It's important to differentiate between rectangular and jagged arrays (in this case 52% vs. 36%, both with the same overall logic as your proposal).
3. This isn't the main issue to address, since the matrix creation has the biggest impact, but I don't have a proposal for that at the moment.
4. Maybe it's a good idea to differentiate between these kinds of arrays, both in code and in documentation.
5. In various PRs I introduced maxDepth because I thought it might be easier to understand, and I think I found marginal improvements; those could easily be refactored to use a boolean.
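
The contrast in the first conclusion, array.forEach versus an index loop, can be tried out with a rough sketch like the one below. `flattenForEach` is a hypothetical stand-in written for illustration, not the actual develop code, and `timeIt` is a crude timer rather than the benchmark used in this thread. It assumes a runtime with a global `performance` object (browsers and recent Node.js versions).

```javascript
// Stand-in for a forEach-based implementation (not the actual develop code)
function flattenForEach (array) {
  const flat = []
  array.forEach(function callback (value) {
    if (Array.isArray(value)) {
      value.forEach(callback) // recurse into nested arrays
    } else {
      flat.push(value)
    }
  })
  return flat
}

// Index-loop variant, following the logic proposed earlier in this thread
function flattenLoop (array) {
  const flat = []
  function recurse (arr) {
    for (let i = 0; i < arr.length; i++) {
      const item = arr[i]
      if (Array.isArray(item)) recurse(item)
      else flat.push(item)
    }
  }
  recurse(array)
  return flat
}

// Crude timer: average time per call in microseconds
function timeIt (fn, input, runs = 100000) {
  const start = performance.now()
  for (let i = 0; i < runs; i++) fn(input)
  return (performance.now() - start) * 1000 / runs
}

const jagged = [[1, 2], [3, [4, 5]], 6]
console.log('forEach:', timeIt(flattenForEach, jagged).toFixed(3), 'µs')
console.log('loop:   ', timeIt(flattenLoop, jagged).toFixed(3), 'µs')
```

Timings from such a loop are only indicative and vary between engines and machines, so they should not be compared directly with the tables above.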

Implemented function that flattens an array with a validated size (rectangular)

16451ef

dvd101x changed the title ~~Implemented function that flattens an array with a validated size (rectangular)~~ Improve performance of flatten in DenseMatrix Feb 21, 2025

fix: improve flatten function to handle maxDepth edge cases and optimize recursion

b0c6144

dvd101x added 2 commits March 3, 2025 19:35

Merge branch 'develop' into faster-denseMatrix-flatten

bbb3c72

Fixed issue with incorrect maxDepth

947883e

dvd101x added 2 commits March 5, 2025 16:18

Changed maxDepth in favor of isHomogeneous

2d3c77b

Added boolean check

4c115c1
