-
-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Start of Series.index_of(), for primitive numeric types #19894
base: main
Are you sure you want to change the base?
Conversation
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #19894 +/- ##
===========================================
+ Coverage 59.38% 79.52% +20.14%
===========================================
Files 1554 1558 +4
Lines 215612 216644 +1032
Branches 2452 2456 +4
===========================================
+ Hits 128035 172283 +44248
+ Misses 87019 43804 -43215
+ Partials 558 557 -1 ☔ View full report in Codecov by Sentry. |
dtype if dtype.is_unsigned_integer() => value.cast(&DataType::UInt64), | ||
dtype if dtype.is_float() => value.cast(&DataType::Float64), | ||
DataType::Null => AnyValue::Null, | ||
_ => unimplemented!("index_of() not yet supported for dtype {:?}", value_dtype), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The other types (except boolean) can go into the row-encoding and use the branch for binary data.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't really understand what this means, could you expand a little? (I will probably do those in separate PR, but still good to start that with the right approach to save time.)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think it should be in one go, as I think it can all go to a single generic impl.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The row-encoding is used in multi-key group-bys/joins and joins on nested types. I would recommend reading in the join dispatch. Can give a bit more pointers, if you get lost. :)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OK will go read.
First part of #5503.
Next steps, in follow-up PRs: