Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow uint32 dtype for coords #249

Open
brendancol opened this issue Apr 15, 2019 · 4 comments
Open

Allow uint32 dtype for coords #249

brendancol opened this issue Apr 15, 2019 · 4 comments
Labels
enhancement Indicates new feature requests

Comments

@brendancol
Copy link

Hey awesome library! I'm really having a blast with it.

One question / comment:

For my purposes, I need the COO array with the largest dtype for coords of uint32.

When I create an array, the COO constructor seems to convert all coord arrays to intp or int64 in my case.
https://github.com/pydata/sparse/blob/master/sparse/coo/core.py#L234

Any thoughts on skipping that astype if coords is already an array?

@hameerabbasi
Copy link
Collaborator

Hey!

awesome library! I'm really having a blast with it.

Thanks!

For my purposes, I need the COO array with the largest dtype for coords of uint32.

When I create an array, the COO constructor seems to convert all coord arrays to intp or int64 in my case.
https://github.com/pydata/sparse/blob/master/sparse/coo/core.py#L234

Any thoughts on skipping that astype if coords is already an array?

Unfortunately, we'd love to. In the past, we converted coords to the smallest possible dtype that could hold the data. We had issues with this (overflows and such), two separate bugs were filed that were a result of this, and I discovered a bug or two myself.

After going through the "whack-a-mole" process of squashing bugs left and right, I just decided to make it int64.

If there was a coordinated effort to examine each line of code that assigns to coords for overflows and fix it... I'd be all for it. There have been requests for this in the past.

@hameerabbasi hameerabbasi added the enhancement Indicates new feature requests label Apr 16, 2019
@hameerabbasi
Copy link
Collaborator

However, if we work on #125 or something along those lines, that'd solve your issues, to my understanding, as it has better compression efficiency in the majority of cases.

@eric-wieser
Copy link
Contributor

I've found myself wanting uint16 here too.

In the past, we converted coords to the smallest possible dtype that could hold the data. We had issues with this (overflows and such), two separate bugs were filed that were a result of this, and I discovered a bug or two myself.

Could you link to the commit where you removed this, and to some of the issues?

@hameerabbasi
Copy link
Collaborator

#32 is one such issue, and 1447e0f is the commit where this was removed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Indicates new feature requests
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants