You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The text was updated successfully, but these errors were encountered:
betatim
changed the title
Improve API uniformity and features across the ecosystem
Improve API uniformity and features across the ecosystem related to parallelism
May 7, 2024
Updated the title to include "parallelism", it seemed a bit broad without it :D
stefanv
changed the title
Improve API uniformity and features across the ecosystem related to parallelism
Improve parallel API uniformity and features across the ecosystem
May 7, 2024
jarrodmillman
changed the title
Improve parallel API uniformity and features across the ecosystem
SPEC: Improve parallel API uniformity and features across the ecosystem
May 7, 2024
Here are the overarching questions around a parallelism API:
What should the keyword parameter be? workers, n_jobs, etc.
Should it be added to every function call that does anything parallel? For example, np.linalg.matmul(..., workers=?)
What to do with operators that are not function calls? A_array @ B_array is parallel. The solution is a context manager like threadpoolctl.
Should libraries configure each other when it comes to parallelism? scikit-learn will prevent oversubscription with NumPy BLAS calls + joblib's n_jobs by using threadpoolctl.
See https://thomasjpfan.github.io/parallelism-python-libraries-design/
Across libraries, we should have standard mechanisms and naming for parallel concepts and features.
See also https://discuss.scientific-python.org/t/terminology-for-parameters-controlling-parallel-computation/1016/5
The text was updated successfully, but these errors were encountered: