-
Notifications
You must be signed in to change notification settings - Fork 190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEA]: Implement cuda::kernel_ref
and cuda::function_ref
#2993
Comments
I like the suggstion. @pciolkosz is also working on wrapping some of the driver APIs, so let's see what he thinks of the suggestion. |
We will definitely need something like this for dynamic loading. I'm not sure yet if we want only kernel/library or function/module as well. |
In case we want function and modules as well, I was wondering whether it would be a good idea to rename the |
I'd like to take on this, cc: @jrhemstad |
I have the implementation almost finalized missing overloads for If you want to finalize it yourself, feel free to copy the branch :) |
@davebayer I took a quick look at the linked branch. I asked around and thought about the function/module and I think we should start without them and see if anyone complains. Another thing I wanted to improve are kernel attributes. I have seen multiple bugs resulting from races between different threads updating the shared memory size related attributes. Ideally these would be read-only and instead part of a kernel configuration if non-default value is needed. But that would require changes to |
Is this a duplicate?
Area
CUDA Experimental (cudax)
Is your feature request related to a problem? Please describe.
CUDAX already provides C++ wrappers for streams, devices, events and others. It does not provide wrappers for kernels and functions.
Describe the solution you'd like
I'd like to expand the API of wrappers for
CUkernel
andCUfunction
implemented askernel_ref
andfunction_ref
. Both are non-owning references to the original types implemented via the CUDA Driver API.In the future return values of
kernel_ref::get_library()
andfunction_ref::get_module()
could be replaced withlibrary_ref
andmodule_ref
if implemented.Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: