Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for the new CUDA virtual memory management functions for shared memory. #4538

Open
Reznic opened this issue Jun 22, 2022 · 6 comments
Labels
enhancement New feature or request

Comments

@Reznic
Copy link

Reznic commented Jun 22, 2022

Hi,
I'm trying to use the Triton server on Jetson platform (Jetpack 5).
Previously, before the jetson, we used Triton server via the grpc client, and passed a Cuda shared memory handle, allocated with the cuda IPC API.
As I understand, the cuIPC functions are not supported on the Jetson, and instead, I have to use the new CUDA virtual memory management functions: cuMemExportToShareableHandle
As described here

Currently, cuda shared memory registration on the Triton server is only implemented for the cuda-IPC memory handle.
(in RegisterCUDASharedMemory method at shared_memory_manager.cpp).

  1. Does it mean that the only current option, in Jetson platform, to pass input tensors to the Triton server, is via system shared memory?

  2. Is supporting cuda shared memory with the new memory management API, on your roadmap?
    E.g. Implementation of RegisterCUDASharedMemory which uses cuMemImportFromShareableHandle function, and gRPC client support for it.
    If so, when do you plan on releasing it?

Thanks very much

@dyastremsky dyastremsky added the question Further information is requested label Jun 22, 2022
@dyastremsky
Copy link
Contributor

I believe you are correct that system shared memory is the way to pass input tensors. I'm not sure that it's on our roadmap yet, though we could file a feature request.

@CoderHam may be able to provide additional information.

@Tabrizian
Copy link
Member

Thanks for your feature request. There is a ticket on our backlog for the same but it has not been priotorized yet.

@Tabrizian Tabrizian added enhancement New feature or request and removed question Further information is requested labels Jul 8, 2022
@Reznic
Copy link
Author

Reznic commented Jan 23, 2023

Hi, @Tabrizian @dyastremsky
Is this feature on your roadmap?
Thanks

@dyastremsky
Copy link
Contributor

Thanks for checking. Yes, it is.

@hackerliang
Copy link

Hi, @Tabrizian @dyastremsky
What's the release date of this enhancement?
Thanks

@dyastremsky
Copy link
Contributor

We have not yet announced a public release date.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Development

No branches or pull requests

4 participants