Support for the new CUDA virtual memory management functions for shared memory. #4538

Reznic · 2022-06-22T14:38:01Z

Hi,
I'm trying to use the Triton server on the Jetson platform (JetPack 5).
Before moving to Jetson, we used the Triton server via the gRPC client and passed a CUDA shared memory handle allocated with the CUDA IPC API.
As I understand it, the CUDA IPC functions are not supported on Jetson, and I have to use the new CUDA virtual memory management functions instead, e.g. cuMemExportToShareableHandle,
as described here.

Currently, CUDA shared memory registration on the Triton server is only implemented for the CUDA IPC memory handle
(in the RegisterCUDASharedMemory method in shared_memory_manager.cpp).

Does this mean that the only current option on the Jetson platform for passing input tensors to the Triton server is system shared memory?
Is support for CUDA shared memory with the new memory management API on your roadmap?
E.g. an implementation of RegisterCUDASharedMemory that uses the cuMemImportFromShareableHandle function, plus gRPC client support for it.
If so, when do you plan to release it?

Thanks very much

dyastremsky · 2022-06-22T16:32:25Z

I believe you are correct that system shared memory is the way to pass input tensors. I'm not sure that it's on our roadmap yet, though we could file a feature request.
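
For readers who want to see the system shared memory route in concrete terms, here is a minimal sketch using only the Python standard library. It shows the client-side half: packing a float32 tensor into a named POSIX shared-memory region and reading it back by name. The helper names (`write_tensor_to_shm`, `read_tensor_from_shm`) and the region name are hypothetical, and the Triton registration step is only described in a comment as an assumption, since it needs a running server.

```python
import struct
from multiprocessing import shared_memory


def write_tensor_to_shm(name, values):
    """Create a named shared-memory region and pack float32 values into it.

    Returns the region object and the number of payload bytes written.
    The payload size is returned separately because the OS may round the
    region size up to a page boundary.
    """
    payload = struct.pack(f"<{len(values)}f", *values)
    region = shared_memory.SharedMemory(name=name, create=True, size=len(payload))
    region.buf[:len(payload)] = payload
    return region, len(payload)


def read_tensor_from_shm(name, count):
    """Attach to an existing region by name and unpack `count` float32 values."""
    region = shared_memory.SharedMemory(name=name)
    try:
        return list(struct.unpack_from(f"<{count}f", region.buf, 0))
    finally:
        region.close()


# Assumption (not executed here): with the Triton Python client, the region
# would then be registered on the server with something like
# client.register_system_shared_memory(region_name, shm_key, byte_size),
# where shm_key is the POSIX key (on Linux, "/" followed by the name above).
```

The creating process owns the region and should call `region.close()` and `region.unlink()` once inference is done. Unlike the CUDA IPC path, this route puts the tensor in host memory, so the server would still have to copy it to the GPU.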

@CoderHam may be able to provide additional information.

Tabrizian · 2022-07-08T19:05:31Z

Thanks for your feature request. There is a ticket on our backlog for this, but it has not been prioritized yet.

Reznic · 2023-01-23T16:19:37Z

Hi, @Tabrizian @dyastremsky
Is this feature on your roadmap?
Thanks

dyastremsky · 2023-01-23T16:49:14Z

Thanks for checking. Yes, it is.

hackerliang · 2023-05-11T05:46:24Z

Hi, @Tabrizian @dyastremsky
What's the release date of this enhancement?
Thanks

dyastremsky · 2023-05-19T03:50:33Z

We have not yet announced a public release date.

dyastremsky added the "question" label (Further information is requested) · Jun 22, 2022

Tabrizian added the "enhancement" label (New feature or request) and removed the "question" label (Further information is requested) · Jul 8, 2022
