-
Notifications
You must be signed in to change notification settings - Fork 200
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Question] Difference between Embedding Training Cache and GPU Embedding Cache #424
Comments
Hi @hsezhiyan
Thanks, |
Thank you for the response @minseokl In that case, will ETC (which is under deprecation) be replaced by GPU Embedding Cache for training cases? Because it looks like GPU Embedding Cache can be used for both inference and training |
@hsezhiyan |
Thank you for the answer @yingcanw! I'd like to ask a few followup questions:
|
@minseokl if you also have any insights, I would appreciate it! |
@sezhiyanhari Sorry for the late reply. |
Hi, could you provide an example script about training using HKV and SOK? I am a little confused about how HKV could replace ETC because as far as I know, HKV is a single GPU key-value store. Could it eliminate the Parameter Server in ETC? Any insights are appreciated. |
Hi @lausannel , HKV is a key-value store that uses GPU + CPU memory, where the memory for values can be stored either on the GPU or on the CPU. |
@kanghui0204 Thanks for your explaination! |
What is the difference between the Embedding Training Cache (https://github.com/NVIDIA-Merlin/HugeCTR/tree/main/HugeCTR/src/embedding_training_cache) and the GPU Embedding Cache (https://github.com/NVIDIA-Merlin/HugeCTR/tree/main/gpu_cache)?
It appears as if the Embedding Training Cache is used only during training. Does it use the GPU Embedding Cache under the hood?
The text was updated successfully, but these errors were encountered: