A fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.
Updated Apr 7, 2025 - Python