Real Time Demo that allows natural conversations #91

freddyaboulton · 2024-10-31T16:01:26Z

Overview

This PR adds an interactive demo that enables natural, continuous conversations with Qwen2-Audio. Users talk to the model through their microphone, and a response is generated automatically when they finish speaking. This makes the model more accessible and more natural to interact with.

Key Features

Real-time audio streaming using WebRTC
Automatic speech detection and processing
Support for both local and cloud deployment
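
To illustrate what "automatic speech detection" involves, here is a minimal sketch of pause detection based on frame energy. It is not the detection logic used in this PR, which is not described here. The function names, the threshold, and the frame counts are all hypothetical values chosen for illustration.

```python
import math
from typing import List, Sequence


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one audio frame (samples in [-1.0, 1.0])."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def pause_detected(
    frames: List[Sequence[float]],
    energy_threshold: float = 0.01,  # hypothetical value, tune per microphone
    min_silent_frames: int = 5,      # hypothetical value, sets the pause length
) -> bool:
    """Return True once speech has been heard and is followed by enough silence."""
    heard_speech = False
    silent_run = 0
    for frame in frames:
        if rms(frame) >= energy_threshold:
            # Loud frame: the user is (still) speaking, so reset the silence counter.
            heard_speech = True
            silent_run = 0
        elif heard_speech:
            # Quiet frame after speech: count it toward the pause.
            silent_run += 1
            if silent_run >= min_silent_frames:
                return True
    return False
```

In a streaming setup, a check like this would run on each incoming chunk, and the buffered audio would be sent to the model once it returns True. A production system would more likely use a trained voice activity detector than a fixed energy threshold.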

Dependencies

Added requirements:

gradio-webrtc (a Gradio custom component that enables real-time audio/video streaming). Disclaimer: I am the author of this extension.
twilio (optional, for cloud deployment)
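
Because twilio is optional, the demo can presumably switch between local and cloud setups at startup. The sketch below shows one way that switch might look, assuming credentials are supplied through the `TWILIO_ACCOUNT_SID` and `TWILIO_AUTH_TOKEN` environment variables. The helper name `build_rtc_configuration` is hypothetical, and the assumption that the returned dictionary is handed to the streaming component as its WebRTC configuration is mine, not something stated in this PR.

```python
import os
from typing import Optional


def build_rtc_configuration() -> Optional[dict]:
    """Return a WebRTC ICE configuration for cloud use, or None for local use.

    Hypothetical helper: the environment variable names and the shape of the
    returned dictionary are assumptions made for this sketch.
    """
    account_sid = os.environ.get("TWILIO_ACCOUNT_SID")
    auth_token = os.environ.get("TWILIO_AUTH_TOKEN")
    if not (account_sid and auth_token):
        # No credentials: assume a local run, where no TURN relay is needed.
        return None

    # Imported lazily so that twilio stays an optional dependency.
    from twilio.rest import Client

    # Ask Twilio for short-lived TURN/STUN server credentials.
    token = Client(account_sid, auth_token).tokens.create()
    return {"iceServers": token.ice_servers, "iceTransportPolicy": "relay"}
```

Keeping the twilio import inside the function means a local user never needs to install the package.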

Demo

qwen2-audio.mp4

There is some delay in processing the response because the demo has to acquire a shared GPU on Hugging Face Spaces. On dedicated hardware it should be much faster, but I don't have the GPUs to verify this myself.

Real time Gradio demo

7f8d4ad
