Gemini Realtime WebRTC

Gemini Realtime API over WebRTC, similar to the OpenAI Realtime API with WebRTC.

Features

Real-time voice communication with Gemini AI
High-quality audio processing:
- 48kHz sample rate support
- Opus codec for efficient audio compression
- Automatic audio resampling
- Smart audio buffering with 100ms accumulation
WebRTC-based communication:
- Low-latency audio streaming
- Reliable data channel for text messages
Debug capabilities:
- Configurable audio dumping for all streams
- Detailed logging
- PCM file format support
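
The 100ms accumulation mentioned above can be illustrated with a small buffer that collects incoming PCM frames and releases them in fixed-size chunks. This is a minimal sketch, not the code in pkg/audio: the Accumulator type, its methods, and the 20 ms mono frame size used in main are hypothetical names and values chosen for illustration. At 48kHz mono, 100ms corresponds to 4800 samples.

```go
package main

import "fmt"

// Accumulator is a hypothetical helper (not taken from the repository) that
// collects 16-bit PCM samples and releases them in fixed-size chunks.
type Accumulator struct {
	chunkSamples int     // samples per emitted chunk
	pending      []int16 // samples still waiting for a full chunk
}

// NewAccumulator sizes chunks from a sample rate in Hz, a channel count,
// and a chunk duration in milliseconds.
func NewAccumulator(sampleRate, channels, chunkMs int) *Accumulator {
	return &Accumulator{chunkSamples: sampleRate * channels * chunkMs / 1000}
}

// Write appends samples and returns every complete chunk now available.
// Leftover samples stay buffered until the next call.
func (a *Accumulator) Write(samples []int16) [][]int16 {
	a.pending = append(a.pending, samples...)
	var chunks [][]int16
	for a.chunkSamples > 0 && len(a.pending) >= a.chunkSamples {
		chunk := make([]int16, a.chunkSamples)
		copy(chunk, a.pending[:a.chunkSamples])
		chunks = append(chunks, chunk)
		a.pending = a.pending[a.chunkSamples:]
	}
	return chunks
}

func main() {
	acc := NewAccumulator(48000, 1, 100) // 100 ms at 48 kHz mono
	frame := make([]int16, 960)          // assumed 20 ms input frame
	emitted := 0
	for i := 0; i < 12; i++ {
		emitted += len(acc.Write(frame))
	}
	fmt.Println("chunks emitted:", emitted)
}
```

Sending fewer, larger chunks upstream reduces per-message overhead at the cost of up to one chunk duration of added latency, which is the usual trade-off behind this kind of buffering.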

Prerequisites

Go 1.21 or higher
FFmpeg libraries (for audio processing)
Opus codec library
Google API Key for Gemini AI

Installation

Install system dependencies:

# For Debian/Ubuntu
apt-get install pkg-config libopus-dev libavcodec-dev libavformat-dev libavutil-dev libswresample-dev

# For macOS
brew install opus ffmpeg

Clone the repository:

git clone https://github.com/realtime-ai/gemini-realtime-webrtc.git
cd gemini-realtime-webrtc

Install Go dependencies:

go mod download

Configuration

Set up environment variables:

# Required
export GOOGLE_API_KEY=your_api_key_here

# Optional (for audio debugging)
export DUMP_SESSION_AUDIO=true  # Dump AI response audio
export DUMP_REMOTE_AUDIO=true   # Dump user input audio
export DUMP_LOCAL_AUDIO=true    # Dump playback audio
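
As an illustration of how these variables might be consumed, the sketch below reads them into a struct and fails when the required key is missing. The Config type, LoadConfig, and envBool are hypothetical names, and parsing the flags with strconv.ParseBool is an assumption; the server's actual parsing may differ.

```go
package main

import (
	"errors"
	"fmt"
	"os"
	"strconv"
)

// Config is a hypothetical container for the variables listed above.
type Config struct {
	APIKey           string
	DumpSessionAudio bool // AI response audio
	DumpRemoteAudio  bool // user input audio
	DumpLocalAudio   bool // playback audio
}

// envBool reports whether the named variable parses as a true boolean.
// Unset or unparsable values count as false (an assumption of this sketch).
func envBool(getenv func(string) string, name string) bool {
	v, err := strconv.ParseBool(getenv(name))
	return err == nil && v
}

// LoadConfig builds a Config from a lookup function such as os.Getenv.
func LoadConfig(getenv func(string) string) (Config, error) {
	key := getenv("GOOGLE_API_KEY")
	if key == "" {
		return Config{}, errors.New("GOOGLE_API_KEY is not set")
	}
	return Config{
		APIKey:           key,
		DumpSessionAudio: envBool(getenv, "DUMP_SESSION_AUDIO"),
		DumpRemoteAudio:  envBool(getenv, "DUMP_REMOTE_AUDIO"),
		DumpLocalAudio:   envBool(getenv, "DUMP_LOCAL_AUDIO"),
	}, nil
}

func main() {
	cfg, err := LoadConfig(os.Getenv)
	if err != nil {
		fmt.Fprintln(os.Stderr, "config error:", err)
		return
	}
	fmt.Printf("dump session=%t remote=%t local=%t\n",
		cfg.DumpSessionAudio, cfg.DumpRemoteAudio, cfg.DumpLocalAudio)
}
```

Passing the lookup function as a parameter keeps the loader easy to exercise in tests without touching the real process environment.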

Running the Application

Start the server:

go run main.go

Open the web client:
- Open tests/gemini_realtime_webrtc.html in your browser
- Click "Connect" to establish the WebRTC connection
- Allow microphone access when prompted

Architecture

pkg/gateway: WebRTC server and connection management
pkg/audio: Audio processing utilities
- Resampling between different sample rates
- Audio buffering with smart accumulation
- PCM/WAV file handling
pkg/utils: Common utilities and helper functions
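
To make the PCM/WAV handling concrete, the sketch below wraps raw 16-bit little-endian PCM in a canonical 44-byte WAV header so that a dump can be opened in an ordinary audio player. WrapPCM is a hypothetical function written for this example, not the API of pkg/audio, and it assumes the PCM data is 16-bit.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"fmt"
)

// WrapPCM (hypothetical, for illustration) prepends a canonical 44-byte WAV
// header to raw 16-bit little-endian PCM data.
func WrapPCM(pcm []byte, sampleRate, channels int) []byte {
	const bitsPerSample = 16
	blockAlign := channels * bitsPerSample / 8 // bytes per sample frame
	byteRate := sampleRate * blockAlign        // bytes per second

	le := binary.LittleEndian
	var buf bytes.Buffer
	// Writes of fixed-size values to a bytes.Buffer do not fail,
	// so the returned errors are ignored in this sketch.
	buf.WriteString("RIFF")
	binary.Write(&buf, le, uint32(36+len(pcm))) // size of everything after this field
	buf.WriteString("WAVE")
	buf.WriteString("fmt ")
	binary.Write(&buf, le, uint32(16)) // fmt chunk size for PCM
	binary.Write(&buf, le, uint16(1))  // audio format 1 = linear PCM
	binary.Write(&buf, le, uint16(channels))
	binary.Write(&buf, le, uint32(sampleRate))
	binary.Write(&buf, le, uint32(byteRate))
	binary.Write(&buf, le, uint16(blockAlign))
	binary.Write(&buf, le, uint16(bitsPerSample))
	buf.WriteString("data")
	binary.Write(&buf, le, uint32(len(pcm)))
	buf.Write(pcm)
	return buf.Bytes()
}

func main() {
	// 100 ms of silence at 48 kHz mono: 4800 samples, 2 bytes each.
	silence := make([]byte, 4800*2)
	wav := WrapPCM(silence, 48000, 1)
	fmt.Println("wav bytes:", len(wav))
}
```

The sample rate and channel count must match how the dump was recorded; a mismatch does not corrupt the file but makes it play at the wrong speed or pitch.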

Development

Building from source:

go build -o server

Running tests:

go test ./...

Contributing

Fork the repository
Create your feature branch
Commit your changes
Push to the branch
Create a new Pull Request
