This repository contains starter templates for building AI agents using the Pipecat framework and deploying them to Pipecat Cloud.
- voice - Voice conversation agent with STT, LLM and TTS
- twilio - Telephony agent that works with Twilio
- natural_conversation - Text chat agent focused on natural dialogue
- openai_realtime - Agent using OpenAI's streaming capabilities
- gemini_multimodal_live - Multimodal agent using Google's Gemini models
- vision - Computer vision agent that can analyze images
Each starter contains:
- A functioning
bot.py
file with a complete agent implementation - A Dockerfile for containerization
- Required dependencies in requirements.txt
- A targeted README with specific customization options
- Docker installed on your system
- Pipecat Cloud account
- Python 3.10+
For detailed instructions on building, deploying, and running your agent, please refer to the Pipecat Cloud documentation.
-
Choose a starter: Navigate to the starter that best fits your use case
-
Customize your agent: Modify the
bot.py
file to change the behavior of your agent -
Build the Docker image:
docker build --platform=linux/arm64 -t my-agent:latest .
-
Push to a container registry:
docker tag my-agent:latest your-repository/my-agent:latest docker push your-repository/my-agent:latest
-
Deploy to Pipecat Cloud:
pipecat deploy agent-name your-repository/my-agent:latest --secrets my-secrets
-
Start a session:
pipecat agent start agent-name --use-daily
Different starters require various API keys, which should be added to your Pipecat Cloud secrets.
pipecat secrets set my-secrets \
OPENAI_API_KEY=sk-... \
DEEPGRAM_API_KEY=... \
CARTESIA_API_KEY=...
For more information on the Pipecat framework and Pipecat Cloud:
This project is licensed under the BSD 2-Clause License - see the LICENSE file for details.