Avatar Chatbot Project

Description

This project creates an interactive voice-based chatbot with a visual avatar. The bot listens to user input, generates responses using the LLaMA model, and speaks responses out loud using eSpeak. The avatar's visual behavior switches between idle and speaking modes depending on the bot's activity.

Features

Voice Input: Users can speak into the microphone to interact with the bot.
Response Generation: LLaMA model generates conversational and concise responses.
Voice Output: Responses are spoken aloud using eSpeak.
Visual Avatar: A video of an avatar is displayed, switching between idle and speaking modes during interactions.
Vosk Model: Speech-to-text functionality is powered by the Vosk model, providing real-time voice input recognition.

Requirements

Software Dependencies:

Python 3.8+
eSpeak
Vosk
Pyaudio
PyGame
PyAv
Ollama (LLaMA)

Install Required Python Libraries:

pip install -r requirements.txt

Install eSpeak:

Linux:
```
sudo apt-get install espeak
```
Windows: Download and install eSpeak from the official site.

Install Ollama and LLaMA Model

Download Ollama:
- Visit Ollama's download page to download the appropriate version for your operating system.
Install Ollama:
- macOS:
```
brew install ollama/tap/ollama
```
- Windows/Linux: Follow the installation guide after downloading the binary from the Ollama download page.
Download the LLaMA 3.2 model: After installing Ollama, download the LLaMA 3.2 model by running:
```
ollama pull llama3.2
```
Start Ollama: To use Ollama in the project, you need to start the Ollama server:
```
ollama serve
```

How to Run

Clone the project:

git clone https://github.com/SabaSyed/SpeechAvatarBot.git
cd SpeechAvatarBot

Ensure all dependencies are installed (see requirements above).
Run the app.py script:
```
python app.py
```
Interaction:
- The bot will listen for user input.
- It will generate a response using LLaMA and speak it using eSpeak.
- The avatar's video will switch between idle and speaking videos based on activity.

Project Structure

├── idle.mp4                # Idle video for avatar
├── speaking.mp4            # Speaking video for avatar
├── app.py                  # Main script for chatbot functionality
├── requirements.txt        # File containing python libraries to be installed
└── README.md               # Project documentation

Known Issues

Ensure that the audio device is correctly configured for Vosk to capture input and for eSpeak to produce audio.
The performance may vary depending on system resources, especially with LLaMA model response generation.
Make sure that the Ollama server is running before starting the bot.

Acknowledgments

Vosk for speech-to-text recognition.
Ollama's LLaMA for response generation.
eSpeak for quick and efficient text-to-speech conversion.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
vosk-model-small-en-us-0.15		vosk-model-small-en-us-0.15
README.md		README.md
app.py		app.py
optimized.py		optimized.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Avatar Chatbot Project

Description

Features

Requirements

Software Dependencies:

Install Required Python Libraries:

Install eSpeak:

Install Ollama and LLaMA Model

How to Run

Project Structure

Known Issues

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Languages

SabaSyed/SpeechAvatarBot

Folders and files

Latest commit

History

Repository files navigation

Avatar Chatbot Project

Description

Features

Requirements

Software Dependencies:

Install Required Python Libraries:

Install eSpeak:

Install Ollama and LLaMA Model

How to Run

Project Structure

Known Issues

Acknowledgments

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages