Skip to content

Speech Translate app and server that can generate audio based on your own voice

Notifications You must be signed in to change notification settings

wayne-xyz/Voice-Translator

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

77 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Voice Translator App and Server

License

Overview

The Voice Translator App and Server is a project that enables users to interactively translate spoken language using their own voice. It provides a seamless communication experience by allowing users to engage in real-time conversations with people who speak different languages.

Features

  • Voice Recognition: This App can recognize user's voice and turn to the text content.
  • Language Translation: This App can send user's speak content to server to processing translation, At present, only Chinese and English are supported.
  • Real-time Interaction: It has a relatively high response speed and supports real-time face-to-face conversations for two individuals.
  • Voice mimic: Generated translation is baed on user own voice. It utilize the ML on Server.

Framwork & tools

iOS

Swift, Custom UIView, Speech framework,Grand Central Dispatch(GCD), AVFoundation.

Server

Python, Fastapi, Pytorch, Cuda, XTTS, argostranslate.

Showcase

  • Custom animation when you say something ,the circle will be changed by your voice power

  • Custom animation when data is loading the ring will turn circle

![Video]

Usage

  1. Support Python 3.11.5
  2. Support iOS 15.6
  3. Additionally need XTTS-v2 folder

About

Speech Translate app and server that can generate audio based on your own voice

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Swift 70.7%
  • Python 28.0%
  • Shell 1.3%