[NEW RELEASE] Introducing MARS5, open-source, insanely prosodic text-to-speech (TTS) model. #251
Replies: 2 comments
-
Certainly takes some guts to shoutout a personal tool on someone else's repo. However, This MARS seems to be built on similar methods to Tortoise and XTTS. So it is old, and does not use phonetic text for proper pronunciations. Additionally, it lacks a means to train or finetune your own model with the source available (Readme lacks the steps entirely). This puts it in the same realm as Bark (suno-ai) and ElevelLabs. Not end user friendly unless you use the paid service. The only relation to StyleTTS2 is that it is a TTS model. |
Beta Was this translation helpful? Give feedback.
-
Why are you not here? |
Beta Was this translation helpful? Give feedback.
-
Hey community members! 👋
We, at CAMB.AI, are super stoked to announce the open source release of MARS5, a new speech emulation model that is able to replicate even extremely tough prosody like sports commentary, anime, movies with just a few seconds of audio reference.
Check out our release: https://github.com/Camb-ai/MARS5-TTS
Watch our demo here:
337749366-3e191508-e03c-4ff9-9b02-d73ae0ebefdd.mp4
and the full release video: https://www.youtube.com/watch?v=bmJSLPYrKtE
We're excited to hear feedback and see the community build on top of it!
Quick links:
Discord: https://discord.gg/ZzsKTAKM
Github: https://www.github.com/camb-ai/mars5-tts
Website: https://www.camb.ai/
Youtube: https://www.youtube.com/@camb-ai
Beta Was this translation helpful? Give feedback.
All reactions