Fun thought exercise for learning more about AI: I've been using AI to help me come up with prompts for AI image generation (in a separate app). I'll explain what I like or don't, and then iterate again. It really helps create renderings that resemble my vision.
Introducing OuteTTS 0.3 1B & 500M 🔥 > Zero-shot voice cloning > Multilingual (en, jp, ko, zh, fr, de) > Trained on 20,000 hours of audio > Powered by OLMo-1B & Qwen 2.5 0.5B > Speed & emotion control > Funded by HF grants 🤗 Open science ftw! https://t.co/Tj3LswgQXC
Hailuo MiniMax: Subject Reference. I've been experimenting with expressions and emotions using a consistent character, and I'm getting impressive results. Prompt below: https://t.co/RH4AiXnUoV
OuteAI has launched OuteTTS 0.3 in two sizes: 1B and 500M. This speech synthesis model is multilingual, supporting English, Japanese, Korean, Chinese, French, and German. It is trained on 20,000 hours of audio and offers zero-shot voice cloning along with speed and emotion control. The models are built on OLMo-1B and Qwen 2.5 0.5B, and the project received funding through Hugging Face grants. Separately, Hailuo AI has introduced a new text-to-voice feature that also supports zero-shot voice cloning, drawing comparisons to F5 TTS, a leading open-source solution. Users are experimenting with various AI voice models, including Hailuo AI's T2A-01-HD, which supports emotional expression in voice cloning.