Nov 5, 01:27 AM

Oute AI Launches OuteTTS-0.1-350M Model on LLaMa Architecture for Instant Voice Cloning, Trained on 700K Hours of Audio

Oute AI has launched OuteTTS-0.1-350M, a new text-to-speech (TTS) synthesis model that utilizes a pure language modeling approach without the need for external adapters. This model is built on the LLaMa architecture and features zero-shot voice cloning capabilities, allowing users to clone voices instantly without prior training. It operates on-device using llama.cpp, making it accessible and efficient for users. Additionally, a new speech-to-speech model called Fish Agent v0.1 3B has been introduced by FishAudio, which was trained on 700,000 hours of multilingual audio. This model also supports zero-shot voice cloning and can process both text and audio inputs, providing ultra-fast results.

#Oute AI #Oute #LLaMa #Fish Agent #FishAudio

Written with ChatGPT (GPT-4o mini).

Sources

Wes Roth@WesRothMoney
2 years ago
OuteAI’s new Smol TTS, OuteTTS-0.1-350M, lets you clone voices instantly without training. Built on LLaMa technology and free to use, it runs smoothly on your device. https://t.co/YaOKSo5j5u
Vaibhav (VB) Srivastav@reach_vb
2 years ago
Wow! New Speech to Speech model - Fish Agent v0.1 3B by @FishAudio 🔥 > Trained on 700K hours of multilingual audio > Continue-pretrained version of Qwen-2.5-3B-Instruct for 200B audio & text tokens > Zero-shot voice cloning > Text + audio input/ Audio output > Ultra-fast… https://t.co/UvdwxGUm4w
Vlad Ruso PhD@vlruso
2 years ago
OuteTTS-0.1-350M Released: A Novel Text-to-Speech (TTS) Synthesis Model that Leverages Pure Language Modeling without External Adapters https://t.co/O8fJF6Rpfg #TextToSpeech #AIAdvancements #OuteTTS #VoiceCloning #SpeechSynthesis #ai #news #llm #ml #research #ainews #innovati… https://t.co/Wc6slvIR6P

Additional media

Image #1 for story oute-ai-launches-outetts-0-1-350m-model-on-llama-architecture-instant-voice-on-e53f0c79

Oute AI Launches OuteTTS-0.1-350M Model on LLaMa Architecture for Instant Voice Cloning, Trained on 700K Hours of Audio

Sources

Additional media

Similar Stories