Kokoro v1.0, a new text-to-speech (TTS) model with 82 million parameters, has been released under the Apache 2.0 license. The multilingual model supports English, Spanish, French, Italian, Japanese, and Mandarin. It runs fully offline and locally in the web browser, using WebGPU for acceleration, which makes it fast enough for real-time speech generation: users can generate 10 seconds of speech in approximately one second, at a cost of $0. Integration is described as requiring only five lines of code. Early user feedback indicates that it is much faster than earlier attempts to run Kokoro in the browser without WebGPU.
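The throughput figure in the summary can be restated as a real-time factor (RTF), the ratio of compute time to audio duration. The following is a small TypeScript sketch of that arithmetic only; the function names are illustrative and are not part of Kokoro or any library.

```typescript
// Real-time factor: seconds of compute spent per second of audio produced.
// Values below 1 mean synthesis finishes faster than the audio plays back.
function realTimeFactor(audioSeconds: number, computeSeconds: number): number {
  if (audioSeconds <= 0 || computeSeconds <= 0) {
    throw new RangeError("durations must be positive");
  }
  return computeSeconds / audioSeconds;
}

// Inverse view: seconds of audio produced per second of compute.
function speedup(audioSeconds: number, computeSeconds: number): number {
  return 1 / realTimeFactor(audioSeconds, computeSeconds);
}

// The reported figure: about 10 seconds of speech in about 1 second.
const rtf = realTimeFactor(10, 1);
const factor = speedup(10, 1);
console.log(`RTF ${rtf}, about ${Math.round(factor)}x faster than real time`);
// prints "RTF 0.1, about 10x faster than real time"
```

An RTF below 1 is what leaves headroom for streaming playback; the reported numbers are approximate, so the result should be read as an order of magnitude rather than a benchmark.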
This is amazing! I did try kokoro js without WebGPU before and it was way too slow. But now it's almost instant. Crazy! Great work @xenovacom and everyone involved! https://t.co/xWE64K61LW
Five lines of code is all you need to get SoTA TTS directly within your browser! https://t.co/DGLglE2jUe https://t.co/ZQloOef02g
NEW: Kokoro v1.0 - 82M parameters, Apache 2.0 Licensed, multilingual TTS model - powered by WebGPU in browser! 🔥 Fully offline, 100% local with support for English, Spanish, French, Italian, Japanese and Mandarin! ⚡ Works directly in your browser with blazingly fast inference… https://t.co/yIl18fhvU4
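Since the posts above contrast running with and without WebGPU, a page would plausibly want to detect WebGPU and fall back when it is missing. Below is a minimal TypeScript sketch of that choice. The helper `pickBackend` is hypothetical, and the option names and values (`device`, `dtype`, `"webgpu"`, `"wasm"`, `"fp32"`, `"q8"`), the `kokoro-js` package, the model id, and the voice name are assumptions based on the kokoro-js README rather than anything stated in the posts; check them against the library before relying on them.

```typescript
// Loader settings. Field names and values are assumptions modeled on kokoro-js.
interface BackendOptions {
  device: "webgpu" | "wasm";
  dtype: "fp32" | "q8";
}

// Choose WebGPU when the given navigator-like object exposes a `gpu` property,
// otherwise fall back to WebAssembly with a quantized model.
// Note: the property being present does not guarantee a usable GPU adapter.
function pickBackend(nav?: object): BackendOptions {
  const hasWebGPU = nav !== undefined && "gpu" in nav;
  return hasWebGPU
    ? { device: "webgpu", dtype: "fp32" }
    : { device: "wasm", dtype: "q8" };
}

// Assumed wiring with kokoro-js, left as comments so this block has no dependencies:
//   import { KokoroTTS } from "kokoro-js";
//   const tts = await KokoroTTS.from_pretrained(
//     "onnx-community/Kokoro-82M-v1.0-ONNX", pickBackend(globalThis.navigator));
//   const audio = await tts.generate("Hello from the browser.", { voice: "af_heart" });
//   audio.save("hello.wav");
```

Keeping the detection in a pure function makes the fallback easy to test outside a browser, and the commented wiring shows roughly where the "five lines" would sit.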