``VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance,'' Jiheum Yeom, Heeseung Kim, Jooyoung Choi, Che Hyun Lee, Nohil Park, Sungroh Yoon, https://t.co/dLteHypVE5
``NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers,'' Nohil Park, Heeseung Kim, Che Hyun Lee, Jooyoung Choi, Jiheum Yeom, Sungroh Yoon, https://t.co/92xyHStoPl
``Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM,'' Fengrun Zhang, Wang Geng, Hukai Huang, Cheng Yi, He Qu, https://t.co/dXySY7RZtW
Google LLC has introduced a zero-shot cross-lingual voice transfer module for text-to-speech (TTS) systems. This innovative technology allows for seamless voice transfer across multiple languages, enhancing the capabilities of multilingual TTS systems. The research, led by F Biadsy, Y Chen, I Elias, and K Kastner, is part of Google's ongoing efforts to improve speech synthesis and voice recognition technologies.