
Nvidia has released BigVGAN v2, a neural vocoder that generates waveforms from mel spectrograms. The new model ships with a custom CUDA kernel for inference that fuses the upsampling and activation operations, giving up to 3x faster inference on A100 GPUs. BigVGAN v2 also updates its discriminator and loss function, using a multi-scale sub-band CQT discriminator and a multi-scale mel spectrogram loss. The release is described as state of the art and is expected to advance the field of audio synthesis.
Nvidia AI Releases BigVGAN v2: A State-of-the-Art Neural Vocoder Transforming Audio Synthesis https://t.co/hvmr8HfjCJ #NvidiaAI #BigVGANv2 #AudioSynthesis #AIevolution #PracticalSolutions #ai #news #llm #ml #research #ainews #innovation #artificialintelligence #machinelearnin… https://t.co/SsPDb0RGqq
Nvidia AI Releases BigVGAN v2: A State-of-the-Art Neural Vocoder Transforming Audio Synthesis Read our take on this: https://t.co/RaVoofe8Gp Model: https://t.co/Jt1cGo8b6p Paper: https://t.co/cbRWuqA8Gl In the rapidly developing field of audio synthesis, Nvidia has recently… https://t.co/GU2lBRJdn7
✨ Train #generativeAI models more efficiently with NVIDIA Megatron-Core, an open source library for large-scale training. Discover its advancements in scalability, training resiliency, and the newly added support for #multimodal training. ➡️ https://t.co/pgz1A2xh81 ✨








