VibeVoice
bezzam/VibeVoice-1.5B • Text-to-Speech • 3B • Updated Feb 16 • 32 • 1
bezzam/VibeVoice-7B • Text-to-Speech • 9B • Updated May 5 • 20
bezzam/VibeVoice-AcousticTokenizer • Feature Extraction • 0.7B • Updated Feb 5 • 6
bezzam/VibeVoice-SemanticTokenizer • Feature Extraction • 0.3B • Updated Dec 3, 2025 • 2
Neural codecs
facebook/encodec_48khz • Feature Extraction • 19.1M • Updated Sep 6, 2023 • 6.06k • 35
facebook/encodec_32khz • Feature Extraction • 59M • Updated Sep 4, 2023 • 35.8k • 18
facebook/encodec_24khz • Feature Extraction • 23.3M • Updated Jul 25, 2023 • 30.8k • 54
descript/dac_44khz • Feature Extraction • 76.6M • Updated Oct 11, 2024 • 58.7k • 11
Open ASR Leaderboard configuration for NVIDIA NeMo ASR models • Running • Run benchmark evaluations on your model