Text-to-Speech Leaderboard

Models LJSPEECH
utmos_standard_deviation utmos_mean wer_standard_deviation wer_mean
Qwen-2.5-Omni 0.340 4.153 124.642 134.956
SparkTTS 0.161 4.350 8.978 4.084