Text-to-Speech Leaderboard

Dataset: LJSPEECH

Models                  utmos_standard_deviation  utmos_mean  wer_standard_deviation  wer_mean
SOTA_D31_Qwen-2.5-Omni  0.340                     4.153       124.642                 134.956
SOTA_D31_SparkTTS       0.161                     4.350       8.978                   4.084