Text-to-Speech Leaderboard
| Models | LJSPEECH wer_mean | LJSPEECH wer_standard_deviation | LJSPEECH utmos_mean | LJSPEECH utmos_standard_deviation |
|---|---|---|---|---|
| SOTA_D31_Qwen-2.5-Omni | 134.956 | 124.642 | 4.153 | 0.340 |
| SOTA_D31_SparkTTS | 4.084 | 8.978 | 4.350 | 0.161 |