Text-to-Speech Leaderboard
| Models | LJSPEECH utmos_standard_deviation | LJSPEECH utmos_mean | LJSPEECH wer_standard_deviation | LJSPEECH wer_mean |
|---|---|---|---|---|
| SparkTTS | 0.161 | 4.350 | 8.978 | 4.084 |
| Qwen-2.5-Omni | 0.340 | 4.153 | 124.642 | 134.956 |