🇳🇴 Oslo 🏴 Edinburgh 🇦🇹 Graz
I presented a TTS-for-ASR paper:
www.isca-archive.org/interspeech_...
And one on prosody reps: www.isca-archive.org/interspeech_...
There were many interesting questions & comments - if you have more and didn't get the chance feel free to send me a message.
I presented a TTS-for-ASR paper:
www.isca-archive.org/interspeech_...
And one on prosody reps: www.isca-archive.org/interspeech_...
There were many interesting questions & comments - if you have more and didn't get the chance feel free to send me a message.
CMOS has a history in evaluation standards, just like MOS. But recently it's all about speech synth.
(7/9)
CMOS has a history in evaluation standards, just like MOS. But recently it's all about speech synth.
(7/9)
We found that our zero-shot distribution distance (similar to FID across several factors like prosody, speaker, etc.) correlated well with subjective evaluation for TTS systems from 2008 to 2024.
ttsdsbenchmark.com
We found that our zero-shot distribution distance (similar to FID across several factors like prosody, speaker, etc.) correlated well with subjective evaluation for TTS systems from 2008 to 2024.
ttsdsbenchmark.com
Alongside there is also a nice overview of all systems (see below)
Alongside there is also a nice overview of all systems (see below)