Speech Is More than Words: Do Speech-to-Text Translation Systems Leverage Prosody?
We investigate whether speech-to-text translation systems utilize prosody by introducing a new benchmark, ContraProSt. Our findings show that while models represent prosody …