Evaluation

BOUQUET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation

We introduce BOUQUET, a multi-way, multi-domain, paragraph-level dataset and benchmark for machine translation, designed for broad domain representation and crowd-sourced expansion …

The Omnilingual MT Team
Read more

Speech Is More than Words: Do Speech-to-Text Translation Systems Leverage Prosody?

We investigate whether speech-to-text translation systems utilize prosody by introducing a new benchmark, ContraProSt. Our findings show that while models represent prosody …

Ioannis Tsiamas
Read more