Evaluation

BOUQUET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation

We introduce BOUQUET, a multi-way, multi-domain, paragraph-level dataset and benchmark for machine translation, designed for broad domain representation and crowd-sourced expansion …

The Omnilingual MT Team

• Feb 6, 2025 • 1 min read

Speech Translation

Speech Is More than Words: Do Speech-to-Text Translation Systems Leverage Prosody?

We investigate whether speech-to-text translation systems utilize prosody by introducing a new benchmark, ContraProSt. Our findings show that while models represent prosody …

Ioannis Tsiamas

• Nov 1, 2024 • 1 min read