Publications

(2025). Improving Language and Modality Transfer in Translation by Character-level Modeling. In ACL 2025.
(2025). Sequential Contrastive Audio-Visual Learning. In ICASSP 2025.
(2025). BOUQUET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation. arXiv.
(2024). Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity. In ECCV 2024.
(2024). Speech Is More than Words: Do Speech-to-Text Translation Systems Leverage Prosody?. In WMT 2024.
(2024). Pushing the Limits of Zero-shot End-to-End Speech Translation. In Findings of ACL 2024.
(2023). SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations. In Findings of EMNLP 2023.
(2023). Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23. In IWSLT 2023.
(2023). Explaining How Transformers Use Context to Build Predictions. In ACL 2023.
(2023). Efficient Speech Translation with Dynamic Latent Perceivers. In ICASSP 2023.