Publications

(2025). Improving Language and Modality Transfer in Translation by Character-level Modeling. In ACL 2025.
(2025). Sequential Contrastive Audio-Visual Learning. In ICASSP 2025.
(2025). BOUQUET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation. arXiv.
(2024). Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity. In ECCV 2024.
(2024). Speech Is More than Words: Do Speech-to-Text Translation Systems Leverage Prosody?. In WMT 2024.
(2024). Pushing the Limits of Zero-shot End-to-End Speech Translation. In Findings of ACL 2024.
(2023). SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations. In Findings of EMNLP 2023.
(2023). Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23. In IWSLT 2023.
(2023). Explaining How Transformers Use Context to Build Predictions. In ACL 2023.
(2023). Efficient Speech Translation with Dynamic Latent Perceivers. In ICASSP 2023.
(2022). SHAS: Approaching optimal Segmentation for End-to-End Speech Translation. In Interspeech 2022.
(2022). Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022. In IWSLT 2022.
(2021). The TALP-UPC Participation in WMT21 News Translation Task: an mBART-based NMT Approach. In WMT 2021.
(2021). End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021. In IWSLT 2021.