Yiannis (Ioannis) Tsiamas 👾
(he/him)

AI Research Scientist
Multilinguality & Multimodality | Speech & Text Translation | Representation Learning

I’m Yiannis Tsiamas, an AI Research Scientist and Ph.D. candidate at UPC Barcelona, working on multilinguality, multimodality, text/speech translation, and representation learning. I have authored papers at top-tier conferences such as ACL, EMNLP, and ICASSP, and I’ve had the privilege of contributing to cutting-edge research during internships at Meta, Apple, and Dolby.

I am passionate about languages, and my goal is to make AI systems truly multilingual and accessible to everyone.

†Note: I publish under ‘Ioannis Tsiamas’ but use Yiannis as my preferred name, which is the casual version of Ioannis.


Experience & Education

AI Research Scientist

Internship at the Omnilingual team of Meta FAIR.
(Aug 2024 - Present)

AI Research Scientist

Internship at the Machine Translation team of Apple AI/ML.
(Apr 2024 - Jul 2024)

AI Research Scientist

Internship on Audio-Visual Representations at Dolby AI.
(Nov 2023 - Feb 2024)


PhD in AI

PhD in Artificial Intelligence at UPC Barcelona.
(Mar 2021 - Present)

MSc in AI

MSc in Artificial Intelligence at the University of Amsterdam.
(Graduated Aug 2020)

MSc in Quant Finance

MSc in Quantitative Finance at VU University Amsterdam.
(Graduated Oct 2018)

Featured Publications

Improving Language and Modality Transfer in Translation by Character-level Modeling

We propose a character-based translation model to improve adaptability to new languages and modalities, particularly for low-resource scenarios. Our method achieves …

Ioannis Tsiamas

Sequential Contrastive Audio-Visual Learning

We introduce Sequential Contrastive Audio-Visual Learning (SCAV), a novel method that contrasts non-aggregated sequential representations to learn fine-grained audio-visual …

Ioannis Tsiamas

Pushing the Limits of Zero-shot End-to-End Speech Translation

We introduce ZeroSwot, a zero-shot speech translation method that aligns a speech encoder with a multilingual MT model using only ASR data, achieving state-of-the-art results …

Ioannis Tsiamas
Recent Publications
(2025). Improving Language and Modality Transfer in Translation by Character-level Modeling. In ACL 2025.
(2025). Sequential Contrastive Audio-Visual Learning. In ICASSP 2025.
(2025). BOUQUET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation. arXiv.
(2024). Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity. In ECCV 2024.
(2024). Speech Is More than Words: Do Speech-to-Text Translation Systems Leverage Prosody? In WMT 2024.