Europarl-ST: A Multilingual Corpus for Speech Translation of Parliamentary Debates
Research into spoken language translation (SLT), or speech-to-text translation, is often hampered by a lack of task-specific data resources, as currently available SLT datasets cover only a limited set of language pairs. In this paper we present Europarl-ST, a novel multilingual SLT corpus containing paired …