Improving Speech Translation Accuracy and Time Efficiency with Fine-tuned wav2vec 2.0-based Speech Segmentation

Type: Preprint

Publication Date: 2023-01-01

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2304.12659

Locations

  • arXiv (Cornell University)
  • DataCite API

Similar Works

  • Improving Speech Translation Accuracy and Time Efficiency with Fine-tuned wav2vec 2.0-based Speech Segmentation (2023). Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura.
  • SHAS: Approaching optimal Segmentation for End-to-End Speech Translation (2022). Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà.
  • Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech Translation (2022). Ryo Fukuda, Katsuhito Sudoh, Satoshi Nakamura.
  • Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead (2022). Piyush Behre, Naveen Parihar, Sharman Tan, Amy Shah, Eva Sharma, Geoffrey Liu, Shuangyu Chang, Hosam Khalil, Chris Basoglu, Dev S. Pathak.
  • SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations (2022). Ioannis Tsiamas, José A. R. Fonollosa, Marta R. Costa-jussà.
  • SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations (2023). Ioannis Tsiamas, José Fonollosa, Marta R. Costa-jussà.
  • On Target Segmentation for Direct Speech Translation (2020). Mattia Antonino Di Gangi, Marco Gaido, Matteo Negri, Marco Turchi.
  • Lightweight Audio Segmentation for Long-form Speech Translation (2024). Jaesong Lee, So Yoon Kim, Hanbyul Kim, Joon Son Chung.
  • Learning When to Translate for Streaming Speech (2021). Qianqian Dong, Yaoming Zhu, Mingxuan Wang, Lei Li.
  • RedApt: An Adaptor for wav2vec 2 Encoding Faster and Smaller Speech Translation without Quality Compromise (2022). Jinming Zhao, Yang Hao, Gholamreza Haffari, Ehsan Shareghi.
  • Learning When to Translate for Streaming Speech (2022). Dong Qian, Yaoming Zhu, Mingxuan Wang, Lei Li.
  • Unified Speech-Text Pre-training for Speech Translation and Recognition (2022). Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli.
  • UniST: Unified End-to-end Model for Streaming and Non-streaming Speech Translation (2021). Qianqian Dong, Yaoming Zhu, Mingxuan Wang, Lei Li.
  • Direct Speech-to-Speech Neural Machine Translation: A Survey (2024). Mahendra Gupta, Maitreyee Dutta, Chandresh Kumar Maurya.
  • NeurST: Neural Speech Translation Toolkit (2020). Chengqi Zhao, Mingxuan Wang, Qianqian Dong, Rong Ye, Lei Li.

Works That Cite This (0)

Works Cited by This (19)

  • BERTScore: Evaluating Text Generation with BERT (2019). Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q. Weinberger, Yoav Artzi.
  • A Call for Clarity in Reporting BLEU Scores (2018). Matt Post.
  • On the Properties of Neural Machine Translation: Encoder–Decoder Approaches (2014). Kyunghyun Cho, Bart van Merriënboer, Dzmitry Bahdanau, Yoshua Bengio.
  • Parameter-Efficient Transfer Learning for NLP (2019). Neil Houlsby, Andrei Giurgiu, Stanisław Jastrzębski, Bruna Morrone, Quentin de Laroussilhe, Andréa Gesmundo, Mona Attariyan, Sylvain Gelly.
  • End-to-End Automatic Speech Recognition Integrated with CTC-Based Voice Activity Detection (2020). Takenori Yoshimura, Tomoki Hayashi, Kazuya Takeda, Shinji Watanabe.
  • Europarl-ST: A Multilingual Corpus for Speech Translation of Parliamentary Debates (2020). Javier Iranzo-Sánchez, Joan Albert Silvestre-Cerdà, Javier Jorge, Nahuel Roselló, Adrià Giménez, Albert Sanchís, Jorge Civera, Alfons Juan.
  • BLEURT: Learning Robust Metrics for Text Generation (2020). Thibault Sellam, Dipanjan Das, Ankur P. Parikh.
  • wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations (2020). Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, Michael Auli.
  • Contextualized Translation of Automatically Segmented Speech (2020). Marco Gaido, Mattia Antonino Di Gangi, Matteo Negri, Mauro Cettolo, Marco Turchi.
  • On Knowledge Distillation for Direct Speech Translation (2020). Marco Gaido, Mattia Antonino Di Gangi, Matteo Negri, Marco Turchi.