Are word boundaries useful for unsupervised language learning?

Type: Preprint

Publication Date: 2022-01-01

Citations: 5

DOI: https://doi.org/10.48550/arxiv.2210.02956

Locations

  • arXiv (Cornell University) - View - PDF
  • HAL (Le Centre pour la Communication Scientifique Directe) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon 2022 Robin Algayres
Tristan Ricoul
Julien Karadayi
Hugo Laurençon
Salah Zaiem
Abdelrahman Mohamed
Benoît Sagot
Emmanuel Dupoux
+ What do self-supervised speech models know about words? 2023 Ankita Pasad
Chung-Ming Chien
Shane Settle
Karen Livescu
+ PDF Chat DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon 2022 Robin Algayres
Tristan Ricoul
Julien Karadayi
Hugo Laurençon
Salah Zaiem
Abdelrahman Mohamed
Benoît Sagot
Emmanuel Dupoux
+ PDF Chat What Do Self-Supervised Speech Models Know About Words? 2024 Ankita Pasad
Chung-Ming Chien
Shane Settle
Karen Livescu
+ XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words 2023 Robin Algayres
Pablo Diego-Simón
Benoît Sagot
Emmanuel Dupoux
+ XLS-R fine-tuning on noisy word boundaries for unsupervised speech segmentation into words 2023 Robin Algayres
Pablo Diego-Simón
Benoît Sagot
Emmanuel Dupoux
+ Efficient Transformers with Dynamic Token Pooling 2023 Piotr Nawrot
Jan Chorowski
Adrian Łańcucki
Edoardo Maria Ponti
+ Efficient Transformers with Dynamic Token Pooling 2022 Piotr Nawrot
Jan Chorowski
Adrian Łańcucki
Edoardo Maria Ponti
+ Tabula nearly rasa: Probing the Linguistic Knowledge of Character-Level Neural Language Models Trained on Unsegmented Text 2019 Michael G. Hahn
Marco Baroni
+ Tabula nearly rasa: Probing the Linguistic Knowledge of Character-Level Neural Language Models Trained on Unsegmented Text 2019 Michael Hahn
Marco Baroni
+ Neural Word Segmentation with Rich Pretraining 2017 Jie Yang
Yue Zhang
Fei Dong
+ PDF Chat Neural Word Segmentation with Rich Pretraining 2017 Jie Yang
Yue Zhang
Fei Dong
+ Universal Word Segmentation: Implementation and Interpretation 2018 Yan Shao
Christian Hardmeier
Joakim Nivre
+ PDF Chat Universal Word Segmentation: Implementation and Interpretation 2018 Yan Shao
Christian Hardmeier
Joakim Nivre
+ On the Difficulty of Segmenting Words with Attention 2021 Ramon Sanabria
Hao Tang
Sharon Goldwater
+ On the Difficulty of Segmenting Words with Attention 2021 Ramon Sanabria
Hao Tang
Sharon Goldwater
+ On the Difficulty of Segmenting Words with Attention 2021 Ramon Sanabria
Hao Tang
Sharon Goldwater
+ On the Difficulty of Segmenting Words with Attention 2021 Ramon Sanabria
Hao Tang
Sharon Goldwater
+ Word Boundary Information Isn't Useful for Encoder Language Models 2024 Edward Gow-Smith
Dylan Phelps
Harish Tayyar Madabushi
Carolina Scarton
Aline Villavicencio
+ PDF Chat From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes 2024 Zébulon Goriely
Richard Diehl Martinez
Andrew Caines
Lisa Beinborn
Paula Buttery

Works Cited by This (0)

Action Title Year Authors