Lattice rescoring strategies for long short term memory language models in speech recognition

Type: Preprint

Publication Date: 2017-12-01

Citations: 41

DOI: https://doi.org/10.1109/asru.2017.8268931

Abstract

Recurrent neural network (RNN) language models (LMs) and Long Short-Term Memory (LSTM) LMs, a variant of RNN LMs, have been shown to outperform traditional N-gram LMs on speech recognition tasks. However, these models are computationally more expensive than N-gram LMs for decoding and are therefore challenging to integrate into speech recognizers. Recent research has proposed lattice rescoring with RNN LMs and LSTM LMs as an efficient strategy for integrating these models into a speech recognition system. In this paper, we evaluate existing lattice rescoring algorithms, along with new variants, on a YouTube speech recognition task. Lattice rescoring using LSTM LMs reduces the word error rate (WER) for this task by 8% relative to the WER obtained using an N-gram LM.
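The general idea of lattice rescoring can be illustrated with a minimal sketch. It is not the paper's algorithm or any of its variants: it assumes a toy lattice whose nodes are numbered in topological order, a hypothetical stand-in scorer `toy_lm_log_prob` in place of a trained LSTM LM, and the common approximation of merging hypotheses that share the same last `history_order` words. All names, arcs, and scores below are invented for illustration.

```python
import math
from collections import defaultdict

# Toy lattice (illustrative values only): each arc is
# (source_node, target_node, word, first_pass_log_score).
# Assumption: every arc goes from a lower to a higher node number.
ARCS = [
    (0, 1, "the", -1.0),
    (0, 1, "a", -1.2),
    (1, 2, "cat", -2.0),
    (1, 2, "cap", -1.8),
    (2, 3, "sat", -1.5),
]
FINAL = 3


def toy_lm_log_prob(history, word):
    """Hypothetical stand-in for a neural LM: returns log P(word | history)."""
    favoured = {("the", "cat"), ("cat", "sat")}
    prev = history[-1] if history else "<s>"
    return math.log(0.6) if (prev, word) in favoured else math.log(0.1)


def rescore(arcs, final, lm, lm_weight=1.0, history_order=2):
    """Return (score, words) of the best path after adding LM scores.

    Hypotheses reaching the same node with the same last `history_order`
    words are merged and only the best-scoring one is kept.
    """
    # best[node][truncated_history] = (score, full_word_sequence)
    best = defaultdict(dict)
    best[0][()] = (0.0, [])
    # Sorting by source node visits arcs in topological order,
    # given the node-numbering assumption above.
    for src, dst, word, first_pass in sorted(arcs):
        for score, words in list(best[src].values()):
            new_words = words + [word]
            new_score = score + first_pass + lm_weight * lm(words, word)
            key = tuple(new_words[-history_order:])
            if key not in best[dst] or new_score > best[dst][key][0]:
                best[dst][key] = (new_score, new_words)
    return max(best[final].values())


score, words = rescore(ARCS, FINAL, toy_lm_log_prob)
print(words, round(score, 3))
```

In this toy lattice the first-pass scores alone favour "the cap sat", while the combined score favours "the cat sat". Smaller values of `history_order` merge more hypotheses and so trade accuracy for fewer LM evaluations.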

Locations

  • arXiv (Cornell University)
  • 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
