Kaldi+PDNN: Building DNN-based ASR Systems with Kaldi and PDNN

Type: Preprint

Publication Date: 2014-01-01

Citations: 62

DOI: https://doi.org/10.48550/arxiv.1401.6984

Locations

  • arXiv (Cornell University)
  • DataCite API

Similar Works

Title | Year | Authors
ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi | 2021 | Yu Wang, Chee Siang Leow, Akio Kobayashi, Takehito Utsuro, Hiromitsu Nishizaki
PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch | 2019 | Liang Lu, Xiong Xiao, Zhuo Chen, Yifan Gong
The PyTorch-Kaldi Speech Recognition Toolkit | 2018 | Mirco Ravanelli, Titouan Parcollet, Yoshua Bengio
The Pytorch-kaldi Speech Recognition Toolkit | 2019 | Mirco Ravanelli, Titouan Parcollet, Yoshua Bengio
ESPnet: End-to-End Speech Processing Toolkit | 2018 | Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen
An Exploration of Mimic Architectures for Residual Network Based Spectral Mapping | 2018 | Peter Plantinga, Deblin Bagchi, Eric Fosler‐Lussier
Building DNN Acoustic Models for Large Vocabulary Speech Recognition | 2014 | Andrew L. Maas, Peng Qi, Ziang Xie, Awni Hannun, Christopher T. Lengerich, Daniel Jurafsky, Andrew Y. Ng
UTDUSS: UTokyo-SaruLab System for Interspeech2024 Speech Processing Using Discrete Speech Unit Challenge | 2024 | Wataru Nakata, Kazuki Yamauchi, Dong Yang, Hiroaki Hyodo, Yuki Saito
Reverb: Open-Source ASR and Diarization from Rev | 2024 | Nishchal Bhandari, Dong Chen, María Maza, Natalie Delworth, Jennifer Drexler Fox, Migüel Jetté, Quinten McNamara, Corey Miller, Ondřej Novotný, Ján Profant
Lhotse: a speech data representation library for the modern deep learning ecosystem | 2021 | Piotr Żelasko, Daniel Povey, Jan Trmal, Sanjeev Khudanpur
Integration of TensorFlow based Acoustic Model with Kaldi WFST Decoder | 2019 | Minkyu Lim, Ji‐Hwan Kim
Jasper: An End-to-End Convolutional Neural Acoustic Model | 2019 | Jason Li, Vitaly Lavrukhin, Boris Ginsburg, R. Bret Leary, Oleksii Kuchaiev, Jonathan Cohen, Huyen Nguyen, Ravi Teja Gadde
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition | 2022 | Yu Zhang, Daniel Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang
Comparative Analysis of ASR Methods for Speech Deepfake Detection | 2024 | Davide Salvi, Amit Kumar Singh Yadav, Kratika Bhagtani, Viola Negroni, Paolo Bestagini, Edward J. Delp
Multistream CNN for Robust Acoustic Modeling | 2021 | Kyu J. Han, Jing Pan, Venkata Krishna Naveen Tadala, Tao Ma, Dan Povey

Works Cited by This (0)
