Ask a Question

Prefer a chat interface with context about you and your work?

Deep Contextualized Acoustic Representations for Semi-Supervised Speech Recognition

Deep Contextualized Acoustic Representations for Semi-Supervised Speech Recognition

We propose a novel approach to semi-supervised automatic speech recognition (ASR). We first exploit a large amount of unlabeled audio data via representation learning, where we reconstruct a temporal slice of filterbank features from past and future context frames. The resulting deep contextualized acoustic representations (DeCoAR) are then used to …