Ask a Question

Prefer a chat interface with context about you and your work?

Attention-based Wav2Text with feature transfer learning

Attention-based Wav2Text with feature transfer learning

Conventional automatic speech recognition (ASR) typically performs multi-level pattern recognition tasks that map the acoustic speech waveform into a hierarchy of speech units. But, it is widely known that information loss in the earlier stage can propagate through the later stages. After the resurgence of deep learning, interest has emerged …