Ask a Question

Prefer a chat interface with context about you and your work?

Transformer-Based Online CTC/Attention End-To-End Speech Recognition Architecture

Transformer-Based Online CTC/Attention End-To-End Speech Recognition Architecture

Recently, Transformer has gained success in automatic speech recognition (ASR) field. However, it is challenging to deploy a Transformer-based end-to-end (E2E) model for online speech recognition. In this paper, we propose the Transformer-based online CTC/attention E2E ASR architecture, which contains the chunk self-attention encoder (chunk-SAE) and the monotonic truncated attention …