Ask a Question

Prefer a chat interface with context about you and your work?

Monotonic Segmental Attention for Automatic Speech Recognition

Monotonic Segmental Attention for Automatic Speech Recognition

We introduce a novel segmental-attention model for automatic speech recognition. We restrict the decoder attention to segments to avoid quadratic runtime of global attention, better generalize to long sequences, and eventually enable streaming. We directly compare global-attention and different segmental-attention modeling variants. We develop and compare two separate time-synchronous decoders, …