Ask a Question

Prefer a chat interface with context about you and your work?

Toward Streaming ASR with Non-Autoregressive Insertion-Based Model

Toward Streaming ASR with Non-Autoregressive Insertion-Based Model

Neural end-to-end (E2E) models have become a promising technique to realize practical automatic speech recognition (ASR) systems.When realizing such a system, one important issue is the segmentation of audio to deal with streaming input or long recording.After audio segmentation, the ASR model with a small real-time factor (RTF) is preferable …