Ask AI a math question

Related Paper

Toward Streaming ASR with Non-Autoregressive Insertion-Based Model

Neural end-to-end (E2E) models have become a promising technique to realize practical automatic speech recognition (ASR) systems.When realizing such a system, one important issue is the segmentation of audio to deal with streaming input or long recording.After audio segmentation, the ASR model with a small real-time factor (RTF) is preferable …

Ask a Question