End-to-End Multi-Speaker Speech Recognition Using Speaker Embeddings and Transfer Learning

This paper presents our latest investigation into end-to-end automatic speech recognition (ASR) for overlapped speech. We propose to train an end-to-end system conditioned on speaker embeddings and further improved by transfer learning from clean speech. The proposed framework does not require any parallel non-overlapped speech materials and is independent of the number …