Prefer a chat interface with context about you and your work?
End-to-End Multi-Speaker Speech Recognition Using Speaker Embeddings and Transfer Learning
This paper presents our latest investigation on end-to-end automatic speech recognition (ASR) for overlapped speech.We propose to train an end-to-end system conditioned on speaker embeddings and further improved by transfer learning from clean speech.This proposed framework does not require any parallel non-overlapped speech materials and is independent of the number …