Scaling speech enhancement in unseen environments with noise embeddings
Scaling speech enhancement in unseen environments with noise embeddings
We address the problem of speech enhancement generalisation to unseen environments by performing two manipulations.First, we embed an additional recording from the environment alone, and use this embedding to alter activations in the main enhancement subnetwork.Second, we scale the number of noise environments present at training time to 16,784 different …