High-Fidelity Generalized Emotional Talking Face Generation with Multi-Modal Emotion Space Learning
Recently, emotional talking face generation has received considerable attention. However, existing methods accept only one-hot codes, images, or audio as emotion conditions, so they lack flexible control in practical applications and fail to handle unseen emotion styles due to limited semantics. They either ignore the one-shot setting or the quality of …