X-Dyna: Expressive Dynamic Human Image Animation

Di Chang, Hongyi Xu, You Xie, Yipeng Gao, Zhengfei Kuang, Shengqu Cai, Chenxu Zhang, Guoxian Song, Chao Wang, Yichun Shi

Type: Preprint

Publication Date: 2025-01-17

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2501.10021

Abstract

We introduce X-Dyna, a novel zero-shot, diffusion-based pipeline for animating a single human image using facial expressions and body movements derived from a driving video, that generates realistic, context-aware dynamics for both the subject and the surrounding environment. Building on prior approaches centered on human pose control, X-Dyna addresses key shortcomings causing the loss of dynamic details, enhancing the lifelike qualities of human video animations. At the core of our approach is the Dynamics-Adapter, a lightweight module that effectively integrates reference appearance context into the spatial attentions of the diffusion backbone while preserving the capacity of motion modules in synthesizing fluid and intricate dynamic details. Beyond body pose control, we connect a local control module with our model to capture identity-disentangled facial expressions, facilitating accurate expression transfer for enhanced realism in animated scenes. Together, these components form a unified framework capable of learning physical human motion and natural scene dynamics from a diverse blend of human and scene videos. Comprehensive qualitative and quantitative evaluations demonstrate that X-Dyna outperforms state-of-the-art methods, creating highly lifelike and expressive animations. The code is available at https://github.com/bytedance/X-Dyna.
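The abstract describes the Dynamics-Adapter only at a high level: it injects reference appearance into the spatial attention of the diffusion backbone without impairing the motion modules. The following is a minimal illustrative sketch of one way such an adapter could be wired, not the authors' implementation. It assumes a frozen self-attention layer, a trainable adapter query projection, keys and values computed from reference-image tokens with the frozen projections, and a zero-initialized output projection added as a residual. All names (`spatial_attention_with_adapter`, `a_q`, `a_out`) and these design choices are hypothetical.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def attention(q, k, v):
    """Scaled dot-product attention over token lists (one head, no batch)."""
    scale = math.sqrt(len(q[0]))
    out = []
    for qi in q:
        scores = [sum(a * b for a, b in zip(qi, kj)) / scale for kj in k]
        top = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        out.append([sum(w / total * vj[c] for w, vj in zip(weights, v))
                    for c in range(len(v[0]))])
    return out

def rand_matrix(rows, cols, rng):
    return [[rng.gauss(0.0, 0.5) for _ in range(cols)] for _ in range(rows)]

def spatial_attention_with_adapter(x, ref, frozen, adapter):
    """Frozen self-attention plus a residual reference branch (hypothetical design)."""
    w_q, w_k, w_v = frozen      # frozen backbone projections
    a_q, a_out = adapter        # assumed trainable adapter projections
    # Backbone path: ordinary self-attention over the denoising tokens.
    base = attention(matmul(x, w_q), matmul(x, w_k), matmul(x, w_v))
    # Adapter path: denoising tokens query the reference-image tokens.
    ref_ctx = attention(matmul(x, a_q), matmul(ref, w_k), matmul(ref, w_v))
    residual = matmul(ref_ctx, a_out)
    return [[b + r for b, r in zip(brow, rrow)] for brow, rrow in zip(base, residual)]

rng = random.Random(0)
dim = 4
x = rand_matrix(6, dim, rng)      # 6 denoising tokens
ref = rand_matrix(5, dim, rng)    # 5 reference-image tokens
frozen = tuple(rand_matrix(dim, dim, rng) for _ in range(3))
a_q = rand_matrix(dim, dim, rng)
zero_out = [[0.0] * dim for _ in range(dim)]  # zero-initialized output projection

base_only = attention(matmul(x, frozen[0]), matmul(x, frozen[1]), matmul(x, frozen[2]))
at_init = spatial_attention_with_adapter(x, ref, frozen, (a_q, zero_out))
after_training = spatial_attention_with_adapter(
    x, ref, frozen, (a_q, rand_matrix(dim, dim, rng)))
```

Under these assumptions, the zero-initialized output projection means the layer reproduces the frozen backbone exactly at the start of training, and reference appearance enters only as the adapter weights move away from zero. That is one plausible reading of how appearance could be injected while leaving the pretrained behavior intact.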

Locations

arXiv (Cornell University)

Similar Works

Title	Year	Authors
X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention	2024	You Xie Hongyi Xu Guoxian Song Chao Wang Yichun Shi Linjie Luo
VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation	2024	Qilin Wang Zhengkai Jiang Chengming Xu Jiangning Zhang Yabiao Wang Xinyi Zhang Yun Cao Weijian Cao Chengjie Wang Yanwei Fu
MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer	2023	Di Chang Yichun Shi Quankai Gao Jessica Fu Hongyi Xu Guoxian Song Qing Yan X. B. Yang M. Reza Soleymani
One Shot, One Talk: Whole-body Talking Avatar from a Single Image	2024	Jun Xiang Yudong Guo Liwei Hu Guo Boyang Yancheng Yuan Juyong Zhang
TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models	2024	Jeongho Kim Minjung Kim Junsoo Lee Jaegul Choo
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention	2024	Gaojie Lin Jianwen Jiang Chao Liang Tianyun Zhong Jiaqi Yang Yanbo Zheng
JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation	2024	Xudong Cao Sheng Shi Jun Zhao Yao Yang Jintao Fei Man Gao Gang Wang
AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding	2024	Tao Liu Feilong Chen Shuai Fan Chenpeng Du Chen Qi Xie Chen Kai Yu
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation	2024	Xiang Wang Shiwei Zhang Changxin Gao Jiayu Wang Xiaoqiang Zhou Yingya Zhang Luxin Yan Nong Sang
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework	2024	Ziyao Huang Fan Tang Yong Zhang Xiaodong Cun Juan Cao Jintao Li Tong‐Yee Lee
DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars	2023	Tobias Kirschstein Simon Giebenhain Matthias Nießner
Versatile Face Animator: Driving Arbitrary 3D Facial Avatar in RGBD Space	2023	Haoyu Wang Haozhe Wu Junliang Xing Jia Jia
Expressive Gaussian Human Avatars from Monocular RGB Video	2024	Hezhen Hu Zhiwen Fan Tianhao Wu Yihan Xi Seoyoung Lee Georgios Pavlakos Zhangyang Wang
MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control	2025	Mengting Wei Tuomas Varanka Xingxun Jiang Huai-Qian Khor Guoying Zhao
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation	2024	Rui Meng Xingyu Zhang Yuming Li Chenguang Ma
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis	2024	Enric Corona Andrei Zanfir Eduard Gabriel Băzăvan Nikos Kolotouros Thiemo Alldieck Cristian Sminchisescu
MegActor: Harness the Power of Raw Video for Vivid Portrait Animation	2024	Yang Shurong Huadong Li Juhao Wu Minhao Jing Linze Li Renhe Ji Jiajun Liang Haoqiang Fan
Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation	2024	Xiyi Chen Marko Mihajlovic Shaofei Wang Sergey Prokudin Siyu Tang
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model	2023	Zhongcong Xu Jianfeng Zhang Jun Hao Liew Hanshu Yan Jiawei Liu Chenxu Zhang Jiashi Feng Mike Zheng Shou
Animate-X: Universal Character Image Animation with Enhanced Motion Representation	2024	Shuai Tan Biao Gong Xiang Wang Shiwei Zhang Dandan Zheng Ruobing Zheng Kecheng Zheng Jingdong Chen Ming Yang
