X-Dyna: Expressive Dynamic Human Image Animation

Type: Preprint

Publication Date: 2025-01-17

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2501.10021

Abstract

We introduce X-Dyna, a novel zero-shot, diffusion-based pipeline for animating a single human image using facial expressions and body movements derived from a driving video, that generates realistic, context-aware dynamics for both the subject and the surrounding environment. Building on prior approaches centered on human pose control, X-Dyna addresses key shortcomings causing the loss of dynamic details, enhancing the lifelike qualities of human video animations. At the core of our approach is the Dynamics-Adapter, a lightweight module that effectively integrates reference appearance context into the spatial attentions of the diffusion backbone while preserving the capacity of motion modules in synthesizing fluid and intricate dynamic details. Beyond body pose control, we connect a local control module with our model to capture identity-disentangled facial expressions, facilitating accurate expression transfer for enhanced realism in animated scenes. Together, these components form a unified framework capable of learning physical human motion and natural scene dynamics from a diverse blend of human and scene videos. Comprehensive qualitative and quantitative evaluations demonstrate that X-Dyna outperforms state-of-the-art methods, creating highly lifelike and expressive animations. The code is available at https://github.com/bytedance/X-Dyna.

Locations

  • arXiv (Cornell University)

Similar Works

  • X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention (2024). You Xie, Hongyi Xu, Guoxian Song, Chao Wang, Yichun Shi, Linjie Luo
  • VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation (2024). Qilin Wang, Zhengkai Jiang, Chengming Xu, Jiangning Zhang, Yabiao Wang, Xinyi Zhang, Yun Cao, Weijian Cao, Chengjie Wang, Yanwei Fu
  • MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer (2023). Di Chang, Yichun Shi, Quankai Gao, Jessica Fu, Hongyi Xu, Guoxian Song, Qing Yan, X. B. Yang, M. Reza Soleymani
  • One Shot, One Talk: Whole-body Talking Avatar from a Single Image (2024). Jun Xiang, Yudong Guo, Liwei Hu, Guo Boyang, Yancheng Yuan, Juyong Zhang
  • TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models (2024). Jeongho Kim, Minjung Kim, Junsoo Lee, Jaegul Choo
  • CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention (2024). Gaojie Lin, Jianwen Jiang, Chao Liang, Tianyun Zhong, Jiaqi Yang, Yanbo Zheng
  • JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation (2024). Xudong Cao, Sheng Shi, Jun Zhao, Yao Yang, Jintao Fei, Man Gao, Gang Wang
  • AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding (2024). Tao Liu, Feilong Chen, Shuai Fan, Chenpeng Du, Chen Qi, Xie Chen, Kai Yu
  • UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation (2024). Xiang Wang, Shiwei Zhang, Changxin Gao, Jiayu Wang, Xiaoqiang Zhou, Yingya Zhang, Luxin Yan, Nong Sang
  • Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework (2024). Ziyao Huang, Fan Tang, Yong Zhang, Xiaodong Cun, Juan Cao, Jintao Li, Tong‐Yee Lee
  • DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars (2023). Tobias Kirschstein, Simon Giebenhain, Matthias Nießner
  • Versatile Face Animator: Driving Arbitrary 3D Facial Avatar in RGBD Space (2023). Haoyu Wang, Haozhe Wu, Junliang Xing, Jia Jia
  • Expressive Gaussian Human Avatars from Monocular RGB Video (2024). Hezhen Hu, Zhiwen Fan, Tianhao Wu, Yihan Xi, Seoyoung Lee, Georgios Pavlakos, Zhangyang Wang
  • MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control (2025). Mengting Wei, Tuomas Varanka, Xingxun Jiang, Huai-Qian Khor, Guoying Zhao
  • EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation (2024). Rui Meng, Xingyu Zhang, Yuming Li, Chenguang Ma
  • VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis (2024). Enric Corona, Andrei Zanfir, Eduard Gabriel Băzăvan, Nikos Kolotouros, Thiemo Alldieck, Cristian Sminchisescu
  • MegActor: Harness the Power of Raw Video for Vivid Portrait Animation (2024). Yang Shurong, Huadong Li, Juhao Wu, Minhao Jing, Linze Li, Renhe Ji, Jiajun Liang, Haoqiang Fan
  • Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation (2024). Xiyi Chen, Marko Mihajlovic, Shaofei Wang, Sergey Prokudin, Siyu Tang
  • MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model (2023). Zhongcong Xu, Jianfeng Zhang, Jun Hao Liew, Hanshu Yan, Jiawei Liu, Chenxu Zhang, Jiashi Feng, Mike Zheng Shou
  • Animate-X: Universal Character Image Animation with Enhanced Motion Representation (2024). Shuai Tan, Biao Gong, Xiang Wang, Shiwei Zhang, Dandan Zheng, Ruobing Zheng, Kecheng Zheng, Jingdong Chen, Ming Yang

Works That Cite This (0)


Works Cited by This (0)
