SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix

Type: Preprint

Publication Date: 2024-06-29

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2407.00367

Abstract

Video generation models have demonstrated a strong ability to produce impressive monocular videos; however, the generation of 3D stereoscopic video remains under-explored. We propose a pose-free and training-free approach for generating 3D stereoscopic videos using an off-the-shelf monocular video generation model. Our method warps a generated monocular video into camera views on a stereoscopic baseline using estimated video depth, and employs a novel frame matrix video inpainting framework. The framework leverages the video generation model to inpaint frames observed from different timestamps and views. This effective approach generates consistent and semantically coherent stereoscopic videos without scene optimization or model fine-tuning. Moreover, we develop a disocclusion boundary re-injection scheme that further improves the quality of video inpainting by alleviating the negative effects propagated from disoccluded areas in the latent space. We validate the efficacy of our proposed method by conducting experiments on videos from various generative models, including Sora [4], Lumiere [2], WALT [8], and Zeroscope [42]. The experiments demonstrate that our method achieves a significant improvement over previous methods. The code will be released at https://daipengwa.github.io/SVG_ProjectPage.
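
As a rough illustration of the depth-based warping step mentioned in the abstract, the sketch below forward-warps a single frame to a horizontally shifted right-eye view and reports which target pixels received no source pixel. It is not the authors' implementation: the function name `warp_to_right_view`, the pinhole relation disparity = focal × baseline / depth, nearest-pixel splatting, and the z-buffer tie-breaking are all assumptions chosen for simplicity.

```python
def warp_to_right_view(image, depth, focal, baseline):
    """Forward-warp one frame to a right-eye view (hypothetical helper).

    image: list of rows of pixel values.
    depth: list of rows of positive depths, same shape as image.
    Returns (warped, holes); holes[y][x] is True where no source pixel landed.
    """
    height, width = len(image), len(image[0])
    warped = [[None] * width for _ in range(height)]
    zbuf = [[float("inf")] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            z = depth[y][x]
            # Assumed pinhole stereo model: nearer pixels shift further.
            disparity = focal * baseline / z
            xt = int(round(x - disparity))  # right view: content moves left
            # Z-buffer: keep the nearest surface when several pixels collide.
            if 0 <= xt < width and z < zbuf[y][xt]:
                warped[y][xt] = image[y][x]
                zbuf[y][xt] = z
    # A target pixel that was never written is a hole to be inpainted.
    holes = [[zb == float("inf") for zb in row] for row in zbuf]
    return warped, holes


# One scanline: a near object (depth 5) in front of a far background (depth 10).
image = [[10, 20, 30, 40, 50, 60]]
depth = [[10, 10, 5, 5, 10, 10]]
warped, holes = warp_to_right_view(image, depth, focal=10.0, baseline=1.0)
```

In this toy scanline the near object shifts further than the background, so a gap opens on its right side and another at the image border. Gaps of this kind are the disoccluded regions that an inpainting stage would need to fill.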

Locations

  • arXiv (Cornell University)

Similar Works

  • StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos (2024). Sijie Zhao, Wenbo Hu, Xiaodong Cun, Yong Zhang, Xiaoyu Li, Zhe Kong, Xiangjun Gao, Muyao Niu, Ying Shan
  • Elevating Flow-Guided Video Inpainting with Reference Generation (2024). Suhwan Cho, Seoung Wug Oh, Sangyoun Lee, Joon‐Young Lee
  • World-consistent Video Diffusion with Explicit 3D Modeling (2024). Qihang Zhang, Shuangfei Zhai, Miguel Ángel Bautista, Ke-Xuan Miao, Alexander Toshev, Joshua M. Susskind, Jiatao Gu
  • MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance (2024). Yuang Zhang, Jiaxi Gu, Li-Wen Wang, Han Wang, Junqi Cheng, Zhu Yue-feng, Fangyuan Zou
  • Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels (2024). Yikai Wang, Xinzhou Wang, Zilong Chen, Zhengyi Wang, Fuchun Sun, Jun Zhu
  • T-SVG: Text-Driven Stereoscopic Video Generation (2024). Qiao Jin, Xiaohong Chen, Wu Liu, Tao Mei, Yongdong Zhang
  • GenDeF: Learning Generative Deformation Field for Video Generation (2023). Wen Wang, Kecheng Zheng, Qiuyu Wang, Hao Chen, Zifan Shi, Ceyuan Yang, Yujun Shen, Chunhua Shen
  • LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes (2023). Jaeyoung Chung, Suyong Lee, Hyeongjin Nam, Jaerin Lee, Kyoung Mu Lee
  • Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training (2024). Zhenghong Zhou, Jie An, Jiebo Luo
  • Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion (2023). Kira Prabhu, Jane Y. Wu, Lynn Tsai, Peter Hedman, Dan B Goldman, Ben Poole, Michael Broxton
  • Improving Consistency and Correctness of Sequence Inpainting using Semantically Guided Generative Adversarial Network (2017). Avisek Lahiri, Arnav Jain, Prabir Kumar Biswas, Pabitra Mitra
  • GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking (2025). Weikang Bian, Zhaoyang Huang, Xiaoyu Shi, Yijin Li, Fu-Yun Wang, Hongsheng Li
  • VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model (2024). Qi Zuo, Xiaodong Gu, Lingteng Qiu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Rui Peng, Siyu Zhu, Zilong Dong, Liefeng Bo
  • StereoGen: High-quality Stereo Image Generation from a Single Image (2025). Xianqi Wang, Hao Yang, Gangwei Xu, Junda Cheng, Min Lin, Yong Deng, Jinliang Zang, Yu-Rui Chen, Xinqi Yang
  • Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images (2024). Xudong Cai, Yongcai Wang, Zhaoxin Fan, Deng Haoran, Shuo Wang, Wanting Li, Deying Li, Luo Lun, Minhang Wang, Jintao Xu
  • MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing (2024). Chenjie Cao, Chaohui Yu, Fan Wang, Xiangyang Xue, Yanwei Fu
  • Diffusion²: Dynamic 3D Content Generation via Score Composition of Orthogonal Diffusion Models (2024). Zeyu Yang, Zijie Pan, Chun Gu, Li Zhang
  • StereoCrafter-Zero: Zero-Shot Stereo Video Generation with Noisy Restart (2024). Jian Shi, Qian Wang, Zhenyu Li, Peter Wonka
  • Learning Temporally Consistent Video Depth from Video Diffusion Priors (2024). Jiahao Shao, Yuanbo Yang, H. Zhou, Youmin Zhang, Yujun Shen, Matteo Poggi, Yiyi Liao
  • Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image (2023). Liao Shen, Xingyi Li, Huiqiang Sun, Juewen Peng, Ke Xian, Zhiguo Cao, Guosheng Lin

Works That Cite This (0)

Works Cited by This (0)