Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation

Type: Preprint

Publication Date: 2023-01-01

Citations: 13

DOI: https://doi.org/10.48550/arxiv.2303.09319

Locations

  • arXiv (Cornell University) - View
  • DataCite API - View

Similar Works

Action Title Year Authors
+ UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance 2022 Wei Li
Xue Xu
Xinyan Xiao
Jiachen Liu
Hu Yang
Guohao Li
Zhanpeng Wang
Zhifan Feng
Qiaoqiao She
Yajuan Lyu
+ UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion 2024 Wei Li
Xu Xue
Jiachen Liu
Xinyan Xiao
+ PDF Chat JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation 2024 Yu Zeng
Vishal M. Patel
Haochen Wang
Xun Huang
Ting-Chun Wang
Ming-Yu Liu
Yogesh Balaji
+ PDF Chat EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts 2024 Yucheng Han
Rui Wang
Chi Zhang
Juntao Hu
Cheng Pei
Bin Fu
Hanwang Zhang
+ Unified Discrete Diffusion for Simultaneous Vision-Language Generation 2022 Minghui Hu
Chuanxia Zheng
Heliang Zheng
Tat‐Jen Cham
Chaoyue Wang
Zuopeng Yang
Dacheng Tao
Ponnuthurai Nagaratnam Suganthan
+ DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data 2023 Jingyuan Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
+ Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning 2023 Jian Ma
Junhao Liang
Chen Chen
Haonan Lu
+ PDF Chat DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability 2023 Runhui Huang
Jianhua Han
Guansong Lu
Xiaodan Liang
Yihan Zeng
Wei Zhang
Hang Xu
+ DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability 2023 Runhui Huang
Jianhua Han
Guansong Lu
Xiaodan Liang
Yihan Zeng
Wei Zhang
Hang Xu
+ PDF Chat SceneBooth: Diffusion-based Framework for Subject-preserved Text-to-Image Generation 2025 Shang Chai
Zihang Lin
Min Zhou
Xubin Li
Liansheng Zhuang
Houqiang Li
+ FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention 2023 Guangxuan Xiao
Tianwei Yin
William T. Freeman
Frédo Durand
Song Han
+ PDF Chat Nested Diffusion Models Using Hierarchical Latent Priors 2024 Xiao Zhang
Ruoxi Jiang
Rebecca Willett
Michael Maire
+ PDF Chat TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion 2024 Salaheldin Mohamed
+ PDF Chat DiffX: Guide Your Layout to Cross-Modal Generative Modeling 2024 Zeyu Wang
Jingyu Lin
Y. Qian
Yixiang Huang
Sicheng Tian
Bosong Chai
Juncan Deng
Yang Qu
Lan Du
Cunjian Chen
+ PDF Chat DR-GAN: Distribution Regularization for Text-to-Image Generation 2022 Hongchen Tan
Xiuping Liu
Baocai Yin
Xin Li
+ DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models 2021 Gwanghyun Kim
Jong Chul Ye
+ PDF Chat MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation 2024 Kunpeng Song
Yizhe Zhu
Bingchen Liu
Qing Yan
Ahmed Elgammal
Xiao Yang
+ PDF Chat Fast Personalized Text-to-Image Syntheses With Attention Injection 2024 Yuxuan Zhang
Yiren Song
Jinpeng Yu
Han Pan
Zhongliang Jing
+ PDF Chat Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance 2024 Jingyuan Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
+ DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation 2023 Hong Chen
Yipeng Zhang
Xin Wang
Xuguang Duan
Yuwei Zhou
Wenwu Zhu

Works Cited by This (0)

Action Title Year Authors