MedDiT: A Knowledge-Controlled Diffusion Transformer Framework for Dynamic Medical Image Generation in Virtual Simulated Patient

Type: Preprint

Publication Date: 2024-08-22

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2408.12236

Abstract

Medical education relies heavily on Simulated Patients (SPs) to provide a safe environment for students to practice clinical skills, including medical image analysis. However, the high cost of recruiting qualified SPs and the lack of diverse medical imaging datasets have presented significant challenges. To address these issues, this paper introduces MedDiT, a novel knowledge-controlled conversational framework that can dynamically generate plausible medical images aligned with simulated patient symptoms, enabling diverse diagnostic skill training. Specifically, MedDiT integrates various patient Knowledge Graphs (KGs), which describe the attributes and symptoms of patients, to dynamically prompt Large Language Models' (LLMs) behavior and control the patient characteristics, mitigating hallucination during medical conversation. Additionally, a well-tuned Diffusion Transformer (DiT) model is incorporated to generate medical images according to the specified patient attributes in the KG. In this paper, we present the capabilities of MedDiT through a practical demonstration, showcasing its ability to act in diverse simulated patient cases and generate the corresponding medical images. This can provide an abundant and interactive learning experience for students, advancing medical education by offering an immersive simulation platform for future healthcare professionals. The work sheds light on the feasibility of incorporating advanced technologies like LLM, KG, and DiT in education applications, highlighting their potential to address the challenges faced in simulated patient-based medical education.

Locations

  • arXiv (Cornell University) - View - PDF

Similar Works

Action Title Year Authors
+ PDF Chat Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language Models 2024 Scott Sumpter
+ PDF Chat Leveraging Large Language Model as Simulated Patients for Clinical Education 2024 Yaneng Li
Cheng Zeng
Jialun Zhong
Ruoyu Zhang
Minhao Zhang
Lei Zou
+ PDF Chat Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator 2024 Yusheng Liao
Yutong Meng
Yuhao Wang
Hongcheng Liu
Yanfeng Wang
Yu Wang
+ PDF Chat MEDCO: Medical Education Copilots Based on A Multi-Agent Framework 2024 Wei Hao
Jianing Qiu
Haibao Yu
Wu Yuan
+ PDF Chat MedThink: Inducing Medical Large-scale Visual Language Models to Hallucinate Less by Thinking More 2024 Yue Jiang
Jiawei Chen
Dingkang Yang
Mingcheng Li
Shunli Wang
Tong Wu
Ke Li
Lihua Zhang
+ PDF Chat Synthetic Patients: Simulating Difficult Conversations with Multimodal Generative AI for Medical Education 2024 Simon Chu
Alex J. Goodell
+ PDF Chat LLMs Can Simulate Standardized Patients via Agent Coevolution 2024 Z. Z. Du
Lujie Zheng
Ruijin Hu
Yan Xu
Xiawei Li
Yingming Sun
Wei Chen
Jianā€Lin Wu
Haolei Cai
Haohao Ying
+ PDF Chat A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions 2024 Lei Liu
Xiaoyan Yang
Junchi Lei
Xiaoyang Liu
Yue Shen
Zhiqiang Zhang
Wei Peng
Jinjie Gu
Zhixuan Chu
Zhan Qin
+ PDF Chat MiniGPT-Med: Large Language Model as a General Interface for Radiology Diagnosis 2024 Asma Alkhaldi
Raneem Alnajim
Layan Al-Abdullatef
Rawan Alyahya
Jun Chen
Deyao Zhu
Ahmed Alsinan
Mohamed Elhoseiny
+ PDF Chat Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next Paradigm 2024 Hongcheng Liu
Yusheng Liao
Siqv Ou
Yuhao Wang
H. Liu
Yanfeng Wang
Yu Wang
+ PDF Chat The Era of Foundation Models in Medical Imaging is Approaching : A Scoping Review of the Clinical Value of Large-Scale Generative AI Applications in Radiology 2024 Il-Hwan Seo
Eun-Hee Bae
Jooyoung Jeon
Yong-Jin Yoon
Jaehyung Cha
+ PDF Chat Bora: Biomedical Generalist Video Generation Model 2024 W.P. Sun
Xiaocao You
Ruizhe Zheng
Zhengqing Yuan
Li Xiang
Lifang He
Quanzheng Li
Lichao Sun
+ PDF Chat AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow 2024 Huizi Yu
Jiayan Zhou
Lingyao Li
Shan Chen
Jack Gallifant
An-tian Shi
Xiang Li
Wenyue Hua
Mingyu Jin
Guang Chen
+ PDF Chat Medical Video Generation for Disease Progression Simulation 2024 Xu Cao
Kaizhao Liang
Kuei-Da Liao
Tianren Gao
Wenqian Ye
Jintai Chen
Zewen Ding
Jianguo Cao
James M. Rehg
Jimeng Sun
+ PDF Chat Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding 2024 Shenghuan Sun
Gregory M. Goldgof
Alexander Schubert
Z.J. Sun
Thomas Hartvigsen
Atul J. Butte
Ahmed M. Alaa
+ PDF Chat D-Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions 2024 Hareem Nisar
Syed Muhammad Anwar
Zhifan Jiang
Abhijeet Parida
Vishwesh Nath
Holger R. Roth
Marius George Linguraru
+ Radiology-GPT: A Large Language Model for Radiology 2023 Zhengliang Liu
Aoxiao Zhong
Yiwei Li
Longtao Yang
Chao Ju
Zihao Wu
Chong Ma
Peng Shu
Cheng Chen
Sekeun Kim
+ PDF Chat Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & Hallucinations 2024 Ankit Pal
Malaikannan Sankarasubbu
+ The application of ChatGPT in medical education 2023 Jialin Liu
Siru Liu
+ Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review 2023 Mingze Yuan
Peng Bao
Jiajia Yuan
Yunhao Shen
Zifan Chen
Yi Xie
Jie Zhao
Yang Chen
Li Zhang
Lin Shen

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (0)

Action Title Year Authors