Towards Foundation Models for Critical Care Time Series

Type: Preprint

Publication Date: 2024-11-25

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2411.16346

Abstract

Notable progress has been made in generalist medical large language models across various healthcare areas. However, large-scale modeling of in-hospital time series data, such as vital signs, lab results, and treatments in critical care, remains underexplored. Existing datasets are relatively small, but combining them can enhance patient diversity and improve model robustness. To use these combined datasets effectively for large-scale modeling, it is essential to address the distribution shifts caused by varying treatment policies, which requires harmonizing treatment variables across the different datasets. This work aims to establish a foundation for training large-scale multivariate time series models on critical care data and to provide a benchmark for machine learning models in transfer learning across hospitals to study and address distribution shift challenges. We introduce a harmonized dataset for sequence modeling and transfer learning research, representing the first large-scale collection to include core treatment variables. Future plans involve expanding this dataset to support further advancements in transfer learning and the development of scalable, generalizable models for critical healthcare applications.

Locations

  • arXiv (Cornell University)

Similar Works

  • Transfer Learning for Clinical Time Series Analysis using Deep Neural Networks (2019). Priyanka Gupta, Pankaj Malhotra, Jyoti Narwariya, Lovekesh Vig, Gautam Shroff
  • Transfer Learning for Clinical Time Series Analysis using Recurrent Neural Networks (2018). Priyanka Gupta, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff
  • Repurposing Foundation Model for Generalizable Medical Time Series Classification (2024). Nan Huang, Haishuai Wang, Zihuai He, Marinka Žitnik, Xiang Zhang
  • Relaxed Parameter Sharing: Effectively Modeling Time-Varying Relationships in Clinical Time-Series (2019). Jeeheh Oh, Jiaxuan Wang, Shengpu Tang, Michael W. Sjoding, Jenna Wiens
  • Generalized Prompt Tuning: Adapting Frozen Univariate Time Series Foundation Models for Multivariate Healthcare Time Series (2024). Mingzhu Liu, A. Chen, George H. Chen
  • Transfer Learning for Clinical Time Series Analysis Using Deep Neural Networks (2019). Priyanka Gupta, Pankaj Malhotra, Jyoti Narwariya, Lovekesh Vig, Gautam Shroff
  • MIMIC-Extract: a data extraction, preprocessing, and representation pipeline for MIMIC-III (2020). Shirly Wang, Matthew B. A. McDermott, Geeticka Chauhan, Marzyeh Ghassemi, Michael C. Hughes, Tristan Naumann
  • MIMIC-Extract (2020). Shirly Wang, Matthew B. A. McDermott, Geeticka Chauhan, Marzyeh Ghassemi, Michael C. Hughes, Tristan Naumann
  • Contrastive Representation Learning Helps Cross-institutional Knowledge Transfer: A Study in Pediatric Ventilation Management (2025). Yuxuan Liu, Jinpei Han, Padmanabhan Ramnarayan, A. Aldo Faisal
  • Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series (2021). Sindhu Tipirneni, Chandan K. Reddy
  • Intensive Care as One Big Sequence Modeling Problem (2024). Vadim Liventsev, T. A. Fritz
  • On the Importance of Step-wise Embeddings for Heterogeneous Clinical Time-Series (2023). Rita Kuznetsova, Alizée Pace, M. Burger, Hugo Yèche, Gunnar Rätsch
  • Event-Based Contrastive Learning for Medical Time Series (2023). Hyewon Jeong, Nassim Oufattole, Aparna Balagopalan, Matthew B. A. McDermott, Payal Chandak, Marzyeh Ghassemi, Collin M. Stultz
  • Self-supervised Transformer for Multivariate Clinical Time-Series with Missing Values (2021). Sindhu Tipirneni, Chandan K. Reddy
  • Beyond LoRA: Exploring Efficient Fine-Tuning Techniques for Time Series Foundational Models (2024). Divij Gupta, Anubhav Bhatti, Surajsinh Parmar
  • Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series (2022). Sindhu Tipirneni, Chandan K. Reddy
  • TemporAI: Facilitating Machine Learning Innovation in Time Domain Tasks for Medicine (2023). Evgeny S. Saveliev, Mihaela van der Schaar
  • Language Model Training Paradigms for Clinical Feature Embeddings (2023). Yurong Hu, M. Burger, Gunnar Rätsch, Rita Kuznetsova
  • Clairvoyance: A Pipeline Toolkit for Medical Time Series (2023). Daniel Jarrett, Jinsung Yoon, Ioana Bica, Zhaozhi Qian, Ari Ercole, Mihaela van der Schaar

Works That Cite This (0)


Works Cited by This (0)
