Fed-GLMM: A Privacy-Preserving and Computation-Efficient Federated Algorithm for Generalized Linear Mixed Models to Analyze Correlated Electronic Health Records Data

Type: Preprint

Publication Date: 2022-03-10

Citations: 4

DOI: https://doi.org/10.1101/2022.03.07.22271469

Abstract

Abstract Large collaborative research networks provide opportunities to jointly analyze multicenter electronic health record (EHR) data, which can improve the sample size, diversity of the study population, and generalizability of the results. However, there are challenges to analyzing multicenter EHR data including privacy protection, large-scale computation, heterogeneity across sites, and correlated observations. In this paper, we propose a federated algorithm for generalized linear mixed models (Fed-GLMM), which can flexibly model multicenter longitudinal or correlated data while accounting for site-level heterogeneity. Fed-GLMM can be applied to both federated and centralized research networks to enable privacy-preserving data integration and improve computational efficiency. By communicating only a limited amount of summary statistics, Fed-GLMM can achieve nearly identical results as the gold-standard method where the GLMM is directly fitted on the pooled dataset. We demonstrate the performance of Fed-GLMM in both numerical experiments and an application to longitudinal EHR data from multiple healthcare facilities.

Locations

  • medRxiv (Cold Spring Harbor Laboratory) - View - PDF

Similar Works

Action Title Year Authors
+ PDF Chat A privacy-preserving and computation-efficient federated algorithm for generalized linear mixed models to analyze correlated electronic health records data 2023 Zhiyu Yan
Kori S. Zachrison
Lee H. Schwamm
Juan Estrada
Rui Duan
+ Federated Learning Algorithms for Generalized Mixed-effects Model (GLMM) on Horizontally Partitioned Data from Distributed Sources 2021 Wentao Li
Jiayi Tong
Md. Monowar Anjum
Noman Mohammed
Yong Chen
Xiaoqian Jiang
+ Federated learning algorithms for generalized mixed-effects model (GLMM) on horizontally partitioned data from distributed sources 2022 Wentao Li
Jiayi Tong
Md. Monowar Anjum
Noman Mohammed
Yong Chen
Xiaoqian Jiang
+ Linear Mixed Modeling of Federated Data When Only the Mean, Covariance, and Sample Size Are Available 2024 Marie Analiz April Limpoco
Christel Faes
Niel Hens
+ PDF Chat Federated Learning Algorithms for Generalized Mixed-effects Model (GLMM) on Horizontally Partitioned Data from Distributed Sources 2022 Wentao Li
Jiayi Tong
Md. Monowar Anjum
Noman Mohammed
Yong Chen
Xiaoqian Jiang
+ PDF Chat Federated generalized linear mixed models for collaborative genome-wide association studies 2023 Wentao Li
Han Chen
Xiaoqian Jiang
Arif Harmanci
+ PDF Chat Linear mixed modelling of federated data when only the mean, covariance, and sample size are available 2024 Marie Analiz April Limpoco
Christel Faes
Niel Hens
+ PDF Chat Privacy-preserving construction of generalized linear mixed model for biomedical computation 2020 Rui Zhu
Chao Jiang
Xiaofeng Wang
Shuang Wang
Hao Zheng
Haixu Tang
+ Federated Generalized Linear Mixed Models for Collaborative Genome-wide Association Studies 2022 Wentao Li
Han Chen
Xiaoqian Jiang
Arif Harmanci
+ PDF Chat Federated Generalized Linear Mixed Models for Collaborative Genome-Wide Association Studies 2022 Wentao Li
Han Chen
Xiaoqian Jiang
Arif Harmanci
+ dPQL: a lossless distributed algorithm for generalized linear mixed model with application to privacy-preserving hospital profiling 2022 Chongliang Luo
Md. Nazmul Islam
Natalie E. Sheils
John Buresh
Martijn J. Schuemie
Jalpa A. Doshi
Rachel M. Werner
David A. Asch
Yong Chen
+ PDF Chat Privacy-preserving federated prediction of pain intensity change based on multi-center survey data 2024 Supratim Das
Mahdie Rafie
Paula Kammer
Søren Thorgaard Skou
D.T. Grønne
Ewa M. Roos
André Hajek
Hans‐Helmut König
Md Shihab Ullaha
Niklas Probul
+ PDF Chat dPQL: a lossless distributed algorithm for generalized linear mixed model with application to privacy-preserving hospital profiling 2021 Chongliang Luo
Md. Nazmul Islam
Natalie E. Sheils
John Buresh
Yong Chen
+ Robust Inference for Federated Meta-Learning 2023 Zijian Guo
Xiudi Li
Larry Han
Tianxi Cai
+ PDF Chat Federated mixed effects logistic regression based on one-time shared summary statistics 2024 Marie Analiz April Limpoco
Christel Faes
Niel Hens
+ PDF Chat SurvMaximin: Robust Federated Approach to Transporting Survival Risk Prediction Models 2022 Xuan Wang
Harrison G. Zhang
Xin Xiong
Chuan Hong
Griffin M. Weber
Gabriel A. Brat
Clara-Lea Bonzel
Yuan Luo
Rui Duan
Nathan Palmer
+ PDF Chat An efficient and accurate distributed learning algorithm for modeling multi-site zero-inflated count outcomes 2021 Mackenzie Edmondson
Chongliang Luo
Rui Duan
Mitchell Maltenfort
Zhaoyi Chen
Kenneth W. Locke
Justine Shults
Jiang Bian
Patrick Ryan
Christopher B. Forrest
+ PDF Chat Robust Inference for Federated Meta-Learning 2025 Zijian Guo
Xiudi Li
Larry Han
Tianxi Cai
+ PDF Chat Lossless Distributed Linear Mixed Model with Application to Integration of Heterogeneous Healthcare Data 2020 Chongliang Luo
Md. Nazmul Islam
Natalie E. Sheils
Jenna Reps
John Buresh
Rui Duan
Jiayi Tong
Mackenzie Edmondson
Martijn J. Schumie
Yong Chen
+ Federated Multiple Imputation for Variables that Are Missing Not At Random in Distributed Electronic Health Records 2024 Yi Lian
Xiaoqian Jiang
Qi Long

Works Cited by This (13)

Action Title Year Authors
+ PDF Chat WebDISCO: a web service for distributed cox model learning without patient-level data sharing 2015 Chia-Lun Lu
Shuang Wang
Zhanglong Ji
Yuan Wu
Li Xiong
Xiaoqian Jiang
Lucila Ohno‐Machado
+ PDF Chat pSCANNER: patient-centered Scalable National Network for Effectiveness Research 2014 Lucila Ohno‐Machado
Zia Agha
Douglas S. Bell
Lisa Dahm
Michele E. Day
JN Doctor
Davera Gabriel
Maninder Kahlon
Katherine Kim
Michael Hogarth
+ PDF Chat Grid Binary LOgistic REgression (GLORE): building shared models without sharing data 2012 Yuan Wu
Xiaoqian Jiang
Jihoon Kim
Lucila Ohno‐Machado
+ PDF Chat Meta-analysis in clinical trials revisited 2015 Rebecca DerSimonian
Nan M. Laird
+ A fast divide-and-conquer sparse Cox regression 2019 Yan Wang
Chuan Hong
Nathan Palmer
Qian Di
Joel Schwartz
Isaac S. Kohane
Tianxi Cai
+ PDF Chat Learning from electronic health records across multiple sites: A communication-efficient and privacy-preserving distributed algorithm 2019 Rui Duan
Mary Regina Boland
Zixuan Liu
Yue Liu
Howard H. Chang
Hua Xu
Haitao Chu
Christopher H. Schmid
Christopher B. Forrest
John H. Holmes
+ A divide-and-conquer method for sparse risk prediction and evaluation 2020 Chuan Hong
Yan Wang
Tianxi Cai
+ PDF Chat Lossless Distributed Linear Mixed Model with Application to Integration of Heterogeneous Healthcare Data 2020 Chongliang Luo
Md. Nazmul Islam
Natalie E. Sheils
Jenna Reps
John Buresh
Rui Duan
Jiayi Tong
Mackenzie Edmondson
Martijn J. Schumie
Yong Chen
+ PDF Chat Heterogeneity-aware and communication-efficient distributed statistical inference 2021 Rui Duan
Yang Ning
Yong Chen
+ Linear Mixed Models: Part I 2021 Jiming Jiang
Thuan Nguyen