Population-aware hierarchical bayesian domain adaptation via multi-component invariant learning

Type: Preprint

Publication Date: 2020-03-20

Citations: 6

DOI: https://doi.org/10.1145/3368555.3384451

Download PDF

Abstract

While machine learning is rapidly being developed and deployed in health settings such as influenza prediction, there are critical challenges in using data from one environment in another due to variability in features; even within disease labels there can be differences (e.g. "fever" may mean something different reported in a doctor's office versus in an online app). Moreover, models are often built on passive, observational data which contain different distributions of population subgroups (e.g. men or women). Thus, there are two forms of instability between environments in this observational transport problem. We first harness knowledge from health to conceptualize the underlying causal structure of this problem in a health outcome prediction task. Based on sources of stability in the model, we posit that for human-sourced data and health prediction tasks we can combine environment and population information in a novel population-aware hierarchical Bayesian domain adaptation framework that harnesses multiple invariant components through population attributes when needed. We study the conditions under which invariant learning fails, leading to reliance on the environment-specific attributes. Experimental results for an influenza prediction task on four datasets gathered from different contexts show the model can improve prediction in the case of largely unlabelled target data from a new environment and different constituent population, by harnessing both environment and population invariant information. This work represents a novel, principled way to address a critical challenge by blending domain (health) knowledge and algorithmic innovation. The proposed approach will have a significant impact in many social settings wherein who and where the data comes from matters.

Locations

  • arXiv (Cornell University) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ Population-aware Hierarchical Bayesian Domain Adaptation 2018 Vishwali Mhasawade
Nabeel Abdur Rehman
Rumi Chunara
+ Selecting Treatment Effects Models for Domain Adaptation Using Causal Knowledge 2021 Trent Kyono
Ioana Bica
Zhaozhi Qian
Mihaela van der Schaar
+ Exploiting Causal Structure for Robust Model Selection in Unsupervised Domain Adaptation 2021 Trent Kyono
Mihaela van der Schaar
+ PDF Chat Selecting Treatment Effects Models for Domain Adaptation Using Causal Knowledge 2023 Trent Kyono
Ioana Bica
Zhaozhi Qian
Mihaela van der Schaar
+ PDF Chat PAGE: Domain-Incremental Adaptation with Past-Agnostic Generative Replay for Smart Healthcare 2024 Chia-Hao Li
Niraj K. Jha
+ Accounting for Unobserved Confounding in Domain Generalization 2021 Alexis Bellot
Mihaela van der Schaar
+ Scalable Causal Domain Adaptation 2021 Mohammad Ali Javidian
Om Pandey
Pooyan Jamshidi
+ PDF Chat Learning When the Concept Shifts: Confounding, Invariance, and Dimension Reduction 2024 Kulunu Dharmakeerthi
YoonHaeng Hur
Tengyuan Liang
+ PDF Chat Scalable Out-of-distribution Robustness in the Presence of Unobserved Confounders 2024 Parjanya Prashant
Seyedeh Baharan Khatami
Bruno Ribeiro
Babak Salimi
+ Domain adaptation under structural causal models 2021 Yuansi Chen
Peter Bühlmann
+ Domain adaptation under structural causal models 2020 Yuansi Chen
Peter Bühlmann
+ Domain Adaptation for Infection Prediction from Symptoms Based on Data from Different Study Designs and Contexts 2018 Nabeel Abdur Rehman
Maxwell Aliapoulios
Disha Umarwani
Rumi Chunara
+ Domain Adaptation in Highly Imbalanced and Overlapping Datasets 2020 Ran Ilan Ber
Tom Haramaty
+ Domain Adaptation in Highly Imbalanced and Overlapping Datasets 2020 Ran Ilan Ber
Tom Haramaty
+ Scalable Causal Transfer Learning 2021 Mohammad Ali Javidian
Om Pandey
Pooyan Jamshidi
+ PDF Chat Regularized Bayesian transfer learning for population-level etiological distributions 2020 Abhirup Datta
Jacob Fiksel
Agbessi Amouzou
Scott L. Zeger
+ Domain Conditional Predictors for Domain Adaptation 2021 João Monteiro
Xavier Gibert
Jianqiao Feng
Vincent Dumoulin
Dar-Shyang Lee
+ Domain Conditional Predictors for Domain Adaptation 2021 João Monteiro
Xavier Gibert
Jianqiao Feng
Vincent Dumoulin
Dar-Shyang Lee
+ Domain Adaptation with Factorizable Joint Shift 2022 Hao He
Yuzhe Yang
Hao Wang
+ Regularized Bayesian transfer learning for population level etiological distributions 2018 Abhirup Datta
Jacob Fiksel
Agbessi Amouzou
Scott L. Zeger