Ask a Question

Prefer a chat interface with context about you and your work?

Data Debiasing with Datamodels (D3M): Improving Subgroup Robustness via Data Selection

Data Debiasing with Datamodels (D3M): Improving Subgroup Robustness via Data Selection

Machine learning models can fail on subgroups that are underrepresented during training. While techniques such as dataset balancing can improve performance on underperforming groups, they require access to training group annotations and can end up removing large portions of the dataset. In this paper, we introduce Data Debiasing with Datamodels …