Multiple Imputation of Missing or Faulty Values Under Linear Constraints

Type: Article

Publication Date: 2014-02-20

Citations: 46

DOI: https://doi.org/10.1080/07350015.2014.885435

Abstract

Many statistical agencies, survey organizations, and research centers collect data that suffer from item nonresponse and erroneous or inconsistent values. These data may be required to satisfy linear constraints, for example, bounds on individual variables and inequalities for ratios or sums of variables. Often these constraints are designed to identify faulty values, which then are blanked and imputed. The data also may exhibit complex distributional features, including nonlinear relationships and highly nonnormal distributions. We present a fully Bayesian, joint model for modeling or imputing data with missing/blanked values under linear constraints that (i) automatically incorporates the constraints in inferences and imputations, and (ii) uses a flexible Dirichlet process mixture of multivariate normal distributions to reflect complex distributional features. Our strategy for estimation is to augment the observed data with draws from a hypothetical population in which the constraints are not present, thereby taking advantage of computationally expedient methods for fitting mixture models. Missing/blanked items are sampled from their posterior distribution using the Hit-and-Run sampler, which guarantees that all imputations satisfy the constraints. We illustrate the approach using manufacturing data from Colombia, examining the potential to preserve joint distributions and a regression from the plant productivity literature. Supplementary materials for this article are available online.

Locations

  • Journal of Business and Economic Statistics - View
  • INDIGO (University of Illinois at Chicago) - View - PDF

Similar Works

Action Title Year Authors
+ Multiple Imputation of Missing or Faulty Values Under Linear Constraints 2014 Hang J. Kim
Jerome P. Reiter
Quanli Wang
Lawrence H. Cox
Alan F. Karr
+ Multiple Imputation of Missing or Faulty Values Under Linear Constraints 2014 Hang J. Kim
Jerome P. Reiter
Quanli Wang
Lawrence H. Cox
Alan F. Karr
+ Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys 2013 Yajuan Si
Jerome P. Reiter
+ Missing Data 2005 Roderick J. A. Little
+ PDF Chat Multiple Imputation of Missing Categorical and Continuous Values via Bayesian Mixture Models With Local Dependence 2016 Jared S. Murray
Jerome P. Reiter
+ Multiple Imputation of Missing Categorical and Continuous Values via Bayesian Mixture Models with Local Dependence 2014 Jared S. Murray
Jerome P. Reiter
+ Multiple Imputation of Missing Categorical and Continuous Values via Bayesian Mixture Models with Local Dependence 2014 Jared S. Murray
Jerome P. Reiter
+ PDF Chat Simultaneous Edit-Imputation for Continuous Microdata 2015 Hang Kim
Lawrence H. Cox
Alan F. Karr
Jerome P. Reiter
Quanli Wang
+ Missing-data imputation 2006 Andrew Gelman
Jennifer Hill
+ Bayesian multiple imputation for large-scale categorical data with structural zeros 2013 Daniel Manrique‐Vallier
Jerome P. Reiter
+ PDF Chat An Empirical Comparison of Multiple Imputation Methods for Categorical Data 2017 Olanrewaju Akande
Li Fan
Jerome P. Reiter
+ Review of Single Imputation and Multiple Imputation Techniques for Handling Missing Values 2023 Kavita Sethia
Anjana Gosain
Jaspreeti Singh
+ General Considerations on Univariate Methods: Single and Multiple Imputation 2023 Matthias Templ
+ Multiple Imputation of Missing Data Using SAS 2014 Steven G. Heeringa
Patricia A. Berglund
Steven G. Heeringa
+ Missing Data 2010 Thomas Lumley
+ Missing Data 2014 Roderick J. A. Little
+ Multiple Imputation For Missing Data 2007
+ A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values 2014 Xiao Zhang
W. John Boscardin
Thomas R. Belin
Xiaohai Wan
Yulei He
Kui Zhang
+ PDF Chat Review for Handling Missing Data with special missing mechanism 2024 Youran Zhou
Sunil Aryal
Mohamed Reda Bouadjenek
+ Multiple Imputation of Missing Data 2024 Ramzi W. Nahhas