List Decodable Mean Estimation in Nearly Linear Time

Type: Article

Publication Date: 2020-11-01

Citations: 11

DOI: https://doi.org/10.1109/focs46700.2020.00022

Download PDF

Abstract

Learning from data in the presence of outliers is a fundamental problem in statistics. Until recently, no computationally efficient algorithms were known to compute the mean of a high dimensional distribution under natural assumptions in the presence of even a small fraction of outliers. In this paper, we consider robust statistics in the presence of overwhelming outliers where the majority of the dataset is introduced adversarially. With only an fraction of "in-liers" (clean data) the mean of a distribution is unidentifiable. However, in their influential work, [1] introduces a polynomial time algorithm recovering the mean of distributions with bounded covariance by outputting a succinct list of O(1/α) candidate solutions, one of which is guaranteed to be close to the true distributional mean; a direct analog of `List Decoding' in the theory of error correcting codes. In this work, we develop an algorithm for list decodable mean estimation in the same setting achieving up to constants the information theoretically optimal recovery, optimal sample complexity, and in nearly linear time up to polylogarithmic factors in dimension. Our conceptual innovation is to design a descent style algorithm on a nonconvex landscape, iteratively removing minima to generate a succinct list of solutions. Our runtime bottleneck is a saddle-point optimization for which we design custom primal dual solvers for generalized packing and covering SDP's under Ky-Fan norms, which may be of independent interest. We refer the reader to [2] for the full version of this paper.

Locations

  • arXiv (Cornell University) - View - PDF

Similar Works

Action Title Year Authors
+ List Decodable Mean Estimation in Nearly Linear Time 2020 Yeshwanth Cherapanamjeri
Sidhanth Mohanty
Morris Yau
+ List Decodable Mean Estimation in Nearly Linear Time 2020 Yeshwanth Cherapanamjeri
Sidhanth Mohanty
Morris Yau
+ List-Decodable Sparse Mean Estimation 2022 Shiwei Zeng
Jie Shen
+ List Decodable Subspace Recovery 2020 Prasad Raghavendra
Morris Yau
+ List Decodable Subspace Recovery 2020 Prasad Raghavendra
Morris Yau
+ List Decodable Subspace Recovery 2020 Prasad Raghavendra
Morris Yau
+ High-Dimensional Robust Mean Estimation via Gradient Descent 2020 Yu Cheng
Ilias Diakonikolas
Rong Ge
Mahdi Soltanolkotabi
+ PDF Chat List-Decodable Subspace Recovery: Dimension Independent Error in Polynomial Time 2021 Ainesh Bakshi
Pravesh K. Kothari
+ List-Decodable Subspace Recovery: Dimension Independent Error in Polynomial Time 2020 Ainesh Bakshi
Pravesh K. Kothari
+ List-Decodable Mean Estimation in Nearly-PCA Time 2020 Ilias Diakonikolas
Daniel M. Kane
Daniel Kongsgaard
Jerry Li
Kevin Tian
+ List-Decodable Mean Estimation in Nearly-PCA Time 2020 Ilias Diakonikolas
Daniel M. Kane
Daniel Kongsgaard
Jerry Li
Kevin Tian
+ PDF Chat High-Dimensional Robust Mean Estimation in Nearly-Linear Time 2019 Yu Cheng
Ilias Diakonikolas
Rong Ge
+ PDF Chat Outlier-robust Mean Estimation near the Breakdown Point via Sum-of-Squares 2024 Hongjie Chen
Devarajan Sridharan
David Steurer
+ PDF Chat Clustering mixture models in almost-linear time via list-decodable mean estimation 2022 Ilias Diakonikolas
Daniel M. Kane
Daniel Kongsgaard
Jerry Li
Kevin Tian
+ PDF Chat A Sub-Quadratic Time Algorithm for Robust Sparse Mean Estimation 2024 Ankit Pensia
+ High-Dimensional Robust Mean Estimation in Nearly-Linear Time 2018 Yu Cheng
Ilias Diakonikolas
Rong Ge
+ Robust Estimators in High Dimensions without the Computational Intractability 2016 Ilias Diakonikolas
Gautam Kamath
Daniel M. Kane
Jerry Li
Ankur Moitra
Alistair Stewart
+ Robust Estimators in High Dimensions without the Computational Intractability 2016 Ilias Diakonikolas
Gautam Kamath
Daniel M. Kane
Jerry Li
Ankur Moitra
Alistair Stewart
+ PDF Chat Robust Estimators in High-Dimensions Without the Computational Intractability 2019 Ilias Diakonikolas
Gautam Kamath
Daniel M. Kane
Jerry Li
Ankur Moitra
Alistair Stewart
+ List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering 2022 Ilias Diakonikolas
Daniel M. Kane
Sushrut Karmalkar
Ankit Pensia
Thanasis Pittas