Statistically optimal robust mean and covariance estimation for anisotropic Gaussians
Arshak Minasyan
Université Paris-Saclay, Gif-sur-Yvette, FranceNikita Zhivotovskiy
University of California, Berkeley, USA

Abstract
Assume that is an -contaminated sample of independent Gaussian vectors in with mean and covariance . In the strong -contamination model, we assume that the adversary replaced an fraction of the vectors in the original Gaussian sample with arbitrary vectors. We show that there is an estimator of the mean satisfying, with probability at least , a bound of the form
where is an absolute constant and denotes the operator norm of . In the same contaminated Gaussian setup, we construct an estimator of the covariance matrix that satisfies, with probability at least ,
Both results are optimal up to multiplicative constant factors. Several previously known results were either dimension-dependent and required to be close to identity or had a sub-optimal dependence on the contamination level . As a part of the analysis, we derive sharp concentration inequalities for central order statistics of Gaussian, folded normal, and chi-squared distributions.
Cite this article
Arshak Minasyan, Nikita Zhivotovskiy, Statistically optimal robust mean and covariance estimation for anisotropic Gaussians. Math. Stat. Learn. (2025), published online first
DOI 10.4171/MSL/48