Testing convex truncation

Anindya De; Shivam Nadimpalli; Rocco A. Servedio

doi:10.4171/msl/50

Math. Stat. Learn., Vol. 8, No. 1/2, pp. 1–31

Anindya De
University of Pennsylvania, Philadelphia, USA
Shivam Nadimpalli
Massachusetts Institute of Technology, Cambridge, USA
Rocco A. Servedio
Columbia University, New York, USA

This article is published open access under our Subscribe to Open model.

Abstract

We study the basic statistical problem of testing whether normally distributed $n$-dimensional data has been truncated, i.e., altered by only retaining points that lie in some unknown truncation set $S \subseteq \mathbb{R}^{n}$. As our main algorithmic results,

(1) we give an $O(n)$-sample algorithm that can distinguish the standard normal distribution $N(0, I_{n})$ from $N(0, I_{n})$ conditioned on an unknown and arbitrary convex set $S$;

(2) we give a different $O(n)$-sample algorithm that can distinguish $N(0, I_{n})$ from $N(0, I_{n})$ conditioned on an unknown and arbitrary mixture of symmetric convex sets.

Both our algorithms are computationally efficient and run in $O(n^{2})$ time, which is linear in the size of the input. These results stand in sharp contrast with known results for learning or testing convex bodies with respect to the normal distribution or learning convex-truncated normal distributions, where state-of-the-art algorithms require essentially $n^{O(n)}$ samples. An easy argument shows that no finite number of samples suffices to distinguish $N(0, I_{n})$ from an unknown and arbitrary mixture of general (not necessarily symmetric) convex sets, so no common generalization of results (1) and (2) above is possible. We also prove that any algorithm (computationally efficient or otherwise) that can distinguish $N(0, I_{n})$ from $N(0, I_{n})$ conditioned on an unknown symmetric convex set must use $\Omega(n)$ samples. This shows that the sample complexity of each of our algorithms is optimal up to a constant factor.
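To make the testing problem concrete, the following is a toy sketch and is not the algorithm from the paper, which the abstract does not describe. It assumes one specific symmetric convex truncation set, the slab $\{x : |x_1| \le w\}$, and uses the average squared norm of the samples as an illustrative statistic: under $N(0, I_{n})$ its expectation is $n$, while truncation to a symmetric convex set can only lower it. The function names, the slab width, the threshold and the sample count are all hypothetical choices made for this illustration.

```python
import numpy as np


def sample_truncated_to_slab(rng, m, n, width):
    """Draw m points from N(0, I_n) conditioned on the slab {x : |x_1| <= width}.

    The slab is an assumed example of a symmetric convex truncation set.
    Coordinates are independent, so only the first one needs rejection sampling.
    """
    x = rng.standard_normal((m, n))
    bad = np.abs(x[:, 0]) > width
    while bad.any():
        # Redraw the first coordinate of every rejected point.
        x[bad, 0] = rng.standard_normal(bad.sum())
        bad = np.abs(x[:, 0]) > width
    return x


def looks_truncated(samples, threshold):
    """Illustrative statistic: flag truncation when the mean squared norm
    falls below n - threshold (the threshold is a hypothetical tuning value)."""
    n = samples.shape[1]
    mean_sq_norm = float(np.mean(np.sum(samples**2, axis=1)))
    return mean_sq_norm < n - threshold


rng = np.random.default_rng(0)
n = 20
m = 400 * n  # number of samples grows linearly with the dimension

plain = rng.standard_normal((m, n))
truncated = sample_truncated_to_slab(rng, m, n, width=0.5)

print("untruncated flagged:", looks_truncated(plain, threshold=0.46))
print("truncated flagged:  ", looks_truncated(truncated, threshold=0.46))
```

Computing the statistic touches each of the $m \cdot n$ input entries once, so with $m$ proportional to $n$ the running time of this toy check is quadratic in $n$, i.e. linear in the input size.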

Cite this article

Anindya De, Shivam Nadimpalli, Rocco A. Servedio, Testing convex truncation. Math. Stat. Learn. 8 (2025), no. 1/2, pp. 1–31

DOI 10.4171/MSL/50