Phase transitions for support recovery under local differential privacy
Cristina Butucea
CREST - ENSAE, Palaiseau, FranceAmandine Dubois
CREST - ENSAI, Bruz, FranceAdrien Saumard
CREST - ENSAI, Bruz, France
Abstract
We address the problem of variable selection in a high-dimensional but sparse mean model, under the additional constraint that only privatized data are available for inference. The original data are vectors with independent entries having a symmetric, strongly log-concave distribution on . For this purpose, we adopt a recent generalization of classical minimax theory to the framework of local -differential privacy. We provide lower and upper bounds on the rate of convergence for the expected Hamming loss over classes of at most -sparse vectors whose non-zero coordinates are separated from by a constant . As corollaries, we derive necessary and sufficient conditions (up to log factors) for exact recovery and for almost full recovery. When we restrict our attention to non-interactive mechanisms that act independently on each coordinate our lower bound shows that, contrary to the non-private setting, both exact and almost full recovery are impossible whatever the value of in the high-dimensional regime such that . However, in the regime we can exhibit a critical value (up to a logarithmic factor) such that exact and almost full recovery are possible for all and impossible for . We show that these results can be improved when allowing for all non-interactive (that act globally on all coordinates) locally -differentially private mechanisms in the sense that phase transitions occur at lower levels.
Cite this article
Cristina Butucea, Amandine Dubois, Adrien Saumard, Phase transitions for support recovery under local differential privacy. Math. Stat. Learn. 6 (2023), no. 1/2, pp. 1–50
DOI 10.4171/MSL/37