GEOMETRIZING RATES OF CONVERGENCE UNDER LOCAL DIFFERENTIAL PRIVACY CONSTRAINTS
We study the problem of estimating a functional θ(P) of an unknown probability distribution P ∈ P in which the original iid sample X1, . . . , Xn is kept private even from the statistician via an α-local differential privacy constraint. Let ω TV denote the modulus of continuity of the functional θ o...
Gespeichert in:
Veröffentlicht in: | The Annals of statistics 2020-10, Vol.48 (5), p.2646-2670 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | We study the problem of estimating a functional θ(P) of an unknown probability distribution P ∈ P in which the original iid sample X1, . . . , Xn
is kept private even from the statistician via an α-local differential privacy constraint. Let
ω
TV denote the modulus of continuity of the functional θ over P with respect to total variation distance. For a large class of loss functions l and a fixed privacy level α, we prove that the privatized minimax risk is equivalent to l(
ω
TV(n
−1/2)) to within constants, under regularity conditions that are satisfied, in particular, if θ is linear and P is convex. Our results complement the theory developed by Donoho and Liu (1991) with the nowadays highly relevant case of privatized data. Somewhat surprisingly, the difficulty of the estimation problem in the private case is characterized by
ω
TV, whereas, it is characterized by the Hellinger modulus of continuity if the original data X1, . . . , Xn
are available. We also find that for locally private estimation of linear functionals over a convex model a simple sample mean estimator, based on independently and binary privatized observations, always achieves the minimax rate. We further provide a general recipe for choosing the functional parameter in the optimal binary privatization mechanisms and illustrate the general theory in numerous examples. Our theory allows us to quantify the price to be paid for local differential privacy in a large class of estimation problems. This price appears to be highly problem specific. |
---|---|
ISSN: | 0090-5364 2168-8966 |
DOI: | 10.1214/19-AOS1901 |