Analyzing Preference Data With Local Privacy: Optimal Utility and Enhanced Robustness

Online service providers benefit from collecting and analyzing preference data from users, including both implicit preference data (e.g., watched videos of a user) and explicit preference data (e.g., ranking data over candidates). However, it brings ethical and legal issues of data privacy at the sa...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on knowledge and data engineering 2023-08, Vol.35 (8), p.7753-7767
Hauptverfasser:	Wang, Shaowei, Luo, Xuandi, Qian, Yuqiu, Du, Jiachun, Lin, Wenqing, Yang, Wei
Format:	Artikel
Sprache:	eng
Schlagworte:	Additives Data aggregation Data analysis Data management data poisoning Differential privacy Error reduction Legal issues local privacy Lower bounds Measurement minimax error Minimax technique Poisoning preference data Privacy Robustness Statistical analysis Videos
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Online service providers benefit from collecting and analyzing preference data from users, including both implicit preference data (e.g., watched videos of a user) and explicit preference data (e.g., ranking data over candidates). However, it brings ethical and legal issues of data privacy at the same time. In this paper, we study the problem of aggregating individual's preference data in the local differential privacy (LDP) setting. One naive approach is to add Laplace random noises, which however suffers from low statistical utility and is fragile to LDP-specific poisoning attacks. Therefore, we propose a novel mechanism to improve the utility and the robustness simultaneously: the additive mechanism . The additive mechanism randomly outputs a subset of candidates with a probability proportional to their total scores. For preference data with Borda rule over d d items, its mean squared error bound is optimized from O(\frac{d^{5}}{n\epsilon ^{2}}) O(d5nε2) to O(\frac{d^{4}}{n\epsilon ^{2}}) O(d4nε2) , and its maximum poisoning risk bound is reduced from +\infty +∞ to O(\frac{d^{2}}{n\epsilon }) O(d2nε
ISSN:	1041-4347 1558-2191
DOI:	10.1109/TKDE.2022.3207486