Analyzing Preference Data With Local Privacy: Optimal Utility and Enhanced Robustness
Online service providers benefit from collecting and analyzing preference data from users, including both implicit preference data (e.g., watched videos of a user) and explicit preference data (e.g., ranking data over candidates). However, it brings ethical and legal issues of data privacy at the sa...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on knowledge and data engineering 2023-08, Vol.35 (8), p.7753-7767 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Online service providers benefit from collecting and analyzing preference data from users, including both implicit preference data (e.g., watched videos of a user) and explicit preference data (e.g., ranking data over candidates). However, it brings ethical and legal issues of data privacy at the same time. In this paper, we study the problem of aggregating individual's preference data in the local differential privacy (LDP) setting. One naive approach is to add Laplace random noises, which however suffers from low statistical utility and is fragile to LDP-specific poisoning attacks. Therefore, we propose a novel mechanism to improve the utility and the robustness simultaneously: the additive mechanism . The additive mechanism randomly outputs a subset of candidates with a probability proportional to their total scores. For preference data with Borda rule over d d items, its mean squared error bound is optimized from O(\frac{d^{5}}{n\epsilon ^{2}}) O(d5nε2) to O(\frac{d^{4}}{n\epsilon ^{2}}) O(d4nε2) , and its maximum poisoning risk bound is reduced from +\infty +∞ to O(\frac{d^{2}}{n\epsilon }) O(d2nε |
---|---|
ISSN: | 1041-4347 1558-2191 |
DOI: | 10.1109/TKDE.2022.3207486 |