k-Anonymous microaggregation with preservation of statistical dependence

k-Anonymous microaggregation emerges as an essential building block in statistical disclosure control a field concerning the postprocessing of the demographic portion of surveys containing sensitive information in order to safeguard the anonymity of the respondents. Traditionally this form of microa...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information sciences 2016-05, Vol.342, p.1-23
Hauptverfasser: Rebollo-Monedero, David, Forné, Jordi, Soriano, Miguel, Puiggalí Allepuz, Jordi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:k-Anonymous microaggregation emerges as an essential building block in statistical disclosure control a field concerning the postprocessing of the demographic portion of surveys containing sensitive information in order to safeguard the anonymity of the respondents. Traditionally this form of microaggregation has been formulated to characterize both the privacy attained and the inherent information loss due to the aggregation of quasi-identifiers which may otherwise be exploited to reidentify the individuals to which a record in a published database refers. Because the ulterior purposes of such databases involve the analysis of the statistical dependence between demographic attributes and sensitive data we must articulate mechanisms to enable the preservation of the statistical dependence between quasi-identifiers and confidential attributes beyond the mere degradation of the quasi-identifiers alone. This work addresses the problem of k-anonymous microaggregation with preservation of statistical dependence in a formal systematic manner modeling statistical dependence as predictability of the confidential attributes from the perturbed quasi-identifiers. We proceed by introducing a second mean squared error term in a combined Lagrangian cost that enables us to regulate the trade-off between quasi-identifier distortion and the confidential-attribute predictability. A Lagrangian multiplier enables us to gracefully weigh the importance of each of the two competing objectives.
ISSN:0020-0255
1872-6291
DOI:10.1016/j.ins.2016.01.012