Limiting sensitive values in an anonymized table while reducing information loss via p‐proportion

The p‐proportion model bounds the proportion of sensitive values of a sensitive attribute in each equivalence class of an anonymized database table in order to limit the ability of a user to link an individual or entity to a sensitive value in that table. Nonsensitive values are not subject to any s...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Security and privacy 2022-05, Vol.5 (3), p.n/a
Hauptverfasser: Dosselmann, Richard, Hamilton, Howard J.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The p‐proportion model bounds the proportion of sensitive values of a sensitive attribute in each equivalence class of an anonymized database table in order to limit the ability of a user to link an individual or entity to a sensitive value in that table. Nonsensitive values are not subject to any such constraints, which reduces the amount of anonymization needed to meet the requirements of this model. This leads to less information loss in an anonymized table. Anonymization is performed using an extension of the Mondrian algorithm that incorporates categorical attributes. Known as the adapted Mondrian algorithm, it generalizes a value of a categorical attribute to a set. Existing algorithms, by comparison, replace one value of a predefined hierarchy by another. The p‐proportion model is compared against the (α,k)‐anonymity model using both the progressive local recoding and (adapted) Mondrian algorithms. Experiments demonstrate the advantage of p‐proportion and Mondrian over (α,k)‐anonymity and progressive local recoding in terms of reduced information loss, measured using the normalized certainty penalty, discernibility metric, and classification metric.
ISSN:2475-6725
2475-6725
DOI:10.1002/spy2.202