Importance partitioning in micro-aggregation

One of the techniques of data holders for the protection of confidentiality of continuous data is that of micro-aggregation. Rather than releasing raw data (individual records), micro-aggregation releases the averages of small groups and thus reduces the risk of identity disclosure. At the same time...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Computational statistics & data analysis 2009-05, Vol.53 (7), p.2439-2445
Hauptverfasser:	Kokolakis, G., Fouskakis, D.
Format:	Artikel
Sprache:	eng
Schlagworte:	Exact sciences and technology General topics Mathematics Multivariate analysis Numerical analysis Numerical analysis. Scientific computation Numerical methods in probability and statistics Probability and statistics Sciences and techniques of general use Statistics
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	One of the techniques of data holders for the protection of confidentiality of continuous data is that of micro-aggregation. Rather than releasing raw data (individual records), micro-aggregation releases the averages of small groups and thus reduces the risk of identity disclosure. At the same time the method implies loss of information and often distorts the data. Thus, the choice of groups is very crucial to minimize the information loss and the data distortion. No exact polynomial algorithms exist up to date for optimal micro-aggregation, and so the usage of heuristic methods is necessary. A heuristic algorithm, based on the notion of importance partitioning, is proposed and it is shown that compared with other micro-aggregation heuristics achieves improved performance.
ISSN:	0167-9473 1872-7352
DOI:	10.1016/j.csda.2008.09.028