Adaptative generalisation over a value hierarchy for the k-anonymisation of Origin-Destination matrices
The study of transportation relies on mobility data, containing information on the whereabouts and movements of individuals in a study area. These data are often represented in the simple form of origin–destination (OD)-matrices, which are a valuable indicator for the management of transportation ne...
Gespeichert in:
Veröffentlicht in: | Transportation research. Part C, Emerging technologies Emerging technologies, 2023-09, Vol.154, p.104236, Article 104236 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The study of transportation relies on mobility data, containing information on the whereabouts and movements of individuals in a study area. These data are often represented in the simple form of origin–destination (OD)-matrices, which are a valuable indicator for the management of transportation networks but also present risks to the privacy of the individuals. In particular, the significant size (i.e., number of distinct flows) and high number of modalities (produced by a high-resolution zoning) of OD-matrices call for an adapted, fast algorithm that can efficiently anonymise them. In this paper, we develop a lightweight approach for the k-anonymisation of OD-matrices that exploits the low dimension of the data to explore a larger solution space than regular generalisation algorithms, while keeping relevant restrictions of the search space in order to be scalable on matrices with high number of flows. We apply it to a variety of real-world large-scale O-D matrices collected by the New York City Taxi and Limousine Commission and derived from the Data for Development (D4D) challenge organised by Orange in Senegal and Côte d’Ivoire. Compared to an extensive benchmark of regular generalisation algorithms and mobility anonymisation state-of-the-art, we show that our method is 27% more precise and 9 times faster than comparable approaches able to scale on the same datasets.
•We study the anonymisation Origin Destination (OD)-matrices via generalisation and suppression.•We provide an adaptative approach based on a hierarchy of values.•We take advantage of the high number of modalities, but low number of attributes of OD-matrices to efficiently find a solution.•We compare our approach with a variety of solutions from the general-case anonymisation and from the more specific domain of mobility anonymisation.•We illustrate on a variety of datasets that our approach scales better that state of the art while providing a finer generalisation of values. |
---|---|
ISSN: | 0968-090X 1879-2359 |
DOI: | 10.1016/j.trc.2023.104236 |