DE-IDENTIFICATION OF PROTECTED INFORMATION

Bibliographic details
Main authors: He, Ze; Jain, Anshul; Viswanathan, Sreramkumar Sitaraman; Carlson, Eric Thomas; Khuntia, Sunil Ranjan; Manakkaparambil Sivanandan, Sreekanth; Sorower, Mohammad Shahed
Format: Patent
Language: eng
Description
Summary: The present disclosure is directed to methods and apparatus for centralized de-identification of protected data associated with subjects. In various embodiments, de-identified data may be received (1102) that includes de-identified data set(s) associated with subject(s) that is generated from raw data set(s) associated with the subjects. Each of the raw data set(s) may include identifying feature(s) that are usable to identify the respective subject. At least some of the identifying feature(s) may be absent from or obfuscated in the de-identified data. Labels associated with each of the de-identified data sets may be determined (1104). At least some of the de-identified data sets may be applied (1108) as input across a trained machine learning model to generate respective outputs, which may be compared (1110) to the labels to determine a measure of vulnerability of the de-identified data to re-identification.
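
The comparison step in the summary (apply de-identified data sets to a trained model, then compare the outputs to the labels) might be sketched as follows. This is a minimal illustration and not the patented method: it assumes the labels are subject identities and that the measure of vulnerability is simply the fraction of data sets for which the model output matches the label. The names `vulnerability_measure` and `toy_model` are hypothetical, and `toy_model` is only a stand-in for a trained machine learning model.

```python
from typing import Callable, Hashable, Sequence


def vulnerability_measure(
    model: Callable[[Sequence[float]], Hashable],
    deidentified_sets: Sequence[Sequence[float]],
    labels: Sequence[Hashable],
) -> float:
    """Return the fraction of de-identified data sets whose model output
    equals the associated label (assumed here to be the subject identity)."""
    if len(deidentified_sets) != len(labels):
        raise ValueError("each de-identified data set needs exactly one label")
    if not labels:
        raise ValueError("at least one de-identified data set is required")
    matches = sum(
        1
        for features, label in zip(deidentified_sets, labels)
        if model(features) == label
    )
    return matches / len(labels)


def toy_model(features: Sequence[float]) -> str:
    # Hypothetical stand-in for a trained model: guesses the subject
    # from the first feature of the de-identified data set.
    return "subject_a" if features[0] < 0.5 else "subject_b"


data = [[0.1, 3.0], [0.9, 1.0], [0.4, 2.0], [0.7, 5.0]]
labels = ["subject_a", "subject_b", "subject_b", "subject_b"]
score = vulnerability_measure(toy_model, data, labels)
print(score)  # 0.75: three of the four data sets were re-identified
```

Under these assumptions a higher score means the de-identified data is easier to re-identify. Other measures could be substituted for simple accuracy.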