BT24 Pseudonymization for artificial intelligence skin lesion datasets: a real-world feasibility study

The use of patient data for artificial intelligence (AI) research should be transparent, rigorous and accountable. In the UK, the General Data Protection Regulation, Data Protection Act 2018 and General Medical Council govern data handling and patients’ rights to privacy. We report on our multistep...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:British journal of dermatology (1951) 2024-06, Vol.191 (Supplement_1), p.i199-i200
Hauptverfasser: Chin, Trisha J M, Chin, Gillian X M, Sutherland, James, Coon, Andrew, Morton, Colin, Fleming, Colin
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page i200
container_issue Supplement_1
container_start_page i199
container_title British journal of dermatology (1951)
container_volume 191
creator Chin, Trisha J M
Chin, Gillian X M
Sutherland, James
Coon, Andrew
Morton, Colin
Fleming, Colin
description The use of patient data for artificial intelligence (AI) research should be transparent, rigorous and accountable. In the UK, the General Data Protection Regulation, Data Protection Act 2018 and General Medical Council govern data handling and patients’ rights to privacy. We report on our multistep pseudonymization protocol for real-world skin lesion datasets, in preparation for research within a trusted research environment (TRE). Firstly, patients referred from primary care are triaged for community locality and imaging centre (CLIC) suitability. There, trained healthcare professionals capture lesion images (dermoscopic, macroscopic and regional) and patient information using a mobile application on trust-certified devices. Training is standardized across all CLIC sites, with specific anonymization training on removing in-frame clothing and jewellery, device positioning, and magnification to minimize identifiable features like eyes, nose and ears. Lesion datasets (paired images and clinical information) are subsequently transferred to an image management system (IMS) hosted on our trust-secured network. Within the IMS, images are manually inspected, and those with identifiable tattoos and piercings are excluded. All regional images are also excluded from transfer to the TRE. Before transfer to the TRE, images undergo a further round of review. Data fields are manually checked for identifiable patient information, patient names are removed, and dates of birth are rounded to 3-month granularity. The job ID, patient’s hospital number, date of clinical episode and responsible photographer are replaced with randomly generated project-specific identifiers. In an initial study period, 658 of 963 (68%) captured lesion datasets have undergone IMS manual inspection. Of these, 24 lesion datasets were excluded for identifiable features, 10 (41%) for more than one-third of the face being visible, 9 (38%) for full iris visibility, and 5 (21%) for tattoos. On breakdown by anatomical location these images were of the face (19, 80%), torso (2, 8%), limbs (2, 8%) and neck (1, 4%). The remaining 634 datasets (96%) were securely transferred to the TRE, where a further 5% were excluded due to potential identifiability. Although full anonymization is desirable, it is usually achieved by aggregating patient data. Pseudonymization, which allows for future reidentification in a secured fashion, strikes the balance between patient data privacy and clinical governance, while retai
doi_str_mv 10.1093/bjd/ljae090.421
format Article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1093_bjd_ljae090_421</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1093_bjd_ljae090_421</sourcerecordid><originalsourceid>FETCH-LOGICAL-c811-233148710719e7bddeffe301d2fb708228ae74dc2a4f987831f843e9b8ac42473</originalsourceid><addsrcrecordid>eNot0D1PwzAUhWELgUQpzKz-A2mvP1o7bFBBQaoEQ_foJr5GLm6CbFco_Hpa0eksr87wMHYvYCagVvN25-ZxhwQ1zLQUF2wi1HJRSaHUJZsAgKmgXqprdpPzDkAoWMCE-aet1Pwj08EN_bgPv1jC0HM_JI6pBB-6gJGHvlCM4ZP6jnj-Cj2PlE-dw4KZSn7gyBNhrH6GFB33hDm0IYYy8lwObrxlVx5jprvzTtn25Xm7eq027-u31eOm6qwQlVRKaGsEGFGTaZ0j70mBcNK3BqyUFslo10nUvrbGKuGtVlS3FjsttVFTNv-_7dKQcyLffKewxzQ2ApqTUnNUas5KzVFJ_QE7oF3s</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>BT24 Pseudonymization for artificial intelligence skin lesion datasets: a real-world feasibility study</title><source>Oxford University Press Journals All Titles (1996-Current)</source><creator>Chin, Trisha J M ; Chin, Gillian X M ; Sutherland, James ; Coon, Andrew ; Morton, Colin ; Fleming, Colin</creator><creatorcontrib>Chin, Trisha J M ; Chin, Gillian X M ; Sutherland, James ; Coon, Andrew ; Morton, Colin ; Fleming, Colin</creatorcontrib><description>The use of patient data for artificial intelligence (AI) research should be transparent, rigorous and accountable. In the UK, the General Data Protection Regulation, Data Protection Act 2018 and General Medical Council govern data handling and patients’ rights to privacy. We report on our multistep pseudonymization protocol for real-world skin lesion datasets, in preparation for research within a trusted research environment (TRE). Firstly, patients referred from primary care are triaged for community locality and imaging centre (CLIC) suitability. There, trained healthcare professionals capture lesion images (dermoscopic, macroscopic and regional) and patient information using a mobile application on trust-certified devices. Training is standardized across all CLIC sites, with specific anonymization training on removing in-frame clothing and jewellery, device positioning, and magnification to minimize identifiable features like eyes, nose and ears. Lesion datasets (paired images and clinical information) are subsequently transferred to an image management system (IMS) hosted on our trust-secured network. Within the IMS, images are manually inspected, and those with identifiable tattoos and piercings are excluded. All regional images are also excluded from transfer to the TRE. Before transfer to the TRE, images undergo a further round of review. Data fields are manually checked for identifiable patient information, patient names are removed, and dates of birth are rounded to 3-month granularity. The job ID, patient’s hospital number, date of clinical episode and responsible photographer are replaced with randomly generated project-specific identifiers. In an initial study period, 658 of 963 (68%) captured lesion datasets have undergone IMS manual inspection. Of these, 24 lesion datasets were excluded for identifiable features, 10 (41%) for more than one-third of the face being visible, 9 (38%) for full iris visibility, and 5 (21%) for tattoos. On breakdown by anatomical location these images were of the face (19, 80%), torso (2, 8%), limbs (2, 8%) and neck (1, 4%). The remaining 634 datasets (96%) were securely transferred to the TRE, where a further 5% were excluded due to potential identifiability. Although full anonymization is desirable, it is usually achieved by aggregating patient data. Pseudonymization, which allows for future reidentification in a secured fashion, strikes the balance between patient data privacy and clinical governance, while retaining a level of granularity sufficient for meaningful analysis. Currently, this protocol is manually intensive with room to partly automate. Use of common standardized protocols will strengthen the public trust in clinical AI.</description><identifier>ISSN: 0007-0963</identifier><identifier>EISSN: 1365-2133</identifier><identifier>DOI: 10.1093/bjd/ljae090.421</identifier><language>eng</language><ispartof>British journal of dermatology (1951), 2024-06, Vol.191 (Supplement_1), p.i199-i200</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Chin, Trisha J M</creatorcontrib><creatorcontrib>Chin, Gillian X M</creatorcontrib><creatorcontrib>Sutherland, James</creatorcontrib><creatorcontrib>Coon, Andrew</creatorcontrib><creatorcontrib>Morton, Colin</creatorcontrib><creatorcontrib>Fleming, Colin</creatorcontrib><title>BT24 Pseudonymization for artificial intelligence skin lesion datasets: a real-world feasibility study</title><title>British journal of dermatology (1951)</title><description>The use of patient data for artificial intelligence (AI) research should be transparent, rigorous and accountable. In the UK, the General Data Protection Regulation, Data Protection Act 2018 and General Medical Council govern data handling and patients’ rights to privacy. We report on our multistep pseudonymization protocol for real-world skin lesion datasets, in preparation for research within a trusted research environment (TRE). Firstly, patients referred from primary care are triaged for community locality and imaging centre (CLIC) suitability. There, trained healthcare professionals capture lesion images (dermoscopic, macroscopic and regional) and patient information using a mobile application on trust-certified devices. Training is standardized across all CLIC sites, with specific anonymization training on removing in-frame clothing and jewellery, device positioning, and magnification to minimize identifiable features like eyes, nose and ears. Lesion datasets (paired images and clinical information) are subsequently transferred to an image management system (IMS) hosted on our trust-secured network. Within the IMS, images are manually inspected, and those with identifiable tattoos and piercings are excluded. All regional images are also excluded from transfer to the TRE. Before transfer to the TRE, images undergo a further round of review. Data fields are manually checked for identifiable patient information, patient names are removed, and dates of birth are rounded to 3-month granularity. The job ID, patient’s hospital number, date of clinical episode and responsible photographer are replaced with randomly generated project-specific identifiers. In an initial study period, 658 of 963 (68%) captured lesion datasets have undergone IMS manual inspection. Of these, 24 lesion datasets were excluded for identifiable features, 10 (41%) for more than one-third of the face being visible, 9 (38%) for full iris visibility, and 5 (21%) for tattoos. On breakdown by anatomical location these images were of the face (19, 80%), torso (2, 8%), limbs (2, 8%) and neck (1, 4%). The remaining 634 datasets (96%) were securely transferred to the TRE, where a further 5% were excluded due to potential identifiability. Although full anonymization is desirable, it is usually achieved by aggregating patient data. Pseudonymization, which allows for future reidentification in a secured fashion, strikes the balance between patient data privacy and clinical governance, while retaining a level of granularity sufficient for meaningful analysis. Currently, this protocol is manually intensive with room to partly automate. Use of common standardized protocols will strengthen the public trust in clinical AI.</description><issn>0007-0963</issn><issn>1365-2133</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNot0D1PwzAUhWELgUQpzKz-A2mvP1o7bFBBQaoEQ_foJr5GLm6CbFco_Hpa0eksr87wMHYvYCagVvN25-ZxhwQ1zLQUF2wi1HJRSaHUJZsAgKmgXqprdpPzDkAoWMCE-aet1Pwj08EN_bgPv1jC0HM_JI6pBB-6gJGHvlCM4ZP6jnj-Cj2PlE-dw4KZSn7gyBNhrH6GFB33hDm0IYYy8lwObrxlVx5jprvzTtn25Xm7eq027-u31eOm6qwQlVRKaGsEGFGTaZ0j70mBcNK3BqyUFslo10nUvrbGKuGtVlS3FjsttVFTNv-_7dKQcyLffKewxzQ2ApqTUnNUas5KzVFJ_QE7oF3s</recordid><startdate>20240628</startdate><enddate>20240628</enddate><creator>Chin, Trisha J M</creator><creator>Chin, Gillian X M</creator><creator>Sutherland, James</creator><creator>Coon, Andrew</creator><creator>Morton, Colin</creator><creator>Fleming, Colin</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20240628</creationdate><title>BT24 Pseudonymization for artificial intelligence skin lesion datasets: a real-world feasibility study</title><author>Chin, Trisha J M ; Chin, Gillian X M ; Sutherland, James ; Coon, Andrew ; Morton, Colin ; Fleming, Colin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c811-233148710719e7bddeffe301d2fb708228ae74dc2a4f987831f843e9b8ac42473</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chin, Trisha J M</creatorcontrib><creatorcontrib>Chin, Gillian X M</creatorcontrib><creatorcontrib>Sutherland, James</creatorcontrib><creatorcontrib>Coon, Andrew</creatorcontrib><creatorcontrib>Morton, Colin</creatorcontrib><creatorcontrib>Fleming, Colin</creatorcontrib><collection>CrossRef</collection><jtitle>British journal of dermatology (1951)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chin, Trisha J M</au><au>Chin, Gillian X M</au><au>Sutherland, James</au><au>Coon, Andrew</au><au>Morton, Colin</au><au>Fleming, Colin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>BT24 Pseudonymization for artificial intelligence skin lesion datasets: a real-world feasibility study</atitle><jtitle>British journal of dermatology (1951)</jtitle><date>2024-06-28</date><risdate>2024</risdate><volume>191</volume><issue>Supplement_1</issue><spage>i199</spage><epage>i200</epage><pages>i199-i200</pages><issn>0007-0963</issn><eissn>1365-2133</eissn><abstract>The use of patient data for artificial intelligence (AI) research should be transparent, rigorous and accountable. In the UK, the General Data Protection Regulation, Data Protection Act 2018 and General Medical Council govern data handling and patients’ rights to privacy. We report on our multistep pseudonymization protocol for real-world skin lesion datasets, in preparation for research within a trusted research environment (TRE). Firstly, patients referred from primary care are triaged for community locality and imaging centre (CLIC) suitability. There, trained healthcare professionals capture lesion images (dermoscopic, macroscopic and regional) and patient information using a mobile application on trust-certified devices. Training is standardized across all CLIC sites, with specific anonymization training on removing in-frame clothing and jewellery, device positioning, and magnification to minimize identifiable features like eyes, nose and ears. Lesion datasets (paired images and clinical information) are subsequently transferred to an image management system (IMS) hosted on our trust-secured network. Within the IMS, images are manually inspected, and those with identifiable tattoos and piercings are excluded. All regional images are also excluded from transfer to the TRE. Before transfer to the TRE, images undergo a further round of review. Data fields are manually checked for identifiable patient information, patient names are removed, and dates of birth are rounded to 3-month granularity. The job ID, patient’s hospital number, date of clinical episode and responsible photographer are replaced with randomly generated project-specific identifiers. In an initial study period, 658 of 963 (68%) captured lesion datasets have undergone IMS manual inspection. Of these, 24 lesion datasets were excluded for identifiable features, 10 (41%) for more than one-third of the face being visible, 9 (38%) for full iris visibility, and 5 (21%) for tattoos. On breakdown by anatomical location these images were of the face (19, 80%), torso (2, 8%), limbs (2, 8%) and neck (1, 4%). The remaining 634 datasets (96%) were securely transferred to the TRE, where a further 5% were excluded due to potential identifiability. Although full anonymization is desirable, it is usually achieved by aggregating patient data. Pseudonymization, which allows for future reidentification in a secured fashion, strikes the balance between patient data privacy and clinical governance, while retaining a level of granularity sufficient for meaningful analysis. Currently, this protocol is manually intensive with room to partly automate. Use of common standardized protocols will strengthen the public trust in clinical AI.</abstract><doi>10.1093/bjd/ljae090.421</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0007-0963
ispartof British journal of dermatology (1951), 2024-06, Vol.191 (Supplement_1), p.i199-i200
issn 0007-0963
1365-2133
language eng
recordid cdi_crossref_primary_10_1093_bjd_ljae090_421
source Oxford University Press Journals All Titles (1996-Current)
title BT24 Pseudonymization for artificial intelligence skin lesion datasets: a real-world feasibility study
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T23%3A48%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=BT24%20Pseudonymization%20for%20artificial%20intelligence%20skin%20lesion%20datasets:%20a%20real-world%20feasibility%20study&rft.jtitle=British%20journal%20of%20dermatology%20(1951)&rft.au=Chin,%20Trisha%20J%20M&rft.date=2024-06-28&rft.volume=191&rft.issue=Supplement_1&rft.spage=i199&rft.epage=i200&rft.pages=i199-i200&rft.issn=0007-0963&rft.eissn=1365-2133&rft_id=info:doi/10.1093/bjd/ljae090.421&rft_dat=%3Ccrossref%3E10_1093_bjd_ljae090_421%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true