Highdicom: A Python library for standardized encoding of image annotations and machine learning model outputs in pathology and radiology

Machine learning is revolutionizing image-based diagnostics in pathology and radiology. ML models have shown promising results in research settings, but their lack of interoperability has been a major barrier for clinical integration and evaluation. The DICOM a standard specifies Information Object...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2022-05
Hauptverfasser: Bridge, Christopher P, Gorman, Chris, Pieper, Steven, Doyle, Sean W, Lennerz, Jochen K, Kalpathy-Cramer, Jayashree, Clunie, David A, Fedorov, Andriy Y, Herrmann, Markus D
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Bridge, Christopher P
Gorman, Chris
Pieper, Steven
Doyle, Sean W
Lennerz, Jochen K
Kalpathy-Cramer, Jayashree
Clunie, David A
Fedorov, Andriy Y
Herrmann, Markus D
description Machine learning is revolutionizing image-based diagnostics in pathology and radiology. ML models have shown promising results in research settings, but their lack of interoperability has been a major barrier for clinical integration and evaluation. The DICOM a standard specifies Information Object Definitions and Services for the representation and communication of digital images and related information, including image-derived annotations and analysis results. However, the complexity of the standard represents an obstacle for its adoption in the ML community and creates a need for software libraries and tools that simplify working with data sets in DICOM format. Here we present the highdicom library, which provides a high-level application programming interface for the Python programming language that abstracts low-level details of the standard and enables encoding and decoding of image-derived information in DICOM format in a few lines of Python code. The highdicom library ties into the extensive Python ecosystem for image processing and machine learning. Simultaneously, by simplifying creation and parsing of DICOM-compliant files, highdicom achieves interoperability with the medical imaging systems that hold the data used to train and run ML models, and ultimately communicate and store model outputs for clinical use. We demonstrate through experiments with slide microscopy and computed tomography imaging, that, by bridging these two ecosystems, highdicom enables developers to train and evaluate state-of-the-art ML models in pathology and radiology while remaining compliant with the DICOM standard and interoperable with clinical systems at all stages. To promote standardization of ML research and streamline the ML model development and deployment process, we made the library available free and open-source.
doi_str_mv 10.48550/arxiv.2106.07806
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2106_07806</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2541570829</sourcerecordid><originalsourceid>FETCH-LOGICAL-a956-a7cc670d4a9ba0aefd9361b6521844e0c70ec044f30ef6827b1c31767756216b3</originalsourceid><addsrcrecordid>eNotkM9Kw0AQhxdBsNQ-gCcXPKfubvZP4q0UtUJBD72HSXaTbkl24yYR4xP42Kapp5mBj5n5fQjdUbLmiRDkEcK3_VozSuSaqITIK7RgcUyjhDN2g1ZddyKEMKmYEPEC_e5sddS28M0T3uCPsT96h2ubBwgjLn3AXQ9OQ9D2x2hsXOG1dRX2JbYNVAaDc76H3nrXTb3GDRRH6wyuDQR3JhuvTY390LdD32HrcAvTjdpX48wH0HaebtF1CXVnVv91iQ4vz4ftLtq_v75tN_sIUiEjUEUhFdEc0hwImFKnsaS5FIwmnBtSKGIKwnkZE1PKhKmcFjFVUikhGZV5vET3l7WzpqwNU4wwZmdd2axrIh4uRBv852C6Pjv5Ibjpp4wJToUiCUvjP2fAb5E</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2541570829</pqid></control><display><type>article</type><title>Highdicom: A Python library for standardized encoding of image annotations and machine learning model outputs in pathology and radiology</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Bridge, Christopher P ; Gorman, Chris ; Pieper, Steven ; Doyle, Sean W ; Lennerz, Jochen K ; Kalpathy-Cramer, Jayashree ; Clunie, David A ; Fedorov, Andriy Y ; Herrmann, Markus D</creator><creatorcontrib>Bridge, Christopher P ; Gorman, Chris ; Pieper, Steven ; Doyle, Sean W ; Lennerz, Jochen K ; Kalpathy-Cramer, Jayashree ; Clunie, David A ; Fedorov, Andriy Y ; Herrmann, Markus D</creatorcontrib><description>Machine learning is revolutionizing image-based diagnostics in pathology and radiology. ML models have shown promising results in research settings, but their lack of interoperability has been a major barrier for clinical integration and evaluation. The DICOM a standard specifies Information Object Definitions and Services for the representation and communication of digital images and related information, including image-derived annotations and analysis results. However, the complexity of the standard represents an obstacle for its adoption in the ML community and creates a need for software libraries and tools that simplify working with data sets in DICOM format. Here we present the highdicom library, which provides a high-level application programming interface for the Python programming language that abstracts low-level details of the standard and enables encoding and decoding of image-derived information in DICOM format in a few lines of Python code. The highdicom library ties into the extensive Python ecosystem for image processing and machine learning. Simultaneously, by simplifying creation and parsing of DICOM-compliant files, highdicom achieves interoperability with the medical imaging systems that hold the data used to train and run ML models, and ultimately communicate and store model outputs for clinical use. We demonstrate through experiments with slide microscopy and computed tomography imaging, that, by bridging these two ecosystems, highdicom enables developers to train and evaluate state-of-the-art ML models in pathology and radiology while remaining compliant with the DICOM standard and interoperable with clinical systems at all stages. To promote standardization of ML research and streamline the ML model development and deployment process, we made the library available free and open-source.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2106.07806</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Application programming interface ; Computed tomography ; Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Learning ; Decoding ; Digital imaging ; Format ; Freeware ; Image annotation ; Image processing ; Interoperability ; Libraries ; Machine learning ; Medical imaging ; Pathology ; Programming languages ; Python ; Radiology ; Software ; Source code ; Standardization ; State-of-the-art reviews</subject><ispartof>arXiv.org, 2022-05</ispartof><rights>2022. This work is published under http://creativecommons.org/licenses/by-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://creativecommons.org/licenses/by-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,780,881,27904</link.rule.ids><backlink>$$Uhttps://doi.org/10.48550/arXiv.2106.07806$$DView paper in arXiv$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.1007/s10278-022-00683-y$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink></links><search><creatorcontrib>Bridge, Christopher P</creatorcontrib><creatorcontrib>Gorman, Chris</creatorcontrib><creatorcontrib>Pieper, Steven</creatorcontrib><creatorcontrib>Doyle, Sean W</creatorcontrib><creatorcontrib>Lennerz, Jochen K</creatorcontrib><creatorcontrib>Kalpathy-Cramer, Jayashree</creatorcontrib><creatorcontrib>Clunie, David A</creatorcontrib><creatorcontrib>Fedorov, Andriy Y</creatorcontrib><creatorcontrib>Herrmann, Markus D</creatorcontrib><title>Highdicom: A Python library for standardized encoding of image annotations and machine learning model outputs in pathology and radiology</title><title>arXiv.org</title><description>Machine learning is revolutionizing image-based diagnostics in pathology and radiology. ML models have shown promising results in research settings, but their lack of interoperability has been a major barrier for clinical integration and evaluation. The DICOM a standard specifies Information Object Definitions and Services for the representation and communication of digital images and related information, including image-derived annotations and analysis results. However, the complexity of the standard represents an obstacle for its adoption in the ML community and creates a need for software libraries and tools that simplify working with data sets in DICOM format. Here we present the highdicom library, which provides a high-level application programming interface for the Python programming language that abstracts low-level details of the standard and enables encoding and decoding of image-derived information in DICOM format in a few lines of Python code. The highdicom library ties into the extensive Python ecosystem for image processing and machine learning. Simultaneously, by simplifying creation and parsing of DICOM-compliant files, highdicom achieves interoperability with the medical imaging systems that hold the data used to train and run ML models, and ultimately communicate and store model outputs for clinical use. We demonstrate through experiments with slide microscopy and computed tomography imaging, that, by bridging these two ecosystems, highdicom enables developers to train and evaluate state-of-the-art ML models in pathology and radiology while remaining compliant with the DICOM standard and interoperable with clinical systems at all stages. To promote standardization of ML research and streamline the ML model development and deployment process, we made the library available free and open-source.</description><subject>Application programming interface</subject><subject>Computed tomography</subject><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Learning</subject><subject>Decoding</subject><subject>Digital imaging</subject><subject>Format</subject><subject>Freeware</subject><subject>Image annotation</subject><subject>Image processing</subject><subject>Interoperability</subject><subject>Libraries</subject><subject>Machine learning</subject><subject>Medical imaging</subject><subject>Pathology</subject><subject>Programming languages</subject><subject>Python</subject><subject>Radiology</subject><subject>Software</subject><subject>Source code</subject><subject>Standardization</subject><subject>State-of-the-art reviews</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><sourceid>GOX</sourceid><recordid>eNotkM9Kw0AQhxdBsNQ-gCcXPKfubvZP4q0UtUJBD72HSXaTbkl24yYR4xP42Kapp5mBj5n5fQjdUbLmiRDkEcK3_VozSuSaqITIK7RgcUyjhDN2g1ZddyKEMKmYEPEC_e5sddS28M0T3uCPsT96h2ubBwgjLn3AXQ9OQ9D2x2hsXOG1dRX2JbYNVAaDc76H3nrXTb3GDRRH6wyuDQR3JhuvTY390LdD32HrcAvTjdpX48wH0HaebtF1CXVnVv91iQ4vz4ftLtq_v75tN_sIUiEjUEUhFdEc0hwImFKnsaS5FIwmnBtSKGIKwnkZE1PKhKmcFjFVUikhGZV5vET3l7WzpqwNU4wwZmdd2axrIh4uRBv852C6Pjv5Ibjpp4wJToUiCUvjP2fAb5E</recordid><startdate>20220508</startdate><enddate>20220508</enddate><creator>Bridge, Christopher P</creator><creator>Gorman, Chris</creator><creator>Pieper, Steven</creator><creator>Doyle, Sean W</creator><creator>Lennerz, Jochen K</creator><creator>Kalpathy-Cramer, Jayashree</creator><creator>Clunie, David A</creator><creator>Fedorov, Andriy Y</creator><creator>Herrmann, Markus D</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20220508</creationdate><title>Highdicom: A Python library for standardized encoding of image annotations and machine learning model outputs in pathology and radiology</title><author>Bridge, Christopher P ; Gorman, Chris ; Pieper, Steven ; Doyle, Sean W ; Lennerz, Jochen K ; Kalpathy-Cramer, Jayashree ; Clunie, David A ; Fedorov, Andriy Y ; Herrmann, Markus D</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a956-a7cc670d4a9ba0aefd9361b6521844e0c70ec044f30ef6827b1c31767756216b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Application programming interface</topic><topic>Computed tomography</topic><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Learning</topic><topic>Decoding</topic><topic>Digital imaging</topic><topic>Format</topic><topic>Freeware</topic><topic>Image annotation</topic><topic>Image processing</topic><topic>Interoperability</topic><topic>Libraries</topic><topic>Machine learning</topic><topic>Medical imaging</topic><topic>Pathology</topic><topic>Programming languages</topic><topic>Python</topic><topic>Radiology</topic><topic>Software</topic><topic>Source code</topic><topic>Standardization</topic><topic>State-of-the-art reviews</topic><toplevel>online_resources</toplevel><creatorcontrib>Bridge, Christopher P</creatorcontrib><creatorcontrib>Gorman, Chris</creatorcontrib><creatorcontrib>Pieper, Steven</creatorcontrib><creatorcontrib>Doyle, Sean W</creatorcontrib><creatorcontrib>Lennerz, Jochen K</creatorcontrib><creatorcontrib>Kalpathy-Cramer, Jayashree</creatorcontrib><creatorcontrib>Clunie, David A</creatorcontrib><creatorcontrib>Fedorov, Andriy Y</creatorcontrib><creatorcontrib>Herrmann, Markus D</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bridge, Christopher P</au><au>Gorman, Chris</au><au>Pieper, Steven</au><au>Doyle, Sean W</au><au>Lennerz, Jochen K</au><au>Kalpathy-Cramer, Jayashree</au><au>Clunie, David A</au><au>Fedorov, Andriy Y</au><au>Herrmann, Markus D</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Highdicom: A Python library for standardized encoding of image annotations and machine learning model outputs in pathology and radiology</atitle><jtitle>arXiv.org</jtitle><date>2022-05-08</date><risdate>2022</risdate><eissn>2331-8422</eissn><abstract>Machine learning is revolutionizing image-based diagnostics in pathology and radiology. ML models have shown promising results in research settings, but their lack of interoperability has been a major barrier for clinical integration and evaluation. The DICOM a standard specifies Information Object Definitions and Services for the representation and communication of digital images and related information, including image-derived annotations and analysis results. However, the complexity of the standard represents an obstacle for its adoption in the ML community and creates a need for software libraries and tools that simplify working with data sets in DICOM format. Here we present the highdicom library, which provides a high-level application programming interface for the Python programming language that abstracts low-level details of the standard and enables encoding and decoding of image-derived information in DICOM format in a few lines of Python code. The highdicom library ties into the extensive Python ecosystem for image processing and machine learning. Simultaneously, by simplifying creation and parsing of DICOM-compliant files, highdicom achieves interoperability with the medical imaging systems that hold the data used to train and run ML models, and ultimately communicate and store model outputs for clinical use. We demonstrate through experiments with slide microscopy and computed tomography imaging, that, by bridging these two ecosystems, highdicom enables developers to train and evaluate state-of-the-art ML models in pathology and radiology while remaining compliant with the DICOM standard and interoperable with clinical systems at all stages. To promote standardization of ML research and streamline the ML model development and deployment process, we made the library available free and open-source.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2106.07806</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2022-05
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2106_07806
source arXiv.org; Free E- Journals
subjects Application programming interface
Computed tomography
Computer Science - Computer Vision and Pattern Recognition
Computer Science - Learning
Decoding
Digital imaging
Format
Freeware
Image annotation
Image processing
Interoperability
Libraries
Machine learning
Medical imaging
Pathology
Programming languages
Python
Radiology
Software
Source code
Standardization
State-of-the-art reviews
title Highdicom: A Python library for standardized encoding of image annotations and machine learning model outputs in pathology and radiology
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T13%3A36%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Highdicom:%20A%20Python%20library%20for%20standardized%20encoding%20of%20image%20annotations%20and%20machine%20learning%20model%20outputs%20in%20pathology%20and%20radiology&rft.jtitle=arXiv.org&rft.au=Bridge,%20Christopher%20P&rft.date=2022-05-08&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2106.07806&rft_dat=%3Cproquest_arxiv%3E2541570829%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2541570829&rft_id=info:pmid/&rfr_iscdi=true