Machine learning based natural language processing of radiology reports in orthopaedic trauma

• BERT Natural Language Processing (NLP) outperforms traditional ML and rule-based classifiers when applied to radiology reports in orthopaedic trauma. • Traditional ML classification performance is mostly determined by the extracted features rather than classifier-type, and more complex features are n...

Detailed description

Bibliographic details
Published in:	Computer methods and programs in biomedicine 2021-09, Vol.208, p.106304-106304, Article 106304
Main authors:	Olthof, A.W., Shouche, P., Fennema, E.M., IJpma, F.F.A., Koolstra, R.H.C., Stirler, V.M.A., van Ooijen, P.M.A., Cornelissen, L.J.
Format:	Article
Language:	eng
Subjects:	Informatics Machine learning MeSH Natural language processing Orthopaedic trauma Radiology
Online access:	Full text

container_end_page	106304
container_issue
container_start_page	106304
container_title	Computer methods and programs in biomedicine
container_volume	208
creator	Olthof, A.W. Shouche, P. Fennema, E.M. IJpma, F.F.A. Koolstra, R.H.C. Stirler, V.M.A. van Ooijen, P.M.A. Cornelissen, L.J.
description	• BERT natural language processing (NLP) outperforms traditional ML and rule-based classifiers when applied to radiology reports in orthopaedic trauma. • Traditional ML classification performance is determined mostly by the extracted features rather than by the classifier type, and more complex features are not always better than simple methods. • Positivity rate assessment and automated label generation from radiology reports are feasible with NLP. This study compares different machine learning (ML) natural language processing (NLP) methods for classifying radiology reports in orthopaedic trauma for the presence of injuries. Assessing NLP performance is a prerequisite for downstream tasks and is therefore important from a clinical perspective (avoiding missed injuries, quality checks, insight into diagnostic yield) as well as from a research perspective (identification of patient cohorts, annotation of radiographs). Datasets of Dutch radiology reports of injured extremities (n = 2469, 33% fractures) and chest radiographs (n = 799, 20% pneumothorax) were collected in two different hospitals and labeled by radiologists and trauma surgeons for the presence or absence of injuries. NLP classification was applied and optimized by testing different preprocessing steps and different classifiers (rule-based, ML, and Bidirectional Encoder Representations from Transformers (BERT)). Performance was assessed by F1-score, AUC, sensitivity, specificity and accuracy. The deep-learning-based BERT model outperformed all other classification methods assessed. It achieved an F1-score of (95 ± 2)% and an accuracy of (96 ± 1)% on the dataset of simple reports (n = 2469), and an F1-score of (83 ± 7)% with an accuracy of (93 ± 2)% on the dataset of complex reports (n = 799). BERT NLP outperforms traditional ML and rule-based classifiers when applied to Dutch radiology reports in orthopaedic trauma.
doi_str_mv	10.1016/j.cmpb.2021.106304
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2557539570</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0169260721003783</els_id><sourcerecordid>2557539570</sourcerecordid><originalsourceid>FETCH-LOGICAL-c377t-fe8efed826630b7bc378a2f53597b5288862a804c12432c418f67a5c1a38f4f63</originalsourceid><addsrcrecordid>eNp9kEtLxDAUhYMoOD7-gKss3XRM0uYx4EbEFyhudCnhNr0ZM3SamrSC_94M49rVhcM5l3M-Qi44W3LG1dVm6bZjuxRM8CKomjUHZMGNFpWWSh6SRTGtKqGYPiYnOW8YY0JKtSAfL-A-w4C0R0hDGNa0hYwdHWCaE_S0h2E9wxrpmKLDnHeO6GmCLsQ-rn9owjGmKdMw0HI_4wjYBUenBPMWzsiRhz7j-d89Je_3d2-3j9Xz68PT7c1z5Wqtp8qjQY-dEao0b3VbVAPCy1qudCuFMUYJMKxxXDS1cA03XmmQjkNtfONVfUou939Ly68Z82S3ITvsS3uMc7Zlq5b1SmpWrGJvdSnmnNDbMYUtpB_Lmd2xtBu7Y2l3LO2eZQld70NYRnwHTDa7gIMrUxO6yXYx_Bf_BSsEfds</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2557539570</pqid></control><display><type>article</type><title>Machine learning based natural language processing of radiology reports in orthopaedic trauma</title><source>Access via ScienceDirect (Elsevier)</source><creator>Olthof, A.W. ; Shouche, P. ; Fennema, E.M. ; IJpma, F.F.A. ; Koolstra, R.H.C. ; Stirler, V.M.A. ; van Ooijen, P.M.A. ; Cornelissen, L.J.</creator><creatorcontrib>Olthof, A.W. ; Shouche, P. ; Fennema, E.M. ; IJpma, F.F.A. ; Koolstra, R.H.C. ; Stirler, V.M.A. ; van Ooijen, P.M.A. ; Cornelissen, L.J.</creatorcontrib><description>•BERT Natural Language Processing (NLP) outperforms traditional ML and rule-based classifiers when applied to radiology reports in orthopaedic trauma.•Traditional ML classification performance is mostly determined by the extracted features rather than classifier-type, and more complex features are not always better than simple methods.•Positivity rate assessment and automated label generation from radiology reports are feasible with NLP. 
To compare different Machine Learning (ML) Natural Language Processing (NLP) methods to classify radiology reports in orthopaedic trauma for the presence of injuries. Assessing NLP performance is a prerequisite for downstream tasks and therefore of importance from a clinical perspective (avoiding missed injuries, quality check, insight in diagnostic yield) as well as from a research perspective (identification of patient cohorts, annotation of radiographs). Datasets of Dutch radiology reports of injured extremities (n = 2469, 33% fractures) and chest radiographs (n = 799, 20% pneumothorax) were collected in two different hospitals and labeled by radiologists and trauma surgeons for the presence or absence of injuries. NLP classification was applied and optimized by testing different preprocessing steps and different classifiers (Rule-based, ML, and Bidirectional Encoder Representations from Transformers (BERT)). Performance was assessed by F1-score, AUC, sensitivity, specificity and accuracy. The deep learning based BERT model outperforms all other classification methods which were assessed. The model achieved an F1-score of (95 ± 2)% and accuracy of (96 ± 1)% on a dataset of simple reports (n= 2469), and an F1 of (83 ± 7)% with accuracy (93 ± 2)% on a dataset of complex reports (n= 799). 
BERT NLP outperforms traditional ML and rule-base classifiers when applied to Dutch radiology reports in orthopaedic trauma.</description><identifier>ISSN: 0169-2607</identifier><identifier>EISSN: 1872-7565</identifier><identifier>DOI: 10.1016/j.cmpb.2021.106304</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Informatics ; Machine learning ; MeSH ; Natural language processing ; Orthopaedic trauma ; Radiology</subject><ispartof>Computer methods and programs in biomedicine, 2021-09, Vol.208, p.106304-106304, Article 106304</ispartof><rights>2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c377t-fe8efed826630b7bc378a2f53597b5288862a804c12432c418f67a5c1a38f4f63</citedby><cites>FETCH-LOGICAL-c377t-fe8efed826630b7bc378a2f53597b5288862a804c12432c418f67a5c1a38f4f63</cites><orcidid>0000-0002-8995-1210 ; 0000-0003-0745-0074</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.cmpb.2021.106304$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids></links><search><creatorcontrib>Olthof, A.W.</creatorcontrib><creatorcontrib>Shouche, P.</creatorcontrib><creatorcontrib>Fennema, E.M.</creatorcontrib><creatorcontrib>IJpma, F.F.A.</creatorcontrib><creatorcontrib>Koolstra, R.H.C.</creatorcontrib><creatorcontrib>Stirler, V.M.A.</creatorcontrib><creatorcontrib>van Ooijen, P.M.A.</creatorcontrib><creatorcontrib>Cornelissen, L.J.</creatorcontrib><title>Machine learning based natural language processing of radiology reports in orthopaedic trauma</title><title>Computer methods and programs in biomedicine</title><description>•BERT Natural Language Processing (NLP) outperforms traditional ML and rule-based classifiers when applied to radiology 
reports in orthopaedic trauma.•Traditional ML classification performance is mostly determined by the extracted features rather than classifier-type, and more complex features are not always better than simple methods.•Positivity rate assessment and automated label generation from radiology reports are feasible with NLP. To compare different Machine Learning (ML) Natural Language Processing (NLP) methods to classify radiology reports in orthopaedic trauma for the presence of injuries. Assessing NLP performance is a prerequisite for downstream tasks and therefore of importance from a clinical perspective (avoiding missed injuries, quality check, insight in diagnostic yield) as well as from a research perspective (identification of patient cohorts, annotation of radiographs). Datasets of Dutch radiology reports of injured extremities (n = 2469, 33% fractures) and chest radiographs (n = 799, 20% pneumothorax) were collected in two different hospitals and labeled by radiologists and trauma surgeons for the presence or absence of injuries. NLP classification was applied and optimized by testing different preprocessing steps and different classifiers (Rule-based, ML, and Bidirectional Encoder Representations from Transformers (BERT)). Performance was assessed by F1-score, AUC, sensitivity, specificity and accuracy. The deep learning based BERT model outperforms all other classification methods which were assessed. The model achieved an F1-score of (95 ± 2)% and accuracy of (96 ± 1)% on a dataset of simple reports (n= 2469), and an F1 of (83 ± 7)% with accuracy (93 ± 2)% on a dataset of complex reports (n= 799). 
BERT NLP outperforms traditional ML and rule-base classifiers when applied to Dutch radiology reports in orthopaedic trauma.</description><subject>Informatics</subject><subject>Machine learning</subject><subject>MeSH</subject><subject>Natural language processing</subject><subject>Orthopaedic trauma</subject><subject>Radiology</subject><issn>0169-2607</issn><issn>1872-7565</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kEtLxDAUhYMoOD7-gKss3XRM0uYx4EbEFyhudCnhNr0ZM3SamrSC_94M49rVhcM5l3M-Qi44W3LG1dVm6bZjuxRM8CKomjUHZMGNFpWWSh6SRTGtKqGYPiYnOW8YY0JKtSAfL-A-w4C0R0hDGNa0hYwdHWCaE_S0h2E9wxrpmKLDnHeO6GmCLsQ-rn9owjGmKdMw0HI_4wjYBUenBPMWzsiRhz7j-d89Je_3d2-3j9Xz68PT7c1z5Wqtp8qjQY-dEao0b3VbVAPCy1qudCuFMUYJMKxxXDS1cA03XmmQjkNtfONVfUou939Ly68Z82S3ITvsS3uMc7Zlq5b1SmpWrGJvdSnmnNDbMYUtpB_Lmd2xtBu7Y2l3LO2eZQld70NYRnwHTDa7gIMrUxO6yXYx_Bf_BSsEfds</recordid><startdate>202109</startdate><enddate>202109</enddate><creator>Olthof, A.W.</creator><creator>Shouche, P.</creator><creator>Fennema, E.M.</creator><creator>IJpma, F.F.A.</creator><creator>Koolstra, R.H.C.</creator><creator>Stirler, V.M.A.</creator><creator>van Ooijen, P.M.A.</creator><creator>Cornelissen, L.J.</creator><general>Elsevier B.V</general><scope>6I.</scope><scope>AAFTH</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-8995-1210</orcidid><orcidid>https://orcid.org/0000-0003-0745-0074</orcidid></search><sort><creationdate>202109</creationdate><title>Machine learning based natural language processing of radiology reports in orthopaedic trauma</title><author>Olthof, A.W. ; Shouche, P. ; Fennema, E.M. ; IJpma, F.F.A. ; Koolstra, R.H.C. ; Stirler, V.M.A. ; van Ooijen, P.M.A. 
; Cornelissen, L.J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c377t-fe8efed826630b7bc378a2f53597b5288862a804c12432c418f67a5c1a38f4f63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Informatics</topic><topic>Machine learning</topic><topic>MeSH</topic><topic>Natural language processing</topic><topic>Orthopaedic trauma</topic><topic>Radiology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Olthof, A.W.</creatorcontrib><creatorcontrib>Shouche, P.</creatorcontrib><creatorcontrib>Fennema, E.M.</creatorcontrib><creatorcontrib>IJpma, F.F.A.</creatorcontrib><creatorcontrib>Koolstra, R.H.C.</creatorcontrib><creatorcontrib>Stirler, V.M.A.</creatorcontrib><creatorcontrib>van Ooijen, P.M.A.</creatorcontrib><creatorcontrib>Cornelissen, L.J.</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Computer methods and programs in biomedicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Olthof, A.W.</au><au>Shouche, P.</au><au>Fennema, E.M.</au><au>IJpma, F.F.A.</au><au>Koolstra, R.H.C.</au><au>Stirler, V.M.A.</au><au>van Ooijen, P.M.A.</au><au>Cornelissen, L.J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Machine learning based natural language processing of radiology reports in orthopaedic trauma</atitle><jtitle>Computer methods and programs in biomedicine</jtitle><date>2021-09</date><risdate>2021</risdate><volume>208</volume><spage>106304</spage><epage>106304</epage><pages>106304-106304</pages><artnum>106304</artnum><issn>0169-2607</issn><eissn>1872-7565</eissn><abstract>•BERT Natural Language Processing (NLP) outperforms traditional ML 
and rule-based classifiers when applied to radiology reports in orthopaedic trauma.•Traditional ML classification performance is mostly determined by the extracted features rather than classifier-type, and more complex features are not always better than simple methods.•Positivity rate assessment and automated label generation from radiology reports are feasible with NLP. To compare different Machine Learning (ML) Natural Language Processing (NLP) methods to classify radiology reports in orthopaedic trauma for the presence of injuries. Assessing NLP performance is a prerequisite for downstream tasks and therefore of importance from a clinical perspective (avoiding missed injuries, quality check, insight in diagnostic yield) as well as from a research perspective (identification of patient cohorts, annotation of radiographs). Datasets of Dutch radiology reports of injured extremities (n = 2469, 33% fractures) and chest radiographs (n = 799, 20% pneumothorax) were collected in two different hospitals and labeled by radiologists and trauma surgeons for the presence or absence of injuries. NLP classification was applied and optimized by testing different preprocessing steps and different classifiers (Rule-based, ML, and Bidirectional Encoder Representations from Transformers (BERT)). Performance was assessed by F1-score, AUC, sensitivity, specificity and accuracy. The deep learning based BERT model outperforms all other classification methods which were assessed. The model achieved an F1-score of (95 ± 2)% and accuracy of (96 ± 1)% on a dataset of simple reports (n= 2469), and an F1 of (83 ± 7)% with accuracy (93 ± 2)% on a dataset of complex reports (n= 799). 
BERT NLP outperforms traditional ML and rule-base classifiers when applied to Dutch radiology reports in orthopaedic trauma.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.cmpb.2021.106304</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-8995-1210</orcidid><orcidid>https://orcid.org/0000-0003-0745-0074</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0169-2607
ispartof	Computer methods and programs in biomedicine, 2021-09, Vol.208, p.106304-106304, Article 106304
issn	0169-2607 1872-7565
language	eng
recordid	cdi_proquest_miscellaneous_2557539570
source	Access via ScienceDirect (Elsevier)
subjects	Informatics Machine learning MeSH Natural language processing Orthopaedic trauma Radiology
title	Machine learning based natural language processing of radiology reports in orthopaedic trauma
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T17%3A53%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Machine%20learning%20based%20natural%20language%20processing%20of%20radiology%20reports%20in%20orthopaedic%20trauma&rft.jtitle=Computer%20methods%20and%20programs%20in%20biomedicine&rft.au=Olthof,%20A.W.&rft.date=2021-09&rft.volume=208&rft.spage=106304&rft.epage=106304&rft.pages=106304-106304&rft.artnum=106304&rft.issn=0169-2607&rft.eissn=1872-7565&rft_id=info:doi/10.1016/j.cmpb.2021.106304&rft_dat=%3Cproquest_cross%3E2557539570%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2557539570&rft_id=info:pmid/&rft_els_id=S0169260721003783&rfr_iscdi=true
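
The description field above lists F1-score, sensitivity, specificity and accuracy as evaluation metrics for the binary report labels (injury present or absent). As a minimal sketch of how these four metrics follow from confusion-matrix counts, the Python snippet below computes them for a toy example. The function name `binary_metrics`, the `BinaryMetrics` container and the toy labels are assumptions made for illustration; they are not taken from the article, and the numbers do not reproduce its results. AUC is left out because it requires predicted scores rather than hard labels.

```python
from dataclasses import dataclass


@dataclass
class BinaryMetrics:
    sensitivity: float
    specificity: float
    accuracy: float
    f1: float


def binary_metrics(y_true, y_pred):
    """Compute metrics for binary labels (1 = injury present, 0 = absent).

    Hypothetical helper for illustration; not the authors' evaluation code.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Confusion-matrix counts.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    def safe_div(num, den):
        # Return 0.0 instead of raising when a denominator is empty.
        return num / den if den else 0.0

    return BinaryMetrics(
        sensitivity=safe_div(tp, tp + fn),        # recall on positive reports
        specificity=safe_div(tn, tn + fp),        # recall on negative reports
        accuracy=safe_div(tp + tn, len(y_true)),
        f1=safe_div(2 * tp, 2 * tp + fp + fn),    # harmonic mean of precision and recall
    )


# Toy labels, invented for this example only.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 1]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

With imbalanced classes, such as the 20% pneumothorax rate stated in the description, accuracy can stay high while F1 drops, which is one reason to report both.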