Temporal Relation Extraction in Clinical Texts: A Systematic Review

Unstructured data in electronic health records, represented by clinical texts, are a vast source of healthcare information because they describe a patient's journey, including clinical findings, procedures, and information about the continuity of care. The publication of several studies on temp...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ACM computing surveys 2022-09, Vol.54 (7), p.1-36, Article 144
Hauptverfasser: Gumiel, Yohan Bonescki, Silva e Oliveira, Lucas Emanuel, Claveau, Vincent, Grabar, Natalia, Paraiso, Emerson Cabrera, Moro, Claudia, Carvalho, Deborah Ribeiro
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 36
container_issue 7
container_start_page 1
container_title ACM computing surveys
container_volume 54
creator Gumiel, Yohan Bonescki
Silva e Oliveira, Lucas Emanuel
Claveau, Vincent
Grabar, Natalia
Paraiso, Emerson Cabrera
Moro, Claudia
Carvalho, Deborah Ribeiro
description Unstructured data in electronic health records, represented by clinical texts, are a vast source of healthcare information because they describe a patient's journey, including clinical findings, procedures, and information about the continuity of care. The publication of several studies on temporal relation extraction from clinical texts during the last decade and the realization of multiple shared tasks highlight the importance of this research theme. Therefore, we propose a review of temporal relation extraction in clinical texts. We analyzed 105 articles and verified that relations between events and document creation time, a coarse temporality type, were addressed with traditional machine learning–based models with few recent initiatives to push the state-of-the-art with deep learning–based models. For temporal relations between entities (event and temporal expressions) in the document, factors such as dataset imbalance because of candidate pair generation and task complexity directly affect the system's performance. The state-of-the-art resides on attention-based models, with contextualized word representations being fine-tuned for temporal relation extraction. However, further experiments and advances in the research topic are required until real-time clinical domain applications are released. Furthermore, most of the publications mainly reside on the same dataset, hindering the need for new annotation projects that provide datasets for different medical specialties, clinical text types, and even languages.
doi_str_mv 10.1145/3462475
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2697714359</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2697714359</sourcerecordid><originalsourceid>FETCH-LOGICAL-a164t-c4b5fcaaa1fe045cf9b3dddcc71ac43981f3539251be1f979704a6059c296ab3</originalsourceid><addsrcrecordid>eNo90E1Lw0AQBuBFFKxVvHsKePCUOpP9co8SWhUKguQeJptdSMlH3U2h_nujqZ5m4H2YgZexW4QVopCPXKhMaHnGFiilTjUXeM4WwBWkwAEu2VWMOwDIBKoFWxWu2w-B2uTDtTQ2Q5-sj2Mg-7s2fZK3Td_YKS_ccYzX7MJTG93NaS5ZsVkX-Wu6fX95y5-3KaESY2pFJb0lIvQOhLTeVLyua2s1khXcPKHnkptMYuXQG200CFIgjc2Mooov2f18dh-Gz4OLY7kbDqGfPpaZMlqj4NJM6mFWNgwxBufLfWg6Cl8lQvnTRXnqYpJ3syTb_aO_8Btnnld9</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2697714359</pqid></control><display><type>article</type><title>Temporal Relation Extraction in Clinical Texts: A Systematic Review</title><source>ACM Digital Library Complete</source><creator>Gumiel, Yohan Bonescki ; Silva e Oliveira, Lucas Emanuel ; Claveau, Vincent ; Grabar, Natalia ; Paraiso, Emerson Cabrera ; Moro, Claudia ; Carvalho, Deborah Ribeiro</creator><creatorcontrib>Gumiel, Yohan Bonescki ; Silva e Oliveira, Lucas Emanuel ; Claveau, Vincent ; Grabar, Natalia ; Paraiso, Emerson Cabrera ; Moro, Claudia ; Carvalho, Deborah Ribeiro</creatorcontrib><description>Unstructured data in electronic health records, represented by clinical texts, are a vast source of healthcare information because they describe a patient's journey, including clinical findings, procedures, and information about the continuity of care. The publication of several studies on temporal relation extraction from clinical texts during the last decade and the realization of multiple shared tasks highlight the importance of this research theme. Therefore, we propose a review of temporal relation extraction in clinical texts. We analyzed 105 articles and verified that relations between events and document creation time, a coarse temporality type, were addressed with traditional machine learning–based models with few recent initiatives to push the state-of-the-art with deep learning–based models. For temporal relations between entities (event and temporal expressions) in the document, factors such as dataset imbalance because of candidate pair generation and task complexity directly affect the system's performance. The state-of-the-art resides on attention-based models, with contextualized word representations being fine-tuned for temporal relation extraction. However, further experiments and advances in the research topic are required until real-time clinical domain applications are released. Furthermore, most of the publications mainly reside on the same dataset, hindering the need for new annotation projects that provide datasets for different medical specialties, clinical text types, and even languages.</description><identifier>ISSN: 0360-0300</identifier><identifier>EISSN: 1557-7341</identifier><identifier>DOI: 10.1145/3462475</identifier><language>eng</language><publisher>New York, NY, USA: ACM</publisher><subject>Annotations ; Applied computing ; Artificial intelligence ; Computer science ; Computing methodologies ; Datasets ; Deep learning ; Documents ; Electronic health records ; Health informatics ; Information extraction ; Life and medical sciences ; Machine learning ; Natural language processing ; Task complexity ; Texts ; Unstructured data</subject><ispartof>ACM computing surveys, 2022-09, Vol.54 (7), p.1-36, Article 144</ispartof><rights>ACM</rights><rights>Copyright Association for Computing Machinery Sep 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-a164t-c4b5fcaaa1fe045cf9b3dddcc71ac43981f3539251be1f979704a6059c296ab3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://dl.acm.org/doi/pdf/10.1145/3462475$$EPDF$$P50$$Gacm$$Hfree_for_read</linktopdf><link.rule.ids>314,776,780,2276,27901,27902,40172,75970</link.rule.ids></links><search><creatorcontrib>Gumiel, Yohan Bonescki</creatorcontrib><creatorcontrib>Silva e Oliveira, Lucas Emanuel</creatorcontrib><creatorcontrib>Claveau, Vincent</creatorcontrib><creatorcontrib>Grabar, Natalia</creatorcontrib><creatorcontrib>Paraiso, Emerson Cabrera</creatorcontrib><creatorcontrib>Moro, Claudia</creatorcontrib><creatorcontrib>Carvalho, Deborah Ribeiro</creatorcontrib><title>Temporal Relation Extraction in Clinical Texts: A Systematic Review</title><title>ACM computing surveys</title><addtitle>ACM CSUR</addtitle><description>Unstructured data in electronic health records, represented by clinical texts, are a vast source of healthcare information because they describe a patient's journey, including clinical findings, procedures, and information about the continuity of care. The publication of several studies on temporal relation extraction from clinical texts during the last decade and the realization of multiple shared tasks highlight the importance of this research theme. Therefore, we propose a review of temporal relation extraction in clinical texts. We analyzed 105 articles and verified that relations between events and document creation time, a coarse temporality type, were addressed with traditional machine learning–based models with few recent initiatives to push the state-of-the-art with deep learning–based models. For temporal relations between entities (event and temporal expressions) in the document, factors such as dataset imbalance because of candidate pair generation and task complexity directly affect the system's performance. The state-of-the-art resides on attention-based models, with contextualized word representations being fine-tuned for temporal relation extraction. However, further experiments and advances in the research topic are required until real-time clinical domain applications are released. Furthermore, most of the publications mainly reside on the same dataset, hindering the need for new annotation projects that provide datasets for different medical specialties, clinical text types, and even languages.</description><subject>Annotations</subject><subject>Applied computing</subject><subject>Artificial intelligence</subject><subject>Computer science</subject><subject>Computing methodologies</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Documents</subject><subject>Electronic health records</subject><subject>Health informatics</subject><subject>Information extraction</subject><subject>Life and medical sciences</subject><subject>Machine learning</subject><subject>Natural language processing</subject><subject>Task complexity</subject><subject>Texts</subject><subject>Unstructured data</subject><issn>0360-0300</issn><issn>1557-7341</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNo90E1Lw0AQBuBFFKxVvHsKePCUOpP9co8SWhUKguQeJptdSMlH3U2h_nujqZ5m4H2YgZexW4QVopCPXKhMaHnGFiilTjUXeM4WwBWkwAEu2VWMOwDIBKoFWxWu2w-B2uTDtTQ2Q5-sj2Mg-7s2fZK3Td_YKS_ccYzX7MJTG93NaS5ZsVkX-Wu6fX95y5-3KaESY2pFJb0lIvQOhLTeVLyua2s1khXcPKHnkptMYuXQG200CFIgjc2Mooov2f18dh-Gz4OLY7kbDqGfPpaZMlqj4NJM6mFWNgwxBufLfWg6Cl8lQvnTRXnqYpJ3syTb_aO_8Btnnld9</recordid><startdate>20220930</startdate><enddate>20220930</enddate><creator>Gumiel, Yohan Bonescki</creator><creator>Silva e Oliveira, Lucas Emanuel</creator><creator>Claveau, Vincent</creator><creator>Grabar, Natalia</creator><creator>Paraiso, Emerson Cabrera</creator><creator>Moro, Claudia</creator><creator>Carvalho, Deborah Ribeiro</creator><general>ACM</general><general>Association for Computing Machinery</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20220930</creationdate><title>Temporal Relation Extraction in Clinical Texts</title><author>Gumiel, Yohan Bonescki ; Silva e Oliveira, Lucas Emanuel ; Claveau, Vincent ; Grabar, Natalia ; Paraiso, Emerson Cabrera ; Moro, Claudia ; Carvalho, Deborah Ribeiro</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a164t-c4b5fcaaa1fe045cf9b3dddcc71ac43981f3539251be1f979704a6059c296ab3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Annotations</topic><topic>Applied computing</topic><topic>Artificial intelligence</topic><topic>Computer science</topic><topic>Computing methodologies</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Documents</topic><topic>Electronic health records</topic><topic>Health informatics</topic><topic>Information extraction</topic><topic>Life and medical sciences</topic><topic>Machine learning</topic><topic>Natural language processing</topic><topic>Task complexity</topic><topic>Texts</topic><topic>Unstructured data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gumiel, Yohan Bonescki</creatorcontrib><creatorcontrib>Silva e Oliveira, Lucas Emanuel</creatorcontrib><creatorcontrib>Claveau, Vincent</creatorcontrib><creatorcontrib>Grabar, Natalia</creatorcontrib><creatorcontrib>Paraiso, Emerson Cabrera</creatorcontrib><creatorcontrib>Moro, Claudia</creatorcontrib><creatorcontrib>Carvalho, Deborah Ribeiro</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>ACM computing surveys</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gumiel, Yohan Bonescki</au><au>Silva e Oliveira, Lucas Emanuel</au><au>Claveau, Vincent</au><au>Grabar, Natalia</au><au>Paraiso, Emerson Cabrera</au><au>Moro, Claudia</au><au>Carvalho, Deborah Ribeiro</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Temporal Relation Extraction in Clinical Texts: A Systematic Review</atitle><jtitle>ACM computing surveys</jtitle><stitle>ACM CSUR</stitle><date>2022-09-30</date><risdate>2022</risdate><volume>54</volume><issue>7</issue><spage>1</spage><epage>36</epage><pages>1-36</pages><artnum>144</artnum><issn>0360-0300</issn><eissn>1557-7341</eissn><abstract>Unstructured data in electronic health records, represented by clinical texts, are a vast source of healthcare information because they describe a patient's journey, including clinical findings, procedures, and information about the continuity of care. The publication of several studies on temporal relation extraction from clinical texts during the last decade and the realization of multiple shared tasks highlight the importance of this research theme. Therefore, we propose a review of temporal relation extraction in clinical texts. We analyzed 105 articles and verified that relations between events and document creation time, a coarse temporality type, were addressed with traditional machine learning–based models with few recent initiatives to push the state-of-the-art with deep learning–based models. For temporal relations between entities (event and temporal expressions) in the document, factors such as dataset imbalance because of candidate pair generation and task complexity directly affect the system's performance. The state-of-the-art resides on attention-based models, with contextualized word representations being fine-tuned for temporal relation extraction. However, further experiments and advances in the research topic are required until real-time clinical domain applications are released. Furthermore, most of the publications mainly reside on the same dataset, hindering the need for new annotation projects that provide datasets for different medical specialties, clinical text types, and even languages.</abstract><cop>New York, NY, USA</cop><pub>ACM</pub><doi>10.1145/3462475</doi><tpages>36</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0360-0300
ispartof ACM computing surveys, 2022-09, Vol.54 (7), p.1-36, Article 144
issn 0360-0300
1557-7341
language eng
recordid cdi_proquest_journals_2697714359
source ACM Digital Library Complete
subjects Annotations
Applied computing
Artificial intelligence
Computer science
Computing methodologies
Datasets
Deep learning
Documents
Electronic health records
Health informatics
Information extraction
Life and medical sciences
Machine learning
Natural language processing
Task complexity
Texts
Unstructured data
title Temporal Relation Extraction in Clinical Texts: A Systematic Review
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T14%3A47%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Temporal%20Relation%20Extraction%20in%20Clinical%20Texts:%20A%20Systematic%20Review&rft.jtitle=ACM%20computing%20surveys&rft.au=Gumiel,%20Yohan%20Bonescki&rft.date=2022-09-30&rft.volume=54&rft.issue=7&rft.spage=1&rft.epage=36&rft.pages=1-36&rft.artnum=144&rft.issn=0360-0300&rft.eissn=1557-7341&rft_id=info:doi/10.1145/3462475&rft_dat=%3Cproquest_cross%3E2697714359%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2697714359&rft_id=info:pmid/&rfr_iscdi=true