Creating Reusable Annotated Corpora with the Clinical Document Architecture

Manual annotation of clinical documents creates reference standards to train and evaluate natural language processing (NLP) systems, but can also be used in other aspects of patient care and research. Manual annotation is costly and time-consuming and is usually done for specific use cases. We prese...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: DuVall, S L, Boone, K W, Gundlapalli, A, South, B R, Shuying Shen, Nebeker, J R, D'Avolio, L W, Samore, M H
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Manual annotation of clinical documents creates reference standards to train and evaluate natural language processing (NLP) systems, but can also be used in other aspects of patient care and research. Manual annotation is costly and time-consuming and is usually done for specific use cases. We present a practical approach to the storage of annotations along with text using the Clinical Document Architecture (CDA). We describe the types of annotations commonly made on clinical text and show how they map to CDA elements. This provides standard, reusable documents that allow multiple users to modify and add annotations for new or expanded clinical use cases. As clinical texts become more widely available and NLP systems are applied to extract information from the text, annotated corpora will become an increasingly essential resource. We demonstrate how the CDA provides an appropriate mechanism for storing annotated corpora and describe how it supports interoperability and reusability.
ISSN:1530-1605
2572-6862
DOI:10.1109/HICSS.2011.133