Deep learning for electronic health records: A comparative review of multiple deep neural architectures
[Display omitted] •Deep Learning (DL) is becoming the main way to study electronic health records (EHR).•The first comparative review of the key DL architectures used for EHR is carried out.•One of the largest EHR databases, containing data from 4 M people, is introduced.•A set of best practices to...
Gespeichert in:
Veröffentlicht in: | Journal of biomedical informatics 2020-01, Vol.101, p.103337-103337, Article 103337 |
---|---|
Hauptverfasser: | , , , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | [Display omitted]
•Deep Learning (DL) is becoming the main way to study electronic health records (EHR).•The first comparative review of the key DL architectures used for EHR is carried out.•One of the largest EHR databases, containing data from 4 M people, is introduced.•A set of best practices to work with EHR using DL has been shared.•Recurrent DL architectures showed superior flexibility and predictive power.
Despite the recent developments in deep learning models, their applications in clinical decision-support systems have been very limited. Recent digitalisation of health records, however, has provided a great platform for the assessment of the usability of such techniques in healthcare. As a result, the field is starting to see a growing number of research papers that employ deep learning on electronic health records (EHR) for personalised prediction of risks and health trajectories. While this can be a promising trend, vast paper-to-paper variability (from data sources and models they use to the clinical questions they attempt to answer) have hampered the field’s ability to simply compare and contrast such models for a given application of interest. Thus, in this paper, we aim to provide a comparative review of the key deep learning architectures that have been applied to EHR data. Furthermore, we also aim to: (1) introduce and use one of the world’s largest and most complex linked primary care EHR datasets (i.e., Clinical Practice Research Datalink, or CPRD) as a new asset for training such data-hungry models; (2) provide a guideline for working with EHR data for deep learning; (3) share some of the best practices for assessing the “goodness” of deep-learning models in clinical risk prediction; (4) and propose future research ideas for making deep learning models more suitable for the EHR data. Our results highlight the difficulties of working with highly imbalanced datasets, and show that sequential deep learning architectures such as RNN may be more suitable to deal with the temporal nature of EHR. |
---|---|
ISSN: | 1532-0464 1532-0480 |
DOI: | 10.1016/j.jbi.2019.103337 |