CBN: Constructing a clinical Bayesian network based on data from the electronic medical record

[Display omitted] •Deriving a high-quality topology and ontology from EMR data is viable.•The determination of the relationship type optimizes the input of topology learning.•Combined operation of OR and K2 algorithm aides Bayesian topology construction.•Clinical Bayesian Network assists ontology fo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of biomedical informatics 2018-12, Vol.88, p.1-10
Hauptverfasser: Shen, Ying, Zhang, Lizhu, Zhang, Jin, Yang, Min, Tang, Buzhou, Li, Yaliang, Lei, Kai
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:[Display omitted] •Deriving a high-quality topology and ontology from EMR data is viable.•The determination of the relationship type optimizes the input of topology learning.•Combined operation of OR and K2 algorithm aides Bayesian topology construction.•Clinical Bayesian Network assists ontology for medical probabilistic inference. The process of learning candidate causal relationships involving diseases and symptoms from electronic medical records (EMRs) is the first step towards learning models that perform diagnostic inference directly from real healthcare data. However, the existing diagnostic inference systems rely on knowledge bases such as ontology that are manually compiled through a labour-intensive process or automatically derived using simple pairwise statistics. We explore CBN, a Clinical Bayesian Network construction for medical ontology probabilistic inference, to learn high-quality Bayesian topology and complete ontology directly from EMRs. Specifically, we first extract medical entity relationships from over 10,000 deidentified patient records and adopt the odds ratio (OR value) calculation and the K2 greedy algorithm to automatically construct a Bayesian topology. Then, Bayesian estimation is used for the probability distribution. Finally, we employ a Bayesian network to complete the causal relationship and probability distribution of ontology to enhance the ontology inference capability. By evaluating the learned topology versus the expert opinions of physicians and entropy calculations and by calculating the ontology-based diagnosis classification, our study demonstrates that the direct and automated construction of a high-quality health topology and ontology from medical records is feasible. Our results are reproducible, and we will release the source code and CN-Stroke knowledge graph of this work after publication.1Source code: https://github.com/anonymousBioinformatics/MedSim. Ontology: https://github.com/anonymousAuthor111/CN-Stroke-Knowledge-Graph.git.1
ISSN:1532-0464
1532-0480
DOI:10.1016/j.jbi.2018.10.007