Disease network delineates the disease progression profile of cardiovascular diseases

[Display omitted] •Designed a flexible and robust algorithm to measure the progression rate.•Built a cardiovascular disease network based on 14.3 million patients data.•Screened a series of salient and sufficient features for risk model design.•Discover the imbalanced age group distribution in sever...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of biomedical informatics 2021-03, Vol.115, p.103686-103686, Article 103686
Hauptverfasser: Tang, Zefang, Yu, Yiqin, Ng, Kenney, Sow, Daby, Hu, Jianying, Mei, Jing
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:[Display omitted] •Designed a flexible and robust algorithm to measure the progression rate.•Built a cardiovascular disease network based on 14.3 million patients data.•Screened a series of salient and sufficient features for risk model design.•Discover the imbalanced age group distribution in several diseases. As Electronic Health Records (EHR) data accumulated explosively in recent years, the tremendous amount of patient clinical data provided opportunities to discover real world evidence. In this study, a graphical disease network, named progressive cardiovascular disease network (progCDN), was built to delineate the progression profiles of cardiovascular diseases (CVD). The EHR data of 14.3 million patients with CVD diagnoses were collected for building disease network and further analysis. We applied a new designed method, progression rates (PR), to calculate the progression relationship among different diagnoses. Based on the disease network outcome, 23 disease progression pair were selected to screen for salient features. The network depicted the dominant diseases in CVD development, such as the heart failure and coronary arteriosclerosis. Novel progression relationships were also discovered, such as the progression path from long QT syndrome to major depression. In addition, three age-group progCDNs identified a series of age-associated disease progression paths and important successor diseases with age bias. Furthermore, a list of important features with sufficient abundance and high correlation was extracted for building disease risk models. The PR method designed for identifying the progression relationship could be widely applied in any EHR database due to its flexibility and robust functionality. Meanwhile, researchers could use the progCDN network to validate or explore novel disease relationships in real world data. The first-time interrogation of such a huge CVD patients cohort enabled us to explore the general and age-specific disease progression patterns in CVD development.
ISSN:1532-0464
1532-0480
DOI:10.1016/j.jbi.2021.103686