Developing Phenotypes from Electronic Health Records for Chronic Disease Surveillance

ObjectiveTo utilize clinical data in Electronic Health Records (EHRs) to develop chronic disease phenotypes appropriate for conducting population health surveillance.IntroductionChronic diseases, including hypertension, type 2 diabetes mellitus (diabetes), obesity, and hyperlipidemia, are some of th...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Online journal of public health informatics 2019-05, Vol.11 (1)
Hauptverfasser: Conderino, Sarah, Feldman, Justin, Carton, Tom, Thorpe, Lorna
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:ObjectiveTo utilize clinical data in Electronic Health Records (EHRs) to develop chronic disease phenotypes appropriate for conducting population health surveillance.IntroductionChronic diseases, including hypertension, type 2 diabetes mellitus (diabetes), obesity, and hyperlipidemia, are some of the leading causes of morbidity and mortality in the United States. Monitoring disease prevalence guides public health programs and policies that help prevent this burden. EHRs can supplement traditional sources of chronic disease surveillance, such as health surveys and administrative claims datasets, by offering near real-time data, large sample sizes, and a rich source of clinical data. However, few studies have provided clear, consistent EHR phenotypes that were developed to inform population health surveillance.MethodsRetrospective EHR data were obtained for patients seen at New York University Langone Health in 2017 (n=1,397,446). To better estimate chronic disease burden among New York City (NYC) adults, the patient population was limited to NYC residents aged 20 or older, who were seen in the ambulatory primary care setting (n=153,653). Rule-based algorithms for identifying patients with hypertension, statin-eligibility, diabetes, and obesity were developed based on a combination of diagnostic codes, lab results or vitals, and relevant prescriptions. We compared the performance of our metric definitions to selected phenotypes from the literature using percent agreement and Cohen’s kappa. Patients with discordant disease classifications between the two sets of definitions were analyzed through natural language processing (NLP) on the patients’ 2017 medical notes using a support vector machine model. Statin-eligibility is a novel phenotype and therefore did not have a comparable definition in the literature. Sensitivity analyses were conducted to determine how disease burden changed under alternative rules for each metric.ResultsOf 153,653 adult ambulatory care patients in 2017, an estimated 53.7% had hypertension, 12.4% had diabetes, 27.8% were obese, and 30.0% were statin-eligible under our proposed definitions. The estimated prevalence of hypertension increased from 28.1% to 53.7% when diagnostic codes were supplemented with blood pressure measurements and anti-hypertensive medications, while the estimated prevalence of diabetes increased less than one percentage point with inclusion of diabetes-related medications and elevated A1C measurements. There was
ISSN:1947-2579
1947-2579
DOI:10.5210/ojphi.v11i1.9744