C-A5-04: A Simple, Accurate SAS Algorithm for Electronic Abstraction of Race from Digitized Progress Notes
Published in: | Clinical medicine & research 2010-12, Vol.8 (3-4), p.188-188 |
---|---|
Format: | Article |
Language: | English |
Abstract: | Background and Aims:
Individual-level race/ethnicity is important for research into the causes and consequences of health disparities. For various non-research reasons, it has rarely been collected on enrollees in integrated delivery systems. Individual-level race/ethnicity can be found in medical record documentation, but manual abstraction of large numbers of medical records is costly. We developed a simple SAS algorithm for electronic abstraction of white and African American race from digitized progress notes and evaluated its accuracy by comparing electronically abstracted race with other data sources.
Methods:
A simple SAS algorithm, based on text search strings (e.g., "white male", "African American woman"), scanned digitized progress notes for provider face-to-face visits from 2005 through July 2009 in Kaiser Permanente Georgia’s (KPG) and Group Health Cooperative’s (GHC) electronic medical record systems. White and African American race was abstracted. If a patient had more than one visit with abstracted race, the patient was classified using the earliest visit. Abstracted race was linked at the individual level to survey datasets with self-reported race (a 2005 survey of working-age adults, a 2007 survey of adults with hypertension, and 2000–2005 Medicare surveys) and to mother’s race on 2000–2006 birth certificates. White and African American race was abstracted from GHC progress notes from 2005 through July 2009 using the same algorithm and compared with self-reported race on health risk appraisals. Accuracy of the SAS algorithm was assessed by the overall proportion matching race from the other datasets, Cohen’s kappa, and McNemar’s test.
Results:
White or African American race was electronically abstracted for 56,261 KPG and 6,427 GHC enrollees. Abstracted race matched race from the other datasets in 97–99% of enrollees. Cohen’s kappas were highly significant (p |
---|---|
ISSN: | 1539-4182 1554-6179 |
DOI: | 10.3121/cmr.2010.943.c-a5-04 |
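
The Methods in the abstract above describe three steps: matching race phrases in note text, keeping the earliest visit with a match, and comparing the result with a reference source. The following is a minimal Python sketch of that general idea, not the authors' SAS program. The record gives only "white male" and "African American woman" as example search strings, so the pattern list, the rule that a note matching both races is left unclassified, and the function names `abstract_race`, `classify_patients`, and `agreement_stats` are all assumptions made for illustration.

```python
import re
from datetime import date

# Assumed sex terms; the abstract only shows "male" and "woman" as examples.
SEX_TERMS = r"(?:male|female|man|woman)"

# Assumed patterns: a race word followed directly by a sex term.
PATTERNS = {
    "White": re.compile(rf"\bwhite\s+{SEX_TERMS}\b", re.IGNORECASE),
    "African American": re.compile(
        rf"\bafrican[\s-]+american\s+{SEX_TERMS}\b", re.IGNORECASE
    ),
}


def abstract_race(note_text):
    """Return the one race label found in a note, else None.

    Returning None when both labels match is an assumption, not a rule
    stated in the abstract.
    """
    hits = [label for label, pattern in PATTERNS.items() if pattern.search(note_text)]
    return hits[0] if len(hits) == 1 else None


def classify_patients(notes):
    """Classify each patient by the earliest visit that yields a race label.

    notes: iterable of (patient_id, visit_date, note_text) tuples.
    """
    earliest = {}
    for patient_id, visit_date, text in notes:
        race = abstract_race(text)
        if race is None:
            continue
        if patient_id not in earliest or visit_date < earliest[patient_id][0]:
            earliest[patient_id] = (visit_date, race)
    return {pid: race for pid, (_, race) in earliest.items()}


def agreement_stats(pairs):
    """Compare abstracted and reference labels on a 2x2 table.

    pairs: list of (abstracted, reference), each "White" or "African American".
    Returns overall agreement, Cohen's kappa, and the McNemar chi-square
    statistic (1 degree of freedom, no continuity correction).
    """
    a = sum(1 for x, y in pairs if x == "White" and y == "White")
    b = sum(1 for x, y in pairs if x == "White" and y == "African American")
    c = sum(1 for x, y in pairs if x == "African American" and y == "White")
    d = sum(1 for x, y in pairs if x == "African American" and y == "African American")
    n = a + b + c + d
    if n == 0:
        raise ValueError("no comparable pairs")
    p_obs = (a + d) / n
    # Agreement expected by chance, from the row and column totals.
    p_exp = ((a + b) * (a + c) + (c + d) * (b + d)) / n**2
    kappa = (p_obs - p_exp) / (1 - p_exp) if p_exp < 1 else float("nan")
    # McNemar's test uses only the discordant cells b and c.
    mcnemar_chi2 = (b - c) ** 2 / (b + c) if (b + c) else 0.0
    return {"agreement": p_obs, "kappa": kappa, "mcnemar_chi2": mcnemar_chi2}
```

In use, `classify_patients` would be fed one tuple per progress note, and its output joined by patient identifier to the survey or birth-certificate race before calling `agreement_stats`. A production version would need a broader, validated phrase list and a reviewed rule for conflicting notes.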