C-A5-04: A Simple, Accurate SAS Algorithm for Electronic Abstraction of Race from Digitized Progress Notes
Background and Aims: Individual-level race/ethnicity is important for research into causes and consequences of health disparities. For various non-research reasons, it has rarely been collected on enrollees in integrated delivery systems. Individual-level race/ethnicity can be found in medical recor...
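Published in: | Clinical medicine & research 2010-12, Vol.8 (3-4), p.188-188 |
---|---|
Main authors: | Roblin, D. ; Joski, P. ; Ren, J. ; Farmer, R. ; Baldwin, D. ; Carrell, D. ; Hart, G. ; Pardee, R. ; Bachman, D. |
Format: | Article |
Language: | eng |
Subjects: | SELECTED ABSTRACTS - HMORN 2010: Virtual Data Warehouse |
Online access: | Full text |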
container_end_page | 188 |
---|---|
container_issue | 3-4 |
container_start_page | 188 |
container_title | Clinical medicine & research |
container_volume | 8 |
creator | Roblin, D. ; Joski, P. ; Ren, J. ; Farmer, R. ; Baldwin, D. ; Carrell, D. ; Hart, G. ; Pardee, R. ; Bachman, D. |
description | Background and Aims:
Individual-level race/ethnicity is important for research into causes and consequences of health disparities. For various non-research reasons, it has rarely been collected on enrollees in integrated delivery systems. Individual-level race/ethnicity can be found in medical record documentation. Manual abstraction on large numbers of medical records is costly. We developed a simple SAS algorithm for electronic abstraction of white and African American race from digitized progress notes and evaluated its accuracy by comparing electronically abstracted race with other data sources.
Methods:
A simple SAS algorithm, based on text search strings (e.g. white male, African American woman), scanned digitized progress notes for provider face-to-face visits from 2005 through July 2009 in Kaiser Permanente Georgia’s (KPG) and Group Health Cooperative’s (GHC) electronic medical record systems. White and African American race was abstracted. If the patient had more than 1 visit with abstracted race, the patient was classified using the earliest visit. Abstracted race was linked at the individual-level to survey datasets with self-reported race (2005 survey of working age adults, 2007 survey of adults with hypertension, 2000–2005 Medicare surveys) and mother’s race on 2000–2006 birth certificates. White and African American race was abstracted from GHC progress notes from 2005 through July 2009 using the same algorithm and compared to self-reported race on health risk appraisals. Accuracy of the SAS algorithm was assessed by overall proportion matching race from the other datasets, Cohen’s kappa, and McNemar’s test.
Results:
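White or African American race was electronically abstracted for 56,261 KPG and 6,427 GHC enrollees. Abstracted race matched race from the other datasets in 97–99% of enrollees. Cohen’s kappas were highly significant (p<0.05), ranging from 0.939 ± 0.013 (N=657 matches with hypertension survey records) to 0.994 ± 0.006 (N=518 matches with Medicare surveys). McNemar’s tests were marginally significant for several datasets, and misclassification was not systematically biased toward white or African American race.
Conclusions:
The SAS algorithm was highly accurate in electronically abstracting white and African American race from digitized progress notes of provider visits at KPG and GHC. We are expanding the evaluation to include additional sites and additional race/ethnic categories (e.g. Asian, Hispanic). |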
doi_str_mv | 10.3121/cmr.2010.943.c-a5-04 |
format | Article |
fullrecord | <record><control><sourceid>pubmedcentral_cross</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3006602</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>pubmedcentral_primary_oai_pubmedcentral_nih_gov_3006602</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1794-1ad00d15b35d9c65dd57d4b0bfdfa626522f18f5c2338c1acabf3682886f66053</originalsourceid><addsrcrecordid>eNpVkElLA0EQhQdRcP0HHvrowYm9TjoehCHGBUTF6Lno6WXSMpMO3aNBf70dIoKnqqLqvXp8RXFK8IgRSi50H0cU52nC2UiXSpSY7xQHRAheVmQ82d30bFJyIul-cZjSO8ZMUDY-KN6nZb05v0Q1mvt-1dlzVGv9EdVg0byeo7prQ_TDokcuRDTrrB5iWHqN6iYNUenBhyUKDr0obZGLoUfXvvWD_7YGPcfQRpsSegyDTcfFnlNdsie_9ah4u5m9Tu_Kh6fb-2n9UOoclZdEGYwNEQ0TZqIrYYwYG97gxhmnKloJSh2RTmjKmNREadU4VkkqZeWqCgt2VFxtfVcfTW-Ntsucs4NV9L2KXxCUh_-bpV9AGz6BYZwNaDbgWwMdQ0rRuj8twbABDhk4bIBDBg4alADMs-xsK1v4drH20ULqVdflXxTW67UEBhyIlNCwHyCkhDk</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>C-A5-04: A Simple, Accurate SAS Algorithm for Electronic Abstraction of Race from Digitized Progress Notes</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><creator>Roblin, D. ; Joski, P. ; Ren, J. ; Farmer, R. ; Baldwin, D. ; Carrell, D. ; Hart, G. ; Pardee, R. ; Bachman, D.</creator><creatorcontrib>Roblin, D. ; Joski, P. ; Ren, J. ; Farmer, R. ; Baldwin, D. ; Carrell, D. ; Hart, G. ; Pardee, R. ; Bachman, D.</creatorcontrib><description>Background and Aims:
Individual-level race/ethnicity is important for research into causes and consequences of health disparities. For various non-research reasons, it has rarely been collected on enrollees in integrated delivery systems. Individual-level race/ethnicity can be found in medical record documentation. Manual abstraction on large numbers of medical records is costly. We developed a simple SAS algorithm for electronic abstraction of white and African American race from digitized progress notes and evaluated its accuracy by comparing electronically abstracted race with other data sources.
Methods:
A simple SAS algorithm, based on text search strings (e.g. white male, African American woman), scanned digitized progress notes for provider face-to-face visits from 2005 through July 2009 in Kaiser Permanente Georgia’s (KPG) and Group Health Cooperative’s (GHC) electronic medical record systems. White and African American race was abstracted. If the patient had more than 1 visit with abstracted race, the patient was classified using the earliest visit. Abstracted race was linked at the individual-level to survey datasets with self-reported race (2005 survey of working age adults, 2007 survey of adults with hypertension, 2000–2005 Medicare surveys) and mother’s race on 2000–2006 birth certificates. White and African American race was abstracted from GHC progress notes from 2005 through July 2009 using the same algorithm and compared to self-reported race on health risk appraisals. Accuracy of the SAS algorithm was assessed by overall proportion matching race from the other datasets, Cohen’s kappa, and McNemar’s test.
Results:
White or African American race was electronically abstracted for 56,261 KPG and 6,427 GHC enrollees. Abstracted race matched race from the other datasets in 97–99% of enrollees. Cohen’s kappas were highly significant (p<0.05), ranging from 0.939 ± 0.013 (N=657 matches with hypertension survey records) to 0.994 ± 0.006 (N=518 matches with Medicare surveys). McNemar’s tests were marginally significant for several datasets; and, misclassification was not systematically biased toward white or African American race.
Conclusions:
The SAS algorithm was highly accurate in electronically abstracting white and African American race from digitized progress notes of provider visits at KPG and GHC. We are expanding the evaluation to include additional sites and additional race/ ethnic categories (e.g. Asian, Hispanic).</description><identifier>ISSN: 1539-4182</identifier><identifier>EISSN: 1554-6179</identifier><identifier>DOI: 10.3121/cmr.2010.943.c-a5-04</identifier><language>eng</language><publisher>Marshfield Clinic</publisher><subject>SELECTED ABSTRACTS - HMORN 2010: Virtual Data Warehouse</subject><ispartof>Clinical medicine & research, 2010-12, Vol.8 (3-4), p.188-188</ispartof><rights>2010. Clinical Medicine & Research 2010</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3006602/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3006602/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids></links><sort><creationdate>20101201</creationdate><title>C-A5-04: A Simple, Accurate SAS Algorithm for Electronic Abstraction of Race from Digitized Progress Notes</title><author>Roblin, D. ; Joski, P. ; Ren, J. ; Farmer, R. ; Baldwin, D. ; Carrell, D. ; Hart, G. ; Pardee, R.
; Bachman, D.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1794-1ad00d15b35d9c65dd57d4b0bfdfa626522f18f5c2338c1acabf3682886f66053</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>SELECTED ABSTRACTS - HMORN 2010: Virtual Data Warehouse</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Roblin, D.</creatorcontrib><creatorcontrib>Joski, P.</creatorcontrib><creatorcontrib>Ren, J.</creatorcontrib><creatorcontrib>Farmer, R.</creatorcontrib><creatorcontrib>Baldwin, D.</creatorcontrib><creatorcontrib>Carrell, D.</creatorcontrib><creatorcontrib>Hart, G.</creatorcontrib><creatorcontrib>Pardee, R.</creatorcontrib><creatorcontrib>Bachman, D.</creatorcontrib><collection>CrossRef</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Clinical medicine & research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Roblin, D.</au><au>Joski, P.</au><au>Ren, J.</au><au>Farmer, R.</au><au>Baldwin, D.</au><au>Carrell, D.</au><au>Hart, G.</au><au>Pardee, R.</au><au>Bachman, D.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>C-A5-04: A Simple, Accurate SAS Algorithm for Electronic Abstraction of Race from Digitized Progress Notes</atitle><jtitle>Clinical medicine & research</jtitle><date>2010-12-01</date><risdate>2010</risdate><volume>8</volume><issue>3-4</issue><spage>188</spage><epage>188</epage><pages>188-188</pages><issn>1539-4182</issn><eissn>1554-6179</eissn><pub>Marshfield Clinic</pub><doi>10.3121/cmr.2010.943.c-a5-04</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1539-4182 |
ispartof | Clinical medicine & research, 2010-12, Vol.8 (3-4), p.188-188 |
issn | 1539-4182 1554-6179 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3006602 |
source | Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central |
subjects | SELECTED ABSTRACTS - HMORN 2010: Virtual Data Warehouse |
title | C-A5-04: A Simple, Accurate SAS Algorithm for Electronic Abstraction of Race from Digitized Progress Notes |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T23%3A12%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pubmedcentral_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=C-A5-04:%20A%20Simple,%20Accurate%20SAS%20Algorithm%20for%20Electronic%20Abstraction%20of%20Race%20from%20Digitized%20Progress%20Notes&rft.jtitle=Clinical%20medicine%20&%20research&rft.au=Roblin,%20D.&rft.date=2010-12-01&rft.volume=8&rft.issue=3-4&rft.spage=188&rft.epage=188&rft.pages=188-188&rft.issn=1539-4182&rft.eissn=1554-6179&rft_id=info:doi/10.3121/cmr.2010.943.c-a5-04&rft_dat=%3Cpubmedcentral_cross%3Epubmedcentral_primary_oai_pubmedcentral_nih_gov_3006602%3C/pubmedcentral_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
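
The method summarized in the abstract (text search strings, classification by the earliest visit with an abstracted race, and agreement measured by proportion matching, Cohen's kappa, and McNemar's test) can be illustrated with a minimal Python sketch. This is not the authors' SAS code, which the record does not include. The regular expressions, function names, and sample notes are hypothetical, and leaving a note unclassified when both patterns match is an assumption, since the abstract does not say how such notes were handled.

```python
import re
from datetime import date

# Hypothetical search patterns; the abstract gives only two example strings
# ("white male", "African American woman").
PATTERNS = {
    "white": re.compile(r"\bwhite\s+(male|female|man|woman)\b", re.IGNORECASE),
    "african_american": re.compile(
        r"\bafrican[\s-]+american\s+(male|female|man|woman)\b", re.IGNORECASE
    ),
}


def abstract_race(note_text):
    """Return a race label if exactly one pattern matches the note, else None."""
    matched = [label for label, pat in PATTERNS.items() if pat.search(note_text)]
    # Assumption: notes matching both patterns are treated as ambiguous.
    return matched[0] if len(matched) == 1 else None


def classify_patients(visits):
    """Classify each patient by the earliest visit that yields a race label.

    visits: iterable of (patient_id, visit_date, note_text) tuples.
    """
    result = {}
    for pid, _visit_date, text in sorted(visits, key=lambda v: (v[0], v[1])):
        if pid in result:
            continue  # an earlier visit already classified this patient
        race = abstract_race(text)
        if race is not None:
            result[pid] = race
    return result


def agreement_stats(pairs, labels=("white", "african_american")):
    """Return (proportion matching, Cohen's kappa, McNemar chi-square).

    pairs: list of (abstracted_label, reference_label) over two categories.
    The McNemar statistic is computed without continuity correction.
    """
    n = len(pairs)
    a = sum(1 for x, y in pairs if x == labels[0] and y == labels[0])
    b = sum(1 for x, y in pairs if x == labels[0] and y == labels[1])
    c = sum(1 for x, y in pairs if x == labels[1] and y == labels[0])
    d = sum(1 for x, y in pairs if x == labels[1] and y == labels[1])
    p_obs = (a + d) / n
    # Chance agreement from the row and column marginals of the 2x2 table.
    p_exp = ((a + b) * (a + c) + (c + d) * (b + d)) / n**2
    kappa = (p_obs - p_exp) / (1 - p_exp) if p_exp < 1 else float("nan")
    mcnemar = (b - c) ** 2 / (b + c) if (b + c) else 0.0
    return p_obs, kappa, mcnemar


# Made-up notes for illustration only.
sample_visits = [
    ("p1", date(2006, 3, 1), "55 year old white male with cough"),
    ("p1", date(2005, 1, 10), "Follow-up visit, no complaints"),
    ("p1", date(2007, 1, 1), "African American man seen for refill"),
    ("p2", date(2008, 5, 5), "48 yo African-American woman presents with headache"),
    ("p3", date(2009, 2, 2), "Routine physical"),
]
classified = classify_patients(sample_visits)
```

In the sample data, patient `p1` has three visits; the 2005 note mentions no race, so the 2006 note is the earliest one that yields a label. In SAS, a comparable text search would typically be written with string or Perl-regular-expression functions, and agreement statistics can be requested from `PROC FREQ`; whether the authors used those facilities is not stated in the record.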