Identifying minimally acceptable interpretive performance criteria for screening mammography

To develop criteria to identify thresholds for minimally acceptable physician performance in interpreting screening mammography studies and to profile the impact that implementing these criteria may have on the practice of radiology in the United States. In an institutional review board-approved, HI...

Bibliographic details
Published in:	Radiology 2010-05, Vol.255 (2), p.354-361
Main authors:	Carney, Patricia A; Sickles, Edward A; Monsees, Barbara S; Bassett, Lawrence W; Brenner, R James; Feig, Stephen A; Smith, Robert A; Rosenberg, Robert D; Bogart, T Andrew; Browning, Sally; Barry, Jane W; Kelly, Mary M; Tran, Khai A; Miglioretti, Diana L
Format:	Article
Language:	eng
Subjects:	Biopsy; Breast Neoplasms - diagnostic imaging; Clinical Competence - standards; Female; Humans; Mammography - standards; Mass Screening - standards; Original Research; Predictive Value of Tests; Radiology - standards; Sensitivity and Specificity; United States
Online access:	Full text

container_end_page	361
container_issue	2
container_start_page	354
container_title	Radiology
container_volume	255
creator	Carney, Patricia A; Sickles, Edward A; Monsees, Barbara S; Bassett, Lawrence W; Brenner, R James; Feig, Stephen A; Smith, Robert A; Rosenberg, Robert D; Bogart, T Andrew; Browning, Sally; Barry, Jane W; Kelly, Mary M; Tran, Khai A; Miglioretti, Diana L
description	To develop criteria to identify thresholds for minimally acceptable physician performance in interpreting screening mammography studies and to profile the impact that implementing these criteria may have on the practice of radiology in the United States. In an institutional review board-approved, HIPAA-compliant study, an Angoff approach was used in two phases to set criteria for identifying minimally acceptable interpretive performance at screening mammography as measured by sensitivity, specificity, recall rate, positive predictive value (PPV) of recall (PPV(1)) and of biopsy recommendation (PPV(2)), and cancer detection rate. Performance measures were considered separately. In phase I, a group of 10 expert radiologists considered a hypothetical pool of 100 interpreting physicians and conveyed their cut points of minimally acceptable performance. The experts were informed that a physician's performance falling outside the cut points would result in a recommendation to consider additional training. During each round of scoring, all expert radiologists' cut points were summarized into a mean, median, mode, and range; these were presented back to the group. In phase II, normative data on performance were shown to illustrate the potential impact cut points would have on radiology practice. Rescoring was done until consensus among experts was achieved. Simulation methods were used to estimate the potential impact of performance that improved to acceptable levels if effective additional training was provided. Final cut points to identify low performance were as follows: sensitivity less than 75%, specificity less than 88% or greater than 95%, recall rate less than 5% or greater than 12%, PPV(1) less than 3% or greater than 8%, PPV(2) less than 20% or greater than 40%, and cancer detection rate less than 2.5 per 1000 interpretations. The selected cut points for performance measures would likely result in 18%-28% of interpreting physicians being considered for additional training on the basis of sensitivity and cancer detection rate, while the cut points for specificity, recall, and PPV(1) and PPV(2) would likely affect 34%-49% of practicing interpreters. If underperforming physicians moved into the acceptable range, detection of an additional 14 cancers per 100000 women screened and a reduction in the number of false-positive examinations by 880 per 100000 women screened would be expected. This study identified minimally acceptable performance levels for interpreters of screening mammography studies. Interpreting physicians whose performance falls outside the identified cut points should be reviewed in the context of their specific practice settings and be considered for additional training.
doi_str_mv	10.1148/radiol.10091636
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_2858814</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>733558031</sourcerecordid><originalsourceid>FETCH-LOGICAL-c458t-985e6ce6a6fb542558cdfeffd22ac7a53465c4a442fd88ca8e6dbb1b8247b27a3</originalsourceid><addsrcrecordid>eNpVUctKA0EQHETR-Dh7k7152jjP3clFkOAjEPCiN2Hone2JI_tyZhPI3zsaFT01dFVXV1GEnDM6ZUzqqwC175spo3TGClHskQlTvMyZYGqfTCgVIteSzY7IcYxvlDKpdHlIjjiVTJSKTsjLosZu9G7ru1XW-s630DTbDKzFYYSqwcx3I4Yh4Og3mA0YXB9a6CxmNviEeMjSJos2IHZfItC2_SrA8Lo9JQcOmohn3_OEPN_dPs0f8uXj_WJ-s8xt8jPmM62wsFhA4SoluVLa1g6dqzkHW4ISslBWgpTc1Vpb0FjUVcUqzWVZ8RLECbne6Q7rqsXapkQBGjOElCZsTQ_e_Ec6_2pW_cZwrbRmMglcfguE_n2NcTStjxabBjrs19GUQiRXVLDEvNoxbehjDOh-vzBqPisxu0rMTyXp4uKvuV_-TwfiA_jNjTw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>733558031</pqid></control><display><type>article</type><title>Identifying minimally acceptable interpretive performance criteria for screening mammography</title><source>MEDLINE</source><source>EZB-FREE-00999 freely available EZB journals</source><source>Alma/SFX Local Collection</source><creator>Carney, Patricia A ; Sickles, Edward A ; Monsees, Barbara S ; Bassett, Lawrence W ; Brenner, R James ; Feig, Stephen A ; Smith, Robert A ; Rosenberg, Robert D ; Bogart, T Andrew ; Browning, Sally ; Barry, Jane W ; Kelly, Mary M ; Tran, Khai A ; Miglioretti, Diana L</creator><creatorcontrib>Carney, Patricia A ; Sickles, Edward A ; Monsees, Barbara S ; Bassett, Lawrence W ; Brenner, R James ; Feig, Stephen A ; Smith, Robert A ; Rosenberg, Robert D ; Bogart, T Andrew ; Browning, Sally ; Barry, Jane W ; Kelly, Mary M ; Tran, Khai A ; Miglioretti, Diana L</creatorcontrib><description>To develop criteria to identify thresholds for minimally acceptable physician performance in interpreting screening mammography studies and to profile the impact 
that implementing these criteria may have on the practice of radiology in the United States. In an institutional review board-approved, HIPAA-compliant study, an Angoff approach was used in two phases to set criteria for identifying minimally acceptable interpretive performance at screening mammography as measured by sensitivity, specificity, recall rate, positive predictive value (PPV) of recall (PPV(1)) and of biopsy recommendation (PPV(2)), and cancer detection rate. Performance measures were considered separately. In phase I, a group of 10 expert radiologists considered a hypothetical pool of 100 interpreting physicians and conveyed their cut points of minimally acceptable performance. The experts were informed that a physician's performance falling outside the cut points would result in a recommendation to consider additional training. During each round of scoring, all expert radiologists' cut points were summarized into a mean, median, mode, and range; these were presented back to the group. In phase II, normative data on performance were shown to illustrate the potential impact cut points would have on radiology practice. Rescoring was done until consensus among experts was achieved. Simulation methods were used to estimate the potential impact of performance that improved to acceptable levels if effective additional training was provided. Final cut points to identify low performance were as follows: sensitivity less than 75%, specificity less than 88% or greater than 95%, recall rate less than 5% or greater than 12%, PPV(1) less than 3% or greater than 8%, PPV(2) less than 20% or greater than 40%, and cancer detection rate less than 2.5 per 1000 interpretations. 
The selected cut points for performance measures would likely result in 18%-28% of interpreting physicians being considered for additional training on the basis of sensitivity and cancer detection rate, while the cut points for specificity, recall, and PPV(1) and PPV(2) would likely affect 34%-49% of practicing interpreters. If underperforming physicians moved into the acceptable range, detection of an additional 14 cancers per 100000 women screened and a reduction in the number of false-positive examinations by 880 per 100000 women screened would be expected. This study identified minimally acceptable performance levels for interpreters of screening mammography studies. Interpreting physicians whose performance falls outside the identified cut points should be reviewed in the context of their specific practice settings and be considered for additional training.</description><identifier>ISSN: 0033-8419</identifier><identifier>EISSN: 1527-1315</identifier><identifier>DOI: 10.1148/radiol.10091636</identifier><identifier>PMID: 20413750</identifier><language>eng</language><publisher>United States: Radiological Society of North America, Inc</publisher><subject>Biopsy ; Breast Neoplasms - diagnostic imaging ; Clinical Competence - standards ; Female ; Humans ; Mammography - standards ; Mass Screening - standards ; Original Research ; Predictive Value of Tests ; Radiology - standards ; Sensitivity and Specificity ; United States</subject><ispartof>Radiology, 2010-05, Vol.255 (2), p.354-361</ispartof><rights>RSNA, 
2010</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c458t-985e6ce6a6fb542558cdfeffd22ac7a53465c4a442fd88ca8e6dbb1b8247b27a3</citedby><cites>FETCH-LOGICAL-c458t-985e6ce6a6fb542558cdfeffd22ac7a53465c4a442fd88ca8e6dbb1b8247b27a3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,777,781,882,27905,27906</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/20413750$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Carney, Patricia A</creatorcontrib><creatorcontrib>Sickles, Edward A</creatorcontrib><creatorcontrib>Monsees, Barbara S</creatorcontrib><creatorcontrib>Bassett, Lawrence W</creatorcontrib><creatorcontrib>Brenner, R James</creatorcontrib><creatorcontrib>Feig, Stephen A</creatorcontrib><creatorcontrib>Smith, Robert A</creatorcontrib><creatorcontrib>Rosenberg, Robert D</creatorcontrib><creatorcontrib>Bogart, T Andrew</creatorcontrib><creatorcontrib>Browning, Sally</creatorcontrib><creatorcontrib>Barry, Jane W</creatorcontrib><creatorcontrib>Kelly, Mary M</creatorcontrib><creatorcontrib>Tran, Khai A</creatorcontrib><creatorcontrib>Miglioretti, Diana L</creatorcontrib><title>Identifying minimally acceptable interpretive performance criteria for screening mammography</title><title>Radiology</title><addtitle>Radiology</addtitle><description>To develop criteria to identify thresholds for minimally acceptable physician performance in interpreting screening mammography studies and to profile the impact that implementing these criteria may have on the practice of radiology in the United States. 
In an institutional review board-approved, HIPAA-compliant study, an Angoff approach was used in two phases to set criteria for identifying minimally acceptable interpretive performance at screening mammography as measured by sensitivity, specificity, recall rate, positive predictive value (PPV) of recall (PPV(1)) and of biopsy recommendation (PPV(2)), and cancer detection rate. Performance measures were considered separately. In phase I, a group of 10 expert radiologists considered a hypothetical pool of 100 interpreting physicians and conveyed their cut points of minimally acceptable performance. The experts were informed that a physician's performance falling outside the cut points would result in a recommendation to consider additional training. During each round of scoring, all expert radiologists' cut points were summarized into a mean, median, mode, and range; these were presented back to the group. In phase II, normative data on performance were shown to illustrate the potential impact cut points would have on radiology practice. Rescoring was done until consensus among experts was achieved. Simulation methods were used to estimate the potential impact of performance that improved to acceptable levels if effective additional training was provided. Final cut points to identify low performance were as follows: sensitivity less than 75%, specificity less than 88% or greater than 95%, recall rate less than 5% or greater than 12%, PPV(1) less than 3% or greater than 8%, PPV(2) less than 20% or greater than 40%, and cancer detection rate less than 2.5 per 1000 interpretations. The selected cut points for performance measures would likely result in 18%-28% of interpreting physicians being considered for additional training on the basis of sensitivity and cancer detection rate, while the cut points for specificity, recall, and PPV(1) and PPV(2) would likely affect 34%-49% of practicing interpreters. 
If underperforming physicians moved into the acceptable range, detection of an additional 14 cancers per 100000 women screened and a reduction in the number of false-positive examinations by 880 per 100000 women screened would be expected. This study identified minimally acceptable performance levels for interpreters of screening mammography studies. Interpreting physicians whose performance falls outside the identified cut points should be reviewed in the context of their specific practice settings and be considered for additional training.</description><subject>Biopsy</subject><subject>Breast Neoplasms - diagnostic imaging</subject><subject>Clinical Competence - standards</subject><subject>Female</subject><subject>Humans</subject><subject>Mammography - standards</subject><subject>Mass Screening - standards</subject><subject>Original Research</subject><subject>Predictive Value of Tests</subject><subject>Radiology - standards</subject><subject>Sensitivity and Specificity</subject><subject>United States</subject><issn>0033-8419</issn><issn>1527-1315</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVUctKA0EQHETR-Dh7k7152jjP3clFkOAjEPCiN2Hone2JI_tyZhPI3zsaFT01dFVXV1GEnDM6ZUzqqwC175spo3TGClHskQlTvMyZYGqfTCgVIteSzY7IcYxvlDKpdHlIjjiVTJSKTsjLosZu9G7ru1XW-s630DTbDKzFYYSqwcx3I4Yh4Og3mA0YXB9a6CxmNviEeMjSJos2IHZfItC2_SrA8Lo9JQcOmohn3_OEPN_dPs0f8uXj_WJ-s8xt8jPmM62wsFhA4SoluVLa1g6dqzkHW4ISslBWgpTc1Vpb0FjUVcUqzWVZ8RLECbne6Q7rqsXapkQBGjOElCZsTQ_e_Ec6_2pW_cZwrbRmMglcfguE_n2NcTStjxabBjrs19GUQiRXVLDEvNoxbehjDOh-vzBqPisxu0rMTyXp4uKvuV_-TwfiA_jNjTw</recordid><startdate>20100501</startdate><enddate>20100501</enddate><creator>Carney, Patricia A</creator><creator>Sickles, Edward A</creator><creator>Monsees, Barbara S</creator><creator>Bassett, Lawrence W</creator><creator>Brenner, R James</creator><creator>Feig, Stephen A</creator><creator>Smith, Robert 
A</creator><creator>Rosenberg, Robert D</creator><creator>Bogart, T Andrew</creator><creator>Browning, Sally</creator><creator>Barry, Jane W</creator><creator>Kelly, Mary M</creator><creator>Tran, Khai A</creator><creator>Miglioretti, Diana L</creator><general>Radiological Society of North America, Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20100501</creationdate><title>Identifying minimally acceptable interpretive performance criteria for screening mammography</title><author>Carney, Patricia A ; Sickles, Edward A ; Monsees, Barbara S ; Bassett, Lawrence W ; Brenner, R James ; Feig, Stephen A ; Smith, Robert A ; Rosenberg, Robert D ; Bogart, T Andrew ; Browning, Sally ; Barry, Jane W ; Kelly, Mary M ; Tran, Khai A ; Miglioretti, Diana L</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c458t-985e6ce6a6fb542558cdfeffd22ac7a53465c4a442fd88ca8e6dbb1b8247b27a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Biopsy</topic><topic>Breast Neoplasms - diagnostic imaging</topic><topic>Clinical Competence - standards</topic><topic>Female</topic><topic>Humans</topic><topic>Mammography - standards</topic><topic>Mass Screening - standards</topic><topic>Original Research</topic><topic>Predictive Value of Tests</topic><topic>Radiology - standards</topic><topic>Sensitivity and Specificity</topic><topic>United States</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Carney, Patricia A</creatorcontrib><creatorcontrib>Sickles, Edward A</creatorcontrib><creatorcontrib>Monsees, Barbara S</creatorcontrib><creatorcontrib>Bassett, Lawrence W</creatorcontrib><creatorcontrib>Brenner, R James</creatorcontrib><creatorcontrib>Feig, Stephen 
A</creatorcontrib><creatorcontrib>Smith, Robert A</creatorcontrib><creatorcontrib>Rosenberg, Robert D</creatorcontrib><creatorcontrib>Bogart, T Andrew</creatorcontrib><creatorcontrib>Browning, Sally</creatorcontrib><creatorcontrib>Barry, Jane W</creatorcontrib><creatorcontrib>Kelly, Mary M</creatorcontrib><creatorcontrib>Tran, Khai A</creatorcontrib><creatorcontrib>Miglioretti, Diana L</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Radiology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Carney, Patricia A</au><au>Sickles, Edward A</au><au>Monsees, Barbara S</au><au>Bassett, Lawrence W</au><au>Brenner, R James</au><au>Feig, Stephen A</au><au>Smith, Robert A</au><au>Rosenberg, Robert D</au><au>Bogart, T Andrew</au><au>Browning, Sally</au><au>Barry, Jane W</au><au>Kelly, Mary M</au><au>Tran, Khai A</au><au>Miglioretti, Diana L</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Identifying minimally acceptable interpretive performance criteria for screening mammography</atitle><jtitle>Radiology</jtitle><addtitle>Radiology</addtitle><date>2010-05-01</date><risdate>2010</risdate><volume>255</volume><issue>2</issue><spage>354</spage><epage>361</epage><pages>354-361</pages><issn>0033-8419</issn><eissn>1527-1315</eissn><abstract>To develop criteria to identify thresholds for minimally acceptable physician performance in interpreting screening mammography studies and to profile the impact that implementing these criteria may have on the practice of radiology in the United States. 
In an institutional review board-approved, HIPAA-compliant study, an Angoff approach was used in two phases to set criteria for identifying minimally acceptable interpretive performance at screening mammography as measured by sensitivity, specificity, recall rate, positive predictive value (PPV) of recall (PPV(1)) and of biopsy recommendation (PPV(2)), and cancer detection rate. Performance measures were considered separately. In phase I, a group of 10 expert radiologists considered a hypothetical pool of 100 interpreting physicians and conveyed their cut points of minimally acceptable performance. The experts were informed that a physician's performance falling outside the cut points would result in a recommendation to consider additional training. During each round of scoring, all expert radiologists' cut points were summarized into a mean, median, mode, and range; these were presented back to the group. In phase II, normative data on performance were shown to illustrate the potential impact cut points would have on radiology practice. Rescoring was done until consensus among experts was achieved. Simulation methods were used to estimate the potential impact of performance that improved to acceptable levels if effective additional training was provided. Final cut points to identify low performance were as follows: sensitivity less than 75%, specificity less than 88% or greater than 95%, recall rate less than 5% or greater than 12%, PPV(1) less than 3% or greater than 8%, PPV(2) less than 20% or greater than 40%, and cancer detection rate less than 2.5 per 1000 interpretations. The selected cut points for performance measures would likely result in 18%-28% of interpreting physicians being considered for additional training on the basis of sensitivity and cancer detection rate, while the cut points for specificity, recall, and PPV(1) and PPV(2) would likely affect 34%-49% of practicing interpreters. 
If underperforming physicians moved into the acceptable range, detection of an additional 14 cancers per 100000 women screened and a reduction in the number of false-positive examinations by 880 per 100000 women screened would be expected. This study identified minimally acceptable performance levels for interpreters of screening mammography studies. Interpreting physicians whose performance falls outside the identified cut points should be reviewed in the context of their specific practice settings and be considered for additional training.</abstract><cop>United States</cop><pub>Radiological Society of North America, Inc</pub><pmid>20413750</pmid><doi>10.1148/radiol.10091636</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0033-8419
ispartof	Radiology, 2010-05, Vol.255 (2), p.354-361
issn	0033-8419 1527-1315
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_2858814
source	MEDLINE; EZB-FREE-00999 freely available EZB journals; Alma/SFX Local Collection
subjects	Biopsy; Breast Neoplasms - diagnostic imaging; Clinical Competence - standards; Female; Humans; Mammography - standards; Mass Screening - standards; Original Research; Predictive Value of Tests; Radiology - standards; Sensitivity and Specificity; United States
title	Identifying minimally acceptable interpretive performance criteria for screening mammography
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T08%3A54%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Identifying%20minimally%20acceptable%20interpretive%20performance%20criteria%20for%20screening%20mammography&rft.jtitle=Radiology&rft.au=Carney,%20Patricia%20A&rft.date=2010-05-01&rft.volume=255&rft.issue=2&rft.spage=354&rft.epage=361&rft.pages=354-361&rft.issn=0033-8419&rft.eissn=1527-1315&rft_id=info:doi/10.1148/radiol.10091636&rft_dat=%3Cproquest_pubme%3E733558031%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=733558031&rft_id=info:pmid/20413750&rfr_iscdi=true
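
The final cut points listed in the description field can be read as one acceptable range per performance measure, with each measure checked on its own (the abstract notes that measures were considered separately). The following is a minimal Python sketch of that reading, not the authors' method: the names `CUT_POINTS` and `flag_measures` and the dictionary layout are assumptions made for illustration, and only the numeric thresholds come from the abstract.

```python
# Acceptable ranges (low, high) taken from the abstract's final cut points.
# None means the abstract states no upper bound for that measure.
# Units: percent, except cancer_detection_rate (per 1000 interpretations).
CUT_POINTS = {
    "sensitivity": (75.0, None),
    "specificity": (88.0, 95.0),
    "recall_rate": (5.0, 12.0),
    "ppv1": (3.0, 8.0),
    "ppv2": (20.0, 40.0),
    "cancer_detection_rate": (2.5, None),
}


def flag_measures(measures):
    """Return the names of measures that fall outside the cut points.

    Hypothetical helper: a value is flagged when it is strictly below the
    lower bound or strictly above the upper bound, matching the abstract's
    "less than" / "greater than" wording. Each measure is judged separately.
    """
    flagged = []
    for name, value in measures.items():
        low, high = CUT_POINTS[name]
        if value < low or (high is not None and value > high):
            flagged.append(name)
    return flagged


# Made-up values for one hypothetical interpreting physician.
example = {
    "sensitivity": 80.0,
    "specificity": 96.0,
    "recall_rate": 10.0,
    "ppv1": 4.0,
    "ppv2": 25.0,
    "cancer_detection_rate": 2.0,
}
print(flag_measures(example))
```

For the made-up values above, the sketch flags specificity (96% exceeds the 95% upper cut point) and cancer detection rate (2.0 is below 2.5 per 1000). In the study's terms, a flagged measure would prompt a review in the context of the physician's practice setting and consideration for additional training, not an automatic judgment.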