Unbiased evaluation of predicted gamma passing rate by an event‐mixing technique

Background Predicting models of the gamma passing rate (GPR) have been studied to substitute the measurement‐based gamma analysis. Since these studies used data from different radiotherapy systems comprising TPS, linear accelerator, and detector array, it has been difficult to compare the performanc...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Medical physics (Lancaster) 2024-01, Vol.51 (1), p.5-17
Hauptverfasser: Koganezawa, Akito S, Matsuura, Takaaki, Kawahara, Daisuke, Nakashima, Takeo, Shiba, Eiji, Murakami, Yuji, Nagata, Yasushi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 17
container_issue 1
container_start_page 5
container_title Medical physics (Lancaster)
container_volume 51
creator Koganezawa, Akito S
Matsuura, Takaaki
Kawahara, Daisuke
Nakashima, Takeo
Shiba, Eiji
Murakami, Yuji
Nagata, Yasushi
description Background Predicting models of the gamma passing rate (GPR) have been studied to substitute the measurement‐based gamma analysis. Since these studies used data from different radiotherapy systems comprising TPS, linear accelerator, and detector array, it has been difficult to compare the performances of the predicting models among institutions with different radiotherapy systems. Purpose We aimed to develop unbiased scoring methods to evaluate the performance of the models predicting the GPR, by introducing both best and worst limits for the performance of the GPR prediction. Methods Two hundred head‐and‐neck VMAT plans were used to develop a framework. The GPRs were measured using the ArcCHECK device. The predicted GPR [p] was generated using a deep learning‐based model [pDL]. The predicting model was evaluated using four metrics: standard deviation (SD) [σ], Pearson's correlation coefficient (CC) [r], mean squared error (MSE) [s], and mean absolute error (MAE) [a]. The best limit [σm${\sigma _m}$, rm${r_m}$, sm${s_m}$, and am${a_m}$] was estimated by measuring the SD of measured GPR [m] by shifting the device along the longitudinal direction to measure different sampling points. Mimicked best and worst p’s [pbest and pworst] were generated from pDL. The worst limit was defined such that m and p have no correlation [CC ∼ 0]. The worst limit [σMix, rMix, sMix, and aMix] was generated using the event‐mixing (EM) technique originally introduced in high‐energy physics experiments. The range of σ, r, s, and a was defined to be [σm,σMix]$[ {{\sigma _m},{\sigma _{{\mathrm{Mix}}} ]$, [0,rm]$[ {0,{r_m}} ]$, [sm,sMix]$[ {{s_m},{s_{{\mathrm{Mix}}} ]$, and [am,aMix]$[ {{a_m},{a_{{\mathrm{Mix}}} ]$. The achievement score (AS) independently based on σ, r, s, and a were calculated for pDL, pbest and pworst. The probability that p fails the gamma analysis (alert frequency; AF) was estimated as a function of σd${\sigma _d}$ values within the [σm${\sigma _m}$, σMix] range for the 3%/2 mm data with a 95% criterion. Results SDs of the best limit were well reproduced by σm=0.531100−m${\sigma _m} = \;0.531\sqrt {100 - m} $. The EM technique successfully generated the (m,p)$( {m,p} )$ pairs with no correlation. The AS using four metrics showed good agreement. This agreement indicates successful definitions of both best and worst limits, consistent definitions of the AS, and successful generations of mixed events. The AF for the DL‐based model with the 3%/2 mm tolerance was 31.
doi_str_mv 10.1002/mp.16848
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2894722956</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2894722956</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2828-6551fcba27dd79e75d93777c61676d1ab148bd4e3bc1f160236697f8b0dc963a3</originalsourceid><addsrcrecordid>eNp1kMtKw0AUhgdRbK2CTyBZukmdW2YmSxFvUFHEroe5pY5kkphJ1O58BJ_RJzG1VVeuDpz_O9-BH4BDBKcIQnwSmiligootMMaUk5RimG-DMYQ5TTGF2QjsxfgEIWQkg7tgRMQQZRyOwf280l5FZxP3ospedb6ukrpImtZZb7phv1AhqKRRMfpqkbSqc4leJqoaDlzVfb5_BP-2SjpnHiv_3Lt9sFOoMrqDzZyA-cX5w9lVOru9vD47naUGCyxSlmWoMFphbi3PHc9sTjjnhiHGmUVKIyq0pY5ogwrEICaM5bwQGlqTM6LIBByvvU1bD29jJ4OPxpWlqlzdR4lFTjnGecb-UNPWMbaukE3rg2qXEkG5alCGRn43OKBHG2uvg7O_4E9lA5CugVdfuuW_InlztxZ-AZ_yelc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2894722956</pqid></control><display><type>article</type><title>Unbiased evaluation of predicted gamma passing rate by an event‐mixing technique</title><source>MEDLINE</source><source>Wiley Online Library Journals Frontfile Complete</source><source>Alma/SFX Local Collection</source><creator>Koganezawa, Akito S ; Matsuura, Takaaki ; Kawahara, Daisuke ; Nakashima, Takeo ; Shiba, Eiji ; Murakami, Yuji ; Nagata, Yasushi</creator><creatorcontrib>Koganezawa, Akito S ; Matsuura, Takaaki ; Kawahara, Daisuke ; Nakashima, Takeo ; Shiba, Eiji ; Murakami, Yuji ; Nagata, Yasushi</creatorcontrib><description>Background Predicting models of the gamma passing rate (GPR) have been studied to substitute the measurement‐based gamma analysis. Since these studies used data from different radiotherapy systems comprising TPS, linear accelerator, and detector array, it has been difficult to compare the performances of the predicting models among institutions with different radiotherapy systems. Purpose We aimed to develop unbiased scoring methods to evaluate the performance of the models predicting the GPR, by introducing both best and worst limits for the performance of the GPR prediction. Methods Two hundred head‐and‐neck VMAT plans were used to develop a framework. The GPRs were measured using the ArcCHECK device. The predicted GPR [p] was generated using a deep learning‐based model [pDL]. The predicting model was evaluated using four metrics: standard deviation (SD) [σ], Pearson's correlation coefficient (CC) [r], mean squared error (MSE) [s], and mean absolute error (MAE) [a]. The best limit [σm${\sigma _m}$, rm${r_m}$, sm${s_m}$, and am${a_m}$] was estimated by measuring the SD of measured GPR [m] by shifting the device along the longitudinal direction to measure different sampling points. Mimicked best and worst p’s [pbest and pworst] were generated from pDL. The worst limit was defined such that m and p have no correlation [CC ∼ 0]. The worst limit [σMix, rMix, sMix, and aMix] was generated using the event‐mixing (EM) technique originally introduced in high‐energy physics experiments. The range of σ, r, s, and a was defined to be [σm,σMix]$[ {{\sigma _m},{\sigma _{{\mathrm{Mix}}} ]$, [0,rm]$[ {0,{r_m}} ]$, [sm,sMix]$[ {{s_m},{s_{{\mathrm{Mix}}} ]$, and [am,aMix]$[ {{a_m},{a_{{\mathrm{Mix}}} ]$. The achievement score (AS) independently based on σ, r, s, and a were calculated for pDL, pbest and pworst. The probability that p fails the gamma analysis (alert frequency; AF) was estimated as a function of σd${\sigma _d}$ values within the [σm${\sigma _m}$, σMix] range for the 3%/2 mm data with a 95% criterion. Results SDs of the best limit were well reproduced by σm=0.531100−m${\sigma _m} = \;0.531\sqrt {100 - m} $. The EM technique successfully generated the (m,p)$( {m,p} )$ pairs with no correlation. The AS using four metrics showed good agreement. This agreement indicates successful definitions of both best and worst limits, consistent definitions of the AS, and successful generations of mixed events. The AF for the DL‐based model with the 3%/2 mm tolerance was 31.5% and 63.0% with CL's 99% and 99.9%, respectively. Conclusion We developed the AS to evaluate the predicting model of the GPR in an unbiased manner by excluding the effects of the precision of the radiotherapy system and the spreading of the GPR. The best and worst limits of the GPR prediction were successfully generated using the measured precision of the GPR and the EM technique, respectively. The AS and σp${\sigma _p}$ are expected to enable objective evaluation of the predicting model and setting exact achievement goal of precision for the predicted GPR.</description><identifier>ISSN: 0094-2405</identifier><identifier>EISSN: 2473-4209</identifier><identifier>DOI: 10.1002/mp.16848</identifier><identifier>PMID: 38009570</identifier><language>eng</language><publisher>United States</publisher><subject>Benchmarking ; event‐mixing ; gamma passing rate ; Gamma Rays ; quality assurance ; Radiotherapy Dosage ; Radiotherapy Planning, Computer-Assisted - methods ; Radiotherapy, Intensity-Modulated - methods ; volumetric‐modulated arc therapy</subject><ispartof>Medical physics (Lancaster), 2024-01, Vol.51 (1), p.5-17</ispartof><rights>2023 American Association of Physicists in Medicine.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c2828-6551fcba27dd79e75d93777c61676d1ab148bd4e3bc1f160236697f8b0dc963a3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fmp.16848$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fmp.16848$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,776,780,1411,27901,27902,45550,45551</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38009570$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Koganezawa, Akito S</creatorcontrib><creatorcontrib>Matsuura, Takaaki</creatorcontrib><creatorcontrib>Kawahara, Daisuke</creatorcontrib><creatorcontrib>Nakashima, Takeo</creatorcontrib><creatorcontrib>Shiba, Eiji</creatorcontrib><creatorcontrib>Murakami, Yuji</creatorcontrib><creatorcontrib>Nagata, Yasushi</creatorcontrib><title>Unbiased evaluation of predicted gamma passing rate by an event‐mixing technique</title><title>Medical physics (Lancaster)</title><addtitle>Med Phys</addtitle><description>Background Predicting models of the gamma passing rate (GPR) have been studied to substitute the measurement‐based gamma analysis. Since these studies used data from different radiotherapy systems comprising TPS, linear accelerator, and detector array, it has been difficult to compare the performances of the predicting models among institutions with different radiotherapy systems. Purpose We aimed to develop unbiased scoring methods to evaluate the performance of the models predicting the GPR, by introducing both best and worst limits for the performance of the GPR prediction. Methods Two hundred head‐and‐neck VMAT plans were used to develop a framework. The GPRs were measured using the ArcCHECK device. The predicted GPR [p] was generated using a deep learning‐based model [pDL]. The predicting model was evaluated using four metrics: standard deviation (SD) [σ], Pearson's correlation coefficient (CC) [r], mean squared error (MSE) [s], and mean absolute error (MAE) [a]. The best limit [σm${\sigma _m}$, rm${r_m}$, sm${s_m}$, and am${a_m}$] was estimated by measuring the SD of measured GPR [m] by shifting the device along the longitudinal direction to measure different sampling points. Mimicked best and worst p’s [pbest and pworst] were generated from pDL. The worst limit was defined such that m and p have no correlation [CC ∼ 0]. The worst limit [σMix, rMix, sMix, and aMix] was generated using the event‐mixing (EM) technique originally introduced in high‐energy physics experiments. The range of σ, r, s, and a was defined to be [σm,σMix]$[ {{\sigma _m},{\sigma _{{\mathrm{Mix}}} ]$, [0,rm]$[ {0,{r_m}} ]$, [sm,sMix]$[ {{s_m},{s_{{\mathrm{Mix}}} ]$, and [am,aMix]$[ {{a_m},{a_{{\mathrm{Mix}}} ]$. The achievement score (AS) independently based on σ, r, s, and a were calculated for pDL, pbest and pworst. The probability that p fails the gamma analysis (alert frequency; AF) was estimated as a function of σd${\sigma _d}$ values within the [σm${\sigma _m}$, σMix] range for the 3%/2 mm data with a 95% criterion. Results SDs of the best limit were well reproduced by σm=0.531100−m${\sigma _m} = \;0.531\sqrt {100 - m} $. The EM technique successfully generated the (m,p)$( {m,p} )$ pairs with no correlation. The AS using four metrics showed good agreement. This agreement indicates successful definitions of both best and worst limits, consistent definitions of the AS, and successful generations of mixed events. The AF for the DL‐based model with the 3%/2 mm tolerance was 31.5% and 63.0% with CL's 99% and 99.9%, respectively. Conclusion We developed the AS to evaluate the predicting model of the GPR in an unbiased manner by excluding the effects of the precision of the radiotherapy system and the spreading of the GPR. The best and worst limits of the GPR prediction were successfully generated using the measured precision of the GPR and the EM technique, respectively. The AS and σp${\sigma _p}$ are expected to enable objective evaluation of the predicting model and setting exact achievement goal of precision for the predicted GPR.</description><subject>Benchmarking</subject><subject>event‐mixing</subject><subject>gamma passing rate</subject><subject>Gamma Rays</subject><subject>quality assurance</subject><subject>Radiotherapy Dosage</subject><subject>Radiotherapy Planning, Computer-Assisted - methods</subject><subject>Radiotherapy, Intensity-Modulated - methods</subject><subject>volumetric‐modulated arc therapy</subject><issn>0094-2405</issn><issn>2473-4209</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp1kMtKw0AUhgdRbK2CTyBZukmdW2YmSxFvUFHEroe5pY5kkphJ1O58BJ_RJzG1VVeuDpz_O9-BH4BDBKcIQnwSmiligootMMaUk5RimG-DMYQ5TTGF2QjsxfgEIWQkg7tgRMQQZRyOwf280l5FZxP3ospedb6ukrpImtZZb7phv1AhqKRRMfpqkbSqc4leJqoaDlzVfb5_BP-2SjpnHiv_3Lt9sFOoMrqDzZyA-cX5w9lVOru9vD47naUGCyxSlmWoMFphbi3PHc9sTjjnhiHGmUVKIyq0pY5ogwrEICaM5bwQGlqTM6LIBByvvU1bD29jJ4OPxpWlqlzdR4lFTjnGecb-UNPWMbaukE3rg2qXEkG5alCGRn43OKBHG2uvg7O_4E9lA5CugVdfuuW_InlztxZ-AZ_yelc</recordid><startdate>202401</startdate><enddate>202401</enddate><creator>Koganezawa, Akito S</creator><creator>Matsuura, Takaaki</creator><creator>Kawahara, Daisuke</creator><creator>Nakashima, Takeo</creator><creator>Shiba, Eiji</creator><creator>Murakami, Yuji</creator><creator>Nagata, Yasushi</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>202401</creationdate><title>Unbiased evaluation of predicted gamma passing rate by an event‐mixing technique</title><author>Koganezawa, Akito S ; Matsuura, Takaaki ; Kawahara, Daisuke ; Nakashima, Takeo ; Shiba, Eiji ; Murakami, Yuji ; Nagata, Yasushi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2828-6551fcba27dd79e75d93777c61676d1ab148bd4e3bc1f160236697f8b0dc963a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Benchmarking</topic><topic>event‐mixing</topic><topic>gamma passing rate</topic><topic>Gamma Rays</topic><topic>quality assurance</topic><topic>Radiotherapy Dosage</topic><topic>Radiotherapy Planning, Computer-Assisted - methods</topic><topic>Radiotherapy, Intensity-Modulated - methods</topic><topic>volumetric‐modulated arc therapy</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Koganezawa, Akito S</creatorcontrib><creatorcontrib>Matsuura, Takaaki</creatorcontrib><creatorcontrib>Kawahara, Daisuke</creatorcontrib><creatorcontrib>Nakashima, Takeo</creatorcontrib><creatorcontrib>Shiba, Eiji</creatorcontrib><creatorcontrib>Murakami, Yuji</creatorcontrib><creatorcontrib>Nagata, Yasushi</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Medical physics (Lancaster)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Koganezawa, Akito S</au><au>Matsuura, Takaaki</au><au>Kawahara, Daisuke</au><au>Nakashima, Takeo</au><au>Shiba, Eiji</au><au>Murakami, Yuji</au><au>Nagata, Yasushi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Unbiased evaluation of predicted gamma passing rate by an event‐mixing technique</atitle><jtitle>Medical physics (Lancaster)</jtitle><addtitle>Med Phys</addtitle><date>2024-01</date><risdate>2024</risdate><volume>51</volume><issue>1</issue><spage>5</spage><epage>17</epage><pages>5-17</pages><issn>0094-2405</issn><eissn>2473-4209</eissn><abstract>Background Predicting models of the gamma passing rate (GPR) have been studied to substitute the measurement‐based gamma analysis. Since these studies used data from different radiotherapy systems comprising TPS, linear accelerator, and detector array, it has been difficult to compare the performances of the predicting models among institutions with different radiotherapy systems. Purpose We aimed to develop unbiased scoring methods to evaluate the performance of the models predicting the GPR, by introducing both best and worst limits for the performance of the GPR prediction. Methods Two hundred head‐and‐neck VMAT plans were used to develop a framework. The GPRs were measured using the ArcCHECK device. The predicted GPR [p] was generated using a deep learning‐based model [pDL]. The predicting model was evaluated using four metrics: standard deviation (SD) [σ], Pearson's correlation coefficient (CC) [r], mean squared error (MSE) [s], and mean absolute error (MAE) [a]. The best limit [σm${\sigma _m}$, rm${r_m}$, sm${s_m}$, and am${a_m}$] was estimated by measuring the SD of measured GPR [m] by shifting the device along the longitudinal direction to measure different sampling points. Mimicked best and worst p’s [pbest and pworst] were generated from pDL. The worst limit was defined such that m and p have no correlation [CC ∼ 0]. The worst limit [σMix, rMix, sMix, and aMix] was generated using the event‐mixing (EM) technique originally introduced in high‐energy physics experiments. The range of σ, r, s, and a was defined to be [σm,σMix]$[ {{\sigma _m},{\sigma _{{\mathrm{Mix}}} ]$, [0,rm]$[ {0,{r_m}} ]$, [sm,sMix]$[ {{s_m},{s_{{\mathrm{Mix}}} ]$, and [am,aMix]$[ {{a_m},{a_{{\mathrm{Mix}}} ]$. The achievement score (AS) independently based on σ, r, s, and a were calculated for pDL, pbest and pworst. The probability that p fails the gamma analysis (alert frequency; AF) was estimated as a function of σd${\sigma _d}$ values within the [σm${\sigma _m}$, σMix] range for the 3%/2 mm data with a 95% criterion. Results SDs of the best limit were well reproduced by σm=0.531100−m${\sigma _m} = \;0.531\sqrt {100 - m} $. The EM technique successfully generated the (m,p)$( {m,p} )$ pairs with no correlation. The AS using four metrics showed good agreement. This agreement indicates successful definitions of both best and worst limits, consistent definitions of the AS, and successful generations of mixed events. The AF for the DL‐based model with the 3%/2 mm tolerance was 31.5% and 63.0% with CL's 99% and 99.9%, respectively. Conclusion We developed the AS to evaluate the predicting model of the GPR in an unbiased manner by excluding the effects of the precision of the radiotherapy system and the spreading of the GPR. The best and worst limits of the GPR prediction were successfully generated using the measured precision of the GPR and the EM technique, respectively. The AS and σp${\sigma _p}$ are expected to enable objective evaluation of the predicting model and setting exact achievement goal of precision for the predicted GPR.</abstract><cop>United States</cop><pmid>38009570</pmid><doi>10.1002/mp.16848</doi><tpages>13</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0094-2405
ispartof Medical physics (Lancaster), 2024-01, Vol.51 (1), p.5-17
issn 0094-2405
2473-4209
language eng
recordid cdi_proquest_miscellaneous_2894722956
source MEDLINE; Wiley Online Library Journals Frontfile Complete; Alma/SFX Local Collection
subjects Benchmarking
event‐mixing
gamma passing rate
Gamma Rays
quality assurance
Radiotherapy Dosage
Radiotherapy Planning, Computer-Assisted - methods
Radiotherapy, Intensity-Modulated - methods
volumetric‐modulated arc therapy
title Unbiased evaluation of predicted gamma passing rate by an event‐mixing technique
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T08%3A00%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Unbiased%20evaluation%20of%20predicted%20gamma%20passing%20rate%20by%20an%20event%E2%80%90mixing%20technique&rft.jtitle=Medical%20physics%20(Lancaster)&rft.au=Koganezawa,%20Akito%20S&rft.date=2024-01&rft.volume=51&rft.issue=1&rft.spage=5&rft.epage=17&rft.pages=5-17&rft.issn=0094-2405&rft.eissn=2473-4209&rft_id=info:doi/10.1002/mp.16848&rft_dat=%3Cproquest_cross%3E2894722956%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2894722956&rft_id=info:pmid/38009570&rfr_iscdi=true