Probabilistic Mixture Models Improve Calibration of Panel-derived Tumor Mutational Burden in the Context of both Tumor-normal and Tumor-only Sequencing
Tumor mutational burden (TMB) has been investigated as a biomarker for immune checkpoint blockade (ICB) therapy. Increasingly, TMB is being estimated with gene panel-based assays (as opposed to full exome sequencing) and different gene panels cover overlapping but distinct genomic coordinates, makin...
Gespeichert in:
Veröffentlicht in: | Cancer research communications 2023-03, Vol.3 (3), p.501-509 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 509 |
---|---|
container_issue | 3 |
container_start_page | 501 |
container_title | Cancer research communications |
container_volume | 3 |
creator | Anaya, Jordan Sidhom, John-William Cummings, Craig A Baras, Alexander S |
description | Tumor mutational burden (TMB) has been investigated as a biomarker for immune checkpoint blockade (ICB) therapy. Increasingly, TMB is being estimated with gene panel-based assays (as opposed to full exome sequencing) and different gene panels cover overlapping but distinct genomic coordinates, making comparisons across panels difficult. Previous studies have suggested that standardization and calibration to exome-derived TMB be done for each panel to ensure comparability. With TMB cutoffs being developed from panel-based assays, there is a need to understand how to properly estimate exomic TMB values from different panel-based assays.
Our approach to calibration of panel-derived TMB to exomic TMB proposes the use of probabilistic mixture models that allow for nonlinear relationships along with heteroscedastic error. We examined various inputs including nonsynonymous, synonymous, and hotspot counts along with genetic ancestry. Using The Cancer Genome Atlas cohort, we generated a tumor-only version of the panel-restricted data by reintroducing private germline variants.
We were able to model more accurately the distribution of both tumor-normal and tumor-only data using the proposed probabilistic mixture models as compared with linear regression. Applying a model trained on tumor-normal data to tumor-only input results in biased TMB predictions. Including synonymous mutations resulted in better regression metrics across both data types, but ultimately a model able to dynamically weight the various input mutation types exhibited optimal performance. Including genetic ancestry improved model performance only in the context of tumor-only data, wherein private germline variants are observed.
A probabilistic mixture model better models the nonlinearity and heteroscedasticity of the data as compared with linear regression. Tumor-only panel data are needed to properly calibrate tumor-only panels to exomic TMB. Leveraging the uncertainty of point estimates from these models better informs cohort stratification in terms of TMB. |
doi_str_mv | 10.1158/2767-9764.CRC-22-0339 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10044680</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2793984050</sourcerecordid><originalsourceid>FETCH-LOGICAL-c342t-edf92b8fe4e3f8e20a7bb7e7845d3309c46d649dfe09339f1d9cb3cfed6926923</originalsourceid><addsrcrecordid>eNpVkd9qHCEUxofQkoQ0j9DiZW9MHHX-eFXaIW0CWRqS9Fqc8UzW4uhWnSV5kr5une52SUFQPN_3ncP5FcX7klyUZdVe0qZusGhqftHdd5hSTBgTR8Xp4f_Nq_dJcR7jT0IIbRpe1ey4OGG1EIJwflr8vgu-V72xJiYzoJV5TnMAtPIabEQ30yb4LaBOWdMHlYx3yI_oTjmwWEMwW9DocZ58QKs5_a0ri77MQYNDxqG0zl7vEjynxdf7tN7JsfNhylLl9n7snX1BD_BrBjcY9_SueDsqG-F8f58VP75ePXbX-Pb7t5vu8y0eGKcJgx4F7dsROLCxBUpU0_cNNC2vNGNEDLzWNRd6BCLyisZSi6Fnwwi6FjQfdlZ82uVu5n4CPYBLQVm5CWZS4UV6ZeT_FWfW8slvZUny_uqW5ISP-4Tg8_QxycnEAazNS_JzlLQRTLScVIu02kmH4GMMMB76lEQuYOUCTS7QZAYrKZUL2Oz78HrIg-sfRvYH1vCjqQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2793984050</pqid></control><display><type>article</type><title>Probabilistic Mixture Models Improve Calibration of Panel-derived Tumor Mutational Burden in the Context of both Tumor-normal and Tumor-only Sequencing</title><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Anaya, Jordan ; Sidhom, John-William ; Cummings, Craig A ; Baras, Alexander S</creator><creatorcontrib>Anaya, Jordan ; Sidhom, John-William ; Cummings, Craig A ; Baras, Alexander S ; for AACR Project GENIE Consortium</creatorcontrib><description>Tumor mutational burden (TMB) has been investigated as a biomarker for immune checkpoint blockade (ICB) therapy. Increasingly, TMB is being estimated with gene panel-based assays (as opposed to full exome sequencing) and different gene panels cover overlapping but distinct genomic coordinates, making comparisons across panels difficult. Previous studies have suggested that standardization and calibration to exome-derived TMB be done for each panel to ensure comparability. With TMB cutoffs being developed from panel-based assays, there is a need to understand how to properly estimate exomic TMB values from different panel-based assays.
Our approach to calibration of panel-derived TMB to exomic TMB proposes the use of probabilistic mixture models that allow for nonlinear relationships along with heteroscedastic error. We examined various inputs including nonsynonymous, synonymous, and hotspot counts along with genetic ancestry. Using The Cancer Genome Atlas cohort, we generated a tumor-only version of the panel-restricted data by reintroducing private germline variants.
We were able to model more accurately the distribution of both tumor-normal and tumor-only data using the proposed probabilistic mixture models as compared with linear regression. Applying a model trained on tumor-normal data to tumor-only input results in biased TMB predictions. Including synonymous mutations resulted in better regression metrics across both data types, but ultimately a model able to dynamically weight the various input mutation types exhibited optimal performance. Including genetic ancestry improved model performance only in the context of tumor-only data, wherein private germline variants are observed.
A probabilistic mixture model better models the nonlinearity and heteroscedasticity of the data as compared with linear regression. Tumor-only panel data are needed to properly calibrate tumor-only panels to exomic TMB. Leveraging the uncertainty of point estimates from these models better informs cohort stratification in terms of TMB.</description><identifier>ISSN: 2767-9764</identifier><identifier>EISSN: 2767-9764</identifier><identifier>DOI: 10.1158/2767-9764.CRC-22-0339</identifier><identifier>PMID: 36999044</identifier><language>eng</language><publisher>United States: American Association for Cancer Research</publisher><subject>Biomarkers ; Computational Methods ; Methodology And Modeling</subject><ispartof>Cancer research communications, 2023-03, Vol.3 (3), p.501-509</ispartof><rights>2023 The Authors; Published by the American Association for Cancer Research.</rights><rights>2023 The Authors; Published by the American Association for Cancer Research 2023 Copyright held by the owner/author(s).</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c342t-edf92b8fe4e3f8e20a7bb7e7845d3309c46d649dfe09339f1d9cb3cfed6926923</citedby><cites>FETCH-LOGICAL-c342t-edf92b8fe4e3f8e20a7bb7e7845d3309c46d649dfe09339f1d9cb3cfed6926923</cites><orcidid>0000-0003-2397-3342 ; 0000-0002-6166-4113 ; 0000-0002-5575-0285 ; 0000-0001-7276-9474</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10044680/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10044680/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36999044$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Anaya, Jordan</creatorcontrib><creatorcontrib>Sidhom, John-William</creatorcontrib><creatorcontrib>Cummings, Craig A</creatorcontrib><creatorcontrib>Baras, Alexander S</creatorcontrib><creatorcontrib>for AACR Project GENIE Consortium</creatorcontrib><title>Probabilistic Mixture Models Improve Calibration of Panel-derived Tumor Mutational Burden in the Context of both Tumor-normal and Tumor-only Sequencing</title><title>Cancer research communications</title><addtitle>Cancer Res Commun</addtitle><description>Tumor mutational burden (TMB) has been investigated as a biomarker for immune checkpoint blockade (ICB) therapy. Increasingly, TMB is being estimated with gene panel-based assays (as opposed to full exome sequencing) and different gene panels cover overlapping but distinct genomic coordinates, making comparisons across panels difficult. Previous studies have suggested that standardization and calibration to exome-derived TMB be done for each panel to ensure comparability. With TMB cutoffs being developed from panel-based assays, there is a need to understand how to properly estimate exomic TMB values from different panel-based assays.
Our approach to calibration of panel-derived TMB to exomic TMB proposes the use of probabilistic mixture models that allow for nonlinear relationships along with heteroscedastic error. We examined various inputs including nonsynonymous, synonymous, and hotspot counts along with genetic ancestry. Using The Cancer Genome Atlas cohort, we generated a tumor-only version of the panel-restricted data by reintroducing private germline variants.
We were able to model more accurately the distribution of both tumor-normal and tumor-only data using the proposed probabilistic mixture models as compared with linear regression. Applying a model trained on tumor-normal data to tumor-only input results in biased TMB predictions. Including synonymous mutations resulted in better regression metrics across both data types, but ultimately a model able to dynamically weight the various input mutation types exhibited optimal performance. Including genetic ancestry improved model performance only in the context of tumor-only data, wherein private germline variants are observed.
A probabilistic mixture model better models the nonlinearity and heteroscedasticity of the data as compared with linear regression. Tumor-only panel data are needed to properly calibrate tumor-only panels to exomic TMB. Leveraging the uncertainty of point estimates from these models better informs cohort stratification in terms of TMB.</description><subject>Biomarkers</subject><subject>Computational Methods</subject><subject>Methodology And Modeling</subject><issn>2767-9764</issn><issn>2767-9764</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpVkd9qHCEUxofQkoQ0j9DiZW9MHHX-eFXaIW0CWRqS9Fqc8UzW4uhWnSV5kr5une52SUFQPN_3ncP5FcX7klyUZdVe0qZusGhqftHdd5hSTBgTR8Xp4f_Nq_dJcR7jT0IIbRpe1ey4OGG1EIJwflr8vgu-V72xJiYzoJV5TnMAtPIabEQ30yb4LaBOWdMHlYx3yI_oTjmwWEMwW9DocZ58QKs5_a0ri77MQYNDxqG0zl7vEjynxdf7tN7JsfNhylLl9n7snX1BD_BrBjcY9_SueDsqG-F8f58VP75ePXbX-Pb7t5vu8y0eGKcJgx4F7dsROLCxBUpU0_cNNC2vNGNEDLzWNRd6BCLyisZSi6Fnwwi6FjQfdlZ82uVu5n4CPYBLQVm5CWZS4UV6ZeT_FWfW8slvZUny_uqW5ISP-4Tg8_QxycnEAazNS_JzlLQRTLScVIu02kmH4GMMMB76lEQuYOUCTS7QZAYrKZUL2Oz78HrIg-sfRvYH1vCjqQ</recordid><startdate>20230301</startdate><enddate>20230301</enddate><creator>Anaya, Jordan</creator><creator>Sidhom, John-William</creator><creator>Cummings, Craig A</creator><creator>Baras, Alexander S</creator><general>American Association for Cancer Research</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-2397-3342</orcidid><orcidid>https://orcid.org/0000-0002-6166-4113</orcidid><orcidid>https://orcid.org/0000-0002-5575-0285</orcidid><orcidid>https://orcid.org/0000-0001-7276-9474</orcidid></search><sort><creationdate>20230301</creationdate><title>Probabilistic Mixture Models Improve Calibration of Panel-derived Tumor Mutational Burden in the Context of both Tumor-normal and Tumor-only Sequencing</title><author>Anaya, Jordan ; Sidhom, John-William ; Cummings, Craig A ; Baras, Alexander S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c342t-edf92b8fe4e3f8e20a7bb7e7845d3309c46d649dfe09339f1d9cb3cfed6926923</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Biomarkers</topic><topic>Computational Methods</topic><topic>Methodology And Modeling</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Anaya, Jordan</creatorcontrib><creatorcontrib>Sidhom, John-William</creatorcontrib><creatorcontrib>Cummings, Craig A</creatorcontrib><creatorcontrib>Baras, Alexander S</creatorcontrib><creatorcontrib>for AACR Project GENIE Consortium</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Cancer research communications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Anaya, Jordan</au><au>Sidhom, John-William</au><au>Cummings, Craig A</au><au>Baras, Alexander S</au><aucorp>for AACR Project GENIE Consortium</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Probabilistic Mixture Models Improve Calibration of Panel-derived Tumor Mutational Burden in the Context of both Tumor-normal and Tumor-only Sequencing</atitle><jtitle>Cancer research communications</jtitle><addtitle>Cancer Res Commun</addtitle><date>2023-03-01</date><risdate>2023</risdate><volume>3</volume><issue>3</issue><spage>501</spage><epage>509</epage><pages>501-509</pages><issn>2767-9764</issn><eissn>2767-9764</eissn><abstract>Tumor mutational burden (TMB) has been investigated as a biomarker for immune checkpoint blockade (ICB) therapy. Increasingly, TMB is being estimated with gene panel-based assays (as opposed to full exome sequencing) and different gene panels cover overlapping but distinct genomic coordinates, making comparisons across panels difficult. Previous studies have suggested that standardization and calibration to exome-derived TMB be done for each panel to ensure comparability. With TMB cutoffs being developed from panel-based assays, there is a need to understand how to properly estimate exomic TMB values from different panel-based assays.
Our approach to calibration of panel-derived TMB to exomic TMB proposes the use of probabilistic mixture models that allow for nonlinear relationships along with heteroscedastic error. We examined various inputs including nonsynonymous, synonymous, and hotspot counts along with genetic ancestry. Using The Cancer Genome Atlas cohort, we generated a tumor-only version of the panel-restricted data by reintroducing private germline variants.
We were able to model more accurately the distribution of both tumor-normal and tumor-only data using the proposed probabilistic mixture models as compared with linear regression. Applying a model trained on tumor-normal data to tumor-only input results in biased TMB predictions. Including synonymous mutations resulted in better regression metrics across both data types, but ultimately a model able to dynamically weight the various input mutation types exhibited optimal performance. Including genetic ancestry improved model performance only in the context of tumor-only data, wherein private germline variants are observed.
A probabilistic mixture model better models the nonlinearity and heteroscedasticity of the data as compared with linear regression. Tumor-only panel data are needed to properly calibrate tumor-only panels to exomic TMB. Leveraging the uncertainty of point estimates from these models better informs cohort stratification in terms of TMB.</abstract><cop>United States</cop><pub>American Association for Cancer Research</pub><pmid>36999044</pmid><doi>10.1158/2767-9764.CRC-22-0339</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0003-2397-3342</orcidid><orcidid>https://orcid.org/0000-0002-6166-4113</orcidid><orcidid>https://orcid.org/0000-0002-5575-0285</orcidid><orcidid>https://orcid.org/0000-0001-7276-9474</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2767-9764 |
ispartof | Cancer research communications, 2023-03, Vol.3 (3), p.501-509 |
issn | 2767-9764 2767-9764 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10044680 |
source | DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals; PubMed Central |
subjects | Biomarkers Computational Methods Methodology And Modeling |
title | Probabilistic Mixture Models Improve Calibration of Panel-derived Tumor Mutational Burden in the Context of both Tumor-normal and Tumor-only Sequencing |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T18%3A47%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Probabilistic%20Mixture%20Models%20Improve%20Calibration%20of%20Panel-derived%20Tumor%20Mutational%20Burden%20in%20the%20Context%20of%20both%20Tumor-normal%20and%20Tumor-only%20Sequencing&rft.jtitle=Cancer%20research%20communications&rft.au=Anaya,%20Jordan&rft.aucorp=for%20AACR%20Project%20GENIE%20Consortium&rft.date=2023-03-01&rft.volume=3&rft.issue=3&rft.spage=501&rft.epage=509&rft.pages=501-509&rft.issn=2767-9764&rft.eissn=2767-9764&rft_id=info:doi/10.1158/2767-9764.CRC-22-0339&rft_dat=%3Cproquest_pubme%3E2793984050%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2793984050&rft_id=info:pmid/36999044&rfr_iscdi=true |