Systematic comparison of multi-omics survival models reveals a widespread lack of noise resistance

As observed in several previous studies, integrating more molecular modalities in multi-omics cancer survival models may not always improve model accuracy. In this study, we compared eight deep learning and four statistical integration techniques for survival prediction on 17 multi-omics datasets, e...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Cell reports methods 2023-04, Vol.3 (4), p.100461-100461, Article 100461
Hauptverfasser: Wissel, David, Rowson, Daniel, Boeva, Valentina
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 100461
container_issue 4
container_start_page 100461
container_title Cell reports methods
container_volume 3
creator Wissel, David
Rowson, Daniel
Boeva, Valentina
description As observed in several previous studies, integrating more molecular modalities in multi-omics cancer survival models may not always improve model accuracy. In this study, we compared eight deep learning and four statistical integration techniques for survival prediction on 17 multi-omics datasets, examining model performance in terms of overall accuracy and noise resistance. We found that one deep learning method, mean late fusion, and two statistical methods, PriorityLasso and BlockForest, performed best in terms of both noise resistance and overall discriminative and calibration performance. Nevertheless, all methods struggled to adequately handle noise when too many modalities were added. In summary, we confirmed that current multi-omics survival methods are not sufficiently noise resistant. We recommend relying on only modalities for which there is known predictive value for a particular cancer type until models that have stronger noise-resistance properties are developed. [Display omitted] •Multi-omics survival models are compared across The Cancer Genome Atlas (TCGA) datasets•Only one model performs better than a model trained using clinical data alone•PriorityLasso is recommended when the informativeness of modalities is uncertain With decreasing costs of high-throughput sequencing, more and more studies have started to provide multi-omics molecular profiles of patients with cancer. This has led to various developments of novel survival analysis approaches integrating these heterogeneous molecular and clinical groups of variables. Although some of these methods have reached state-of-the-art results in cancer survival prediction, many have demonstrated a decay in performance when integrating larger numbers of omics modalities. Wissel et al. perform an empirical comparison of 12 models for predicting survival from multi-omics data on 17 datasets from The Cancer Genome Atlas (TCGA). The authors show that all considered methods suffer from a lack of noise resistance. The study emphasizes the need for more noise-resistant multi-omics survival methods.
doi_str_mv 10.1016/j.crmeth.2023.100461
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10162996</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S2667237523000802</els_id><sourcerecordid>2811938017</sourcerecordid><originalsourceid>FETCH-LOGICAL-c464t-7abab7ea6594667febfb567a8678b4225a6aa64bd3f9f634a18704a5e46c82143</originalsourceid><addsrcrecordid>eNp9kc1u1TAQhS1ERau2b4BQlmxy8V_sZANCFQWkSiyAtTVxJtSXOL7YTlDfvr6kVGXDaizPN2dG5xDyktEdo0y92e9s9Jhvd5xyUb6oVOwZOeNK6ZoL3Tx_8j4llyntKaW8YUJ07AU5FZo1nVLdGem_3qWMHrKzlQ3-ANGlMFdhrPwyZVcH72yq0hJXt8JU-TDglKqIK0KpUP12A6ZDRBiqCezP4-AcXMKCJJcyzBYvyMlYYLx8qOfk-_WHb1ef6psvHz9fvb-prVQy1xp66DWCajpZTh-xH_tGaWiVbnvJeQMKQMl-EGM3KiGBtZpKaFAq23ImxTl5t-kelt7jYHHOESZziM5DvDMBnPm3M7tb8yOs5ugo7zpVFF4_KMTwa8GUjXfJ4jTBjGFJhreMdaKlTBdUbqiNIaWI4-MeRv8Imr3ZIjLHiMwWURl79fTGx6G_gRTg7QYUm3F1GE2yDouLg4tosxmC-_-GeygIpqs</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2811938017</pqid></control><display><type>article</type><title>Systematic comparison of multi-omics survival models reveals a widespread lack of noise resistance</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>Wissel, David ; Rowson, Daniel ; Boeva, Valentina</creator><creatorcontrib>Wissel, David ; Rowson, Daniel ; Boeva, Valentina</creatorcontrib><description>As observed in several previous studies, integrating more molecular modalities in multi-omics cancer survival models may not always improve model accuracy. In this study, we compared eight deep learning and four statistical integration techniques for survival prediction on 17 multi-omics datasets, examining model performance in terms of overall accuracy and noise resistance. We found that one deep learning method, mean late fusion, and two statistical methods, PriorityLasso and BlockForest, performed best in terms of both noise resistance and overall discriminative and calibration performance. Nevertheless, all methods struggled to adequately handle noise when too many modalities were added. In summary, we confirmed that current multi-omics survival methods are not sufficiently noise resistant. We recommend relying on only modalities for which there is known predictive value for a particular cancer type until models that have stronger noise-resistance properties are developed. [Display omitted] •Multi-omics survival models are compared across The Cancer Genome Atlas (TCGA) datasets•Only one model performs better than a model trained using clinical data alone•PriorityLasso is recommended when the informativeness of modalities is uncertain With decreasing costs of high-throughput sequencing, more and more studies have started to provide multi-omics molecular profiles of patients with cancer. This has led to various developments of novel survival analysis approaches integrating these heterogeneous molecular and clinical groups of variables. Although some of these methods have reached state-of-the-art results in cancer survival prediction, many have demonstrated a decay in performance when integrating larger numbers of omics modalities. Wissel et al. perform an empirical comparison of 12 models for predicting survival from multi-omics data on 17 datasets from The Cancer Genome Atlas (TCGA). The authors show that all considered methods suffer from a lack of noise resistance. The study emphasizes the need for more noise-resistant multi-omics survival methods.</description><identifier>ISSN: 2667-2375</identifier><identifier>EISSN: 2667-2375</identifier><identifier>DOI: 10.1016/j.crmeth.2023.100461</identifier><identifier>PMID: 37159669</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Calibration ; cancer ; deep learning models ; multi-modal integration ; multi-omics ; Multiomics ; noise resistance ; Resource ; statistical models ; survival analysis</subject><ispartof>Cell reports methods, 2023-04, Vol.3 (4), p.100461-100461, Article 100461</ispartof><rights>2023 The Author(s)</rights><rights>2023 The Author(s).</rights><rights>2023 The Author(s) 2023</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c464t-7abab7ea6594667febfb567a8678b4225a6aa64bd3f9f634a18704a5e46c82143</citedby><cites>FETCH-LOGICAL-c464t-7abab7ea6594667febfb567a8678b4225a6aa64bd3f9f634a18704a5e46c82143</cites><orcidid>0000-0002-4382-7185</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10162996/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10162996/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37159669$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Wissel, David</creatorcontrib><creatorcontrib>Rowson, Daniel</creatorcontrib><creatorcontrib>Boeva, Valentina</creatorcontrib><title>Systematic comparison of multi-omics survival models reveals a widespread lack of noise resistance</title><title>Cell reports methods</title><addtitle>Cell Rep Methods</addtitle><description>As observed in several previous studies, integrating more molecular modalities in multi-omics cancer survival models may not always improve model accuracy. In this study, we compared eight deep learning and four statistical integration techniques for survival prediction on 17 multi-omics datasets, examining model performance in terms of overall accuracy and noise resistance. We found that one deep learning method, mean late fusion, and two statistical methods, PriorityLasso and BlockForest, performed best in terms of both noise resistance and overall discriminative and calibration performance. Nevertheless, all methods struggled to adequately handle noise when too many modalities were added. In summary, we confirmed that current multi-omics survival methods are not sufficiently noise resistant. We recommend relying on only modalities for which there is known predictive value for a particular cancer type until models that have stronger noise-resistance properties are developed. [Display omitted] •Multi-omics survival models are compared across The Cancer Genome Atlas (TCGA) datasets•Only one model performs better than a model trained using clinical data alone•PriorityLasso is recommended when the informativeness of modalities is uncertain With decreasing costs of high-throughput sequencing, more and more studies have started to provide multi-omics molecular profiles of patients with cancer. This has led to various developments of novel survival analysis approaches integrating these heterogeneous molecular and clinical groups of variables. Although some of these methods have reached state-of-the-art results in cancer survival prediction, many have demonstrated a decay in performance when integrating larger numbers of omics modalities. Wissel et al. perform an empirical comparison of 12 models for predicting survival from multi-omics data on 17 datasets from The Cancer Genome Atlas (TCGA). The authors show that all considered methods suffer from a lack of noise resistance. The study emphasizes the need for more noise-resistant multi-omics survival methods.</description><subject>Calibration</subject><subject>cancer</subject><subject>deep learning models</subject><subject>multi-modal integration</subject><subject>multi-omics</subject><subject>Multiomics</subject><subject>noise resistance</subject><subject>Resource</subject><subject>statistical models</subject><subject>survival analysis</subject><issn>2667-2375</issn><issn>2667-2375</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kc1u1TAQhS1ERau2b4BQlmxy8V_sZANCFQWkSiyAtTVxJtSXOL7YTlDfvr6kVGXDaizPN2dG5xDyktEdo0y92e9s9Jhvd5xyUb6oVOwZOeNK6ZoL3Tx_8j4llyntKaW8YUJ07AU5FZo1nVLdGem_3qWMHrKzlQ3-ANGlMFdhrPwyZVcH72yq0hJXt8JU-TDglKqIK0KpUP12A6ZDRBiqCezP4-AcXMKCJJcyzBYvyMlYYLx8qOfk-_WHb1ef6psvHz9fvb-prVQy1xp66DWCajpZTh-xH_tGaWiVbnvJeQMKQMl-EGM3KiGBtZpKaFAq23ImxTl5t-kelt7jYHHOESZziM5DvDMBnPm3M7tb8yOs5ugo7zpVFF4_KMTwa8GUjXfJ4jTBjGFJhreMdaKlTBdUbqiNIaWI4-MeRv8Imr3ZIjLHiMwWURl79fTGx6G_gRTg7QYUm3F1GE2yDouLg4tosxmC-_-GeygIpqs</recordid><startdate>20230424</startdate><enddate>20230424</enddate><creator>Wissel, David</creator><creator>Rowson, Daniel</creator><creator>Boeva, Valentina</creator><general>Elsevier Inc</general><general>Elsevier</general><scope>6I.</scope><scope>AAFTH</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-4382-7185</orcidid></search><sort><creationdate>20230424</creationdate><title>Systematic comparison of multi-omics survival models reveals a widespread lack of noise resistance</title><author>Wissel, David ; Rowson, Daniel ; Boeva, Valentina</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c464t-7abab7ea6594667febfb567a8678b4225a6aa64bd3f9f634a18704a5e46c82143</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Calibration</topic><topic>cancer</topic><topic>deep learning models</topic><topic>multi-modal integration</topic><topic>multi-omics</topic><topic>Multiomics</topic><topic>noise resistance</topic><topic>Resource</topic><topic>statistical models</topic><topic>survival analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wissel, David</creatorcontrib><creatorcontrib>Rowson, Daniel</creatorcontrib><creatorcontrib>Boeva, Valentina</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Cell reports methods</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wissel, David</au><au>Rowson, Daniel</au><au>Boeva, Valentina</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Systematic comparison of multi-omics survival models reveals a widespread lack of noise resistance</atitle><jtitle>Cell reports methods</jtitle><addtitle>Cell Rep Methods</addtitle><date>2023-04-24</date><risdate>2023</risdate><volume>3</volume><issue>4</issue><spage>100461</spage><epage>100461</epage><pages>100461-100461</pages><artnum>100461</artnum><issn>2667-2375</issn><eissn>2667-2375</eissn><abstract>As observed in several previous studies, integrating more molecular modalities in multi-omics cancer survival models may not always improve model accuracy. In this study, we compared eight deep learning and four statistical integration techniques for survival prediction on 17 multi-omics datasets, examining model performance in terms of overall accuracy and noise resistance. We found that one deep learning method, mean late fusion, and two statistical methods, PriorityLasso and BlockForest, performed best in terms of both noise resistance and overall discriminative and calibration performance. Nevertheless, all methods struggled to adequately handle noise when too many modalities were added. In summary, we confirmed that current multi-omics survival methods are not sufficiently noise resistant. We recommend relying on only modalities for which there is known predictive value for a particular cancer type until models that have stronger noise-resistance properties are developed. [Display omitted] •Multi-omics survival models are compared across The Cancer Genome Atlas (TCGA) datasets•Only one model performs better than a model trained using clinical data alone•PriorityLasso is recommended when the informativeness of modalities is uncertain With decreasing costs of high-throughput sequencing, more and more studies have started to provide multi-omics molecular profiles of patients with cancer. This has led to various developments of novel survival analysis approaches integrating these heterogeneous molecular and clinical groups of variables. Although some of these methods have reached state-of-the-art results in cancer survival prediction, many have demonstrated a decay in performance when integrating larger numbers of omics modalities. Wissel et al. perform an empirical comparison of 12 models for predicting survival from multi-omics data on 17 datasets from The Cancer Genome Atlas (TCGA). The authors show that all considered methods suffer from a lack of noise resistance. The study emphasizes the need for more noise-resistant multi-omics survival methods.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>37159669</pmid><doi>10.1016/j.crmeth.2023.100461</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-4382-7185</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2667-2375
ispartof Cell reports methods, 2023-04, Vol.3 (4), p.100461-100461, Article 100461
issn 2667-2375
2667-2375
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10162996
source MEDLINE; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals; PubMed Central; Alma/SFX Local Collection
subjects Calibration
cancer
deep learning models
multi-modal integration
multi-omics
Multiomics
noise resistance
Resource
statistical models
survival analysis
title Systematic comparison of multi-omics survival models reveals a widespread lack of noise resistance
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T03%3A57%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Systematic%20comparison%20of%20multi-omics%20survival%20models%20reveals%20a%20widespread%20lack%20of%20noise%20resistance&rft.jtitle=Cell%20reports%20methods&rft.au=Wissel,%20David&rft.date=2023-04-24&rft.volume=3&rft.issue=4&rft.spage=100461&rft.epage=100461&rft.pages=100461-100461&rft.artnum=100461&rft.issn=2667-2375&rft.eissn=2667-2375&rft_id=info:doi/10.1016/j.crmeth.2023.100461&rft_dat=%3Cproquest_pubme%3E2811938017%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2811938017&rft_id=info:pmid/37159669&rft_els_id=S2667237523000802&rfr_iscdi=true