VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data

A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutatio...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:PLoS computational biology 2014-02, Vol.10 (2), p.e1003460-e1003460
Hauptverfasser: Jia, Peilin, Zhao, Zhongming
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page e1003460
container_issue 2
container_start_page e1003460
container_title PLoS computational biology
container_volume 10
creator Jia, Peilin
Zhao, Zhongming
description A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in >300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data.
doi_str_mv 10.1371/journal.pcbi.1003460
format Article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_1507831260</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A369062955</galeid><doaj_id>oai_doaj_org_article_ba4de83ddffb425b8ae753e0fae3aa91</doaj_id><sourcerecordid>A369062955</sourcerecordid><originalsourceid>FETCH-LOGICAL-c633t-1a57bdce8f228a5081922aa725adae3c2201752aec991977bafc9ce62155fa833</originalsourceid><addsrcrecordid>eNqVkk1v1DAQhiMEomXhHyCIxAUOWfwRxwkHpKriY6UKJD6P1sQZB28Te7GTQvn1eLvbqitxQT7E9jzvm_HMZNljSpaUS_py7efgYFhudGuXlBBeVuROdkyF4IXkor57a3-UPYhxnRhRN9X97IiVglZcsuPMf4PwHYZzDK_yDYbok6X9g10-zhNM1rvc4fTLh_McUuQy2ph7k2-ugheYa3AaQ96jw5ib4MeE_56K7Tns5BF_zui0dX3ewQQPs3sGhoiP9t9F9vXtmy-n74uzj-9Wpydnha44nwoKQradxtowVoMgNW0YA5BMQAfINWOESsEAddPQRsoWjG40Viw92UDN-SJ7uvPdDD6qfa2iooLImlNWkUSsdkTnYa02wY4QLpUHq64ufOgVhMnqAVULZYc17zpj2pKJtgaUgiMxKRWAhiav1_u_ze2IKW83BRgOTA8jzv5Qvb9QvKEVYzIZPN8bBJ_qFSc12qhxGMChn1PeZXpn2QjJEvpsh_aQUrPO-OSot7g64VVDKtakti-y5T-otDocrfYOjU33B4IXB4LETKmVPcwxqtXnT__Bfjhkyx2rg48xoLmpCiVqO8jXzVHbQVb7QU6yJ7creiO6nlz-F9-t8kQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1499149572</pqid></control><display><type>article</type><title>VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Public Library of Science (PLoS)</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Jia, Peilin ; Zhao, Zhongming</creator><contributor>Radivojac, Predrag</contributor><creatorcontrib>Jia, Peilin ; Zhao, Zhongming ; Radivojac, Predrag</creatorcontrib><description>A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in &gt;300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data.</description><identifier>ISSN: 1553-7358</identifier><identifier>ISSN: 1553-734X</identifier><identifier>EISSN: 1553-7358</identifier><identifier>DOI: 10.1371/journal.pcbi.1003460</identifier><identifier>PMID: 24516372</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Adenocarcinoma - genetics ; Adenocarcinoma of Lung ; Algorithms ; Biology ; Cancer ; Complications and side effects ; Computational Biology ; Computer Science ; Consensus Sequence ; Databases, Genetic - statistics &amp; numerical data ; Development and progression ; DNA Mutational Analysis - statistics &amp; numerical data ; DNA, Neoplasm - genetics ; Gene Frequency ; Gene mutations ; Gene Regulatory Networks ; Genes ; Genetic aspects ; Genomes ; High-Throughput Nucleotide Sequencing - statistics &amp; numerical data ; Humans ; Lung Neoplasms - genetics ; Medicine ; Melanoma ; Melanoma - genetics ; Models, Genetic ; Molecular Sequence Annotation - statistics &amp; numerical data ; Mutation ; Neoplasms - genetics ; Oncogenes ; Ontology ; Proteins ; Risk factors ; Tumorigenesis</subject><ispartof>PLoS computational biology, 2014-02, Vol.10 (2), p.e1003460-e1003460</ispartof><rights>COPYRIGHT 2014 Public Library of Science</rights><rights>2014 Jia, Zhao 2014 Jia, Zhao</rights><rights>2014 Jia, Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited: Jia P, Zhao Z (2014) VarWalker: Personalized Mutation Network Analysis of Putative Cancer Genes from Next-Generation Sequencing Data. PLoS Comput Biol 10(2): e1003460. doi:10.1371/journal.pcbi.1003460</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c633t-1a57bdce8f228a5081922aa725adae3c2201752aec991977bafc9ce62155fa833</citedby><cites>FETCH-LOGICAL-c633t-1a57bdce8f228a5081922aa725adae3c2201752aec991977bafc9ce62155fa833</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3916227/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3916227/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,2095,2914,23846,27903,27904,53769,53771,79346,79347</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/24516372$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Radivojac, Predrag</contributor><creatorcontrib>Jia, Peilin</creatorcontrib><creatorcontrib>Zhao, Zhongming</creatorcontrib><title>VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data</title><title>PLoS computational biology</title><addtitle>PLoS Comput Biol</addtitle><description>A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in &gt;300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data.</description><subject>Adenocarcinoma - genetics</subject><subject>Adenocarcinoma of Lung</subject><subject>Algorithms</subject><subject>Biology</subject><subject>Cancer</subject><subject>Complications and side effects</subject><subject>Computational Biology</subject><subject>Computer Science</subject><subject>Consensus Sequence</subject><subject>Databases, Genetic - statistics &amp; numerical data</subject><subject>Development and progression</subject><subject>DNA Mutational Analysis - statistics &amp; numerical data</subject><subject>DNA, Neoplasm - genetics</subject><subject>Gene Frequency</subject><subject>Gene mutations</subject><subject>Gene Regulatory Networks</subject><subject>Genes</subject><subject>Genetic aspects</subject><subject>Genomes</subject><subject>High-Throughput Nucleotide Sequencing - statistics &amp; numerical data</subject><subject>Humans</subject><subject>Lung Neoplasms - genetics</subject><subject>Medicine</subject><subject>Melanoma</subject><subject>Melanoma - genetics</subject><subject>Models, Genetic</subject><subject>Molecular Sequence Annotation - statistics &amp; numerical data</subject><subject>Mutation</subject><subject>Neoplasms - genetics</subject><subject>Oncogenes</subject><subject>Ontology</subject><subject>Proteins</subject><subject>Risk factors</subject><subject>Tumorigenesis</subject><issn>1553-7358</issn><issn>1553-734X</issn><issn>1553-7358</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>DOA</sourceid><recordid>eNqVkk1v1DAQhiMEomXhHyCIxAUOWfwRxwkHpKriY6UKJD6P1sQZB28Te7GTQvn1eLvbqitxQT7E9jzvm_HMZNljSpaUS_py7efgYFhudGuXlBBeVuROdkyF4IXkor57a3-UPYhxnRhRN9X97IiVglZcsuPMf4PwHYZzDK_yDYbok6X9g10-zhNM1rvc4fTLh_McUuQy2ph7k2-ugheYa3AaQ96jw5ib4MeE_56K7Tns5BF_zui0dX3ewQQPs3sGhoiP9t9F9vXtmy-n74uzj-9Wpydnha44nwoKQradxtowVoMgNW0YA5BMQAfINWOESsEAddPQRsoWjG40Viw92UDN-SJ7uvPdDD6qfa2iooLImlNWkUSsdkTnYa02wY4QLpUHq64ufOgVhMnqAVULZYc17zpj2pKJtgaUgiMxKRWAhiav1_u_ze2IKW83BRgOTA8jzv5Qvb9QvKEVYzIZPN8bBJ_qFSc12qhxGMChn1PeZXpn2QjJEvpsh_aQUrPO-OSot7g64VVDKtakti-y5T-otDocrfYOjU33B4IXB4LETKmVPcwxqtXnT__Bfjhkyx2rg48xoLmpCiVqO8jXzVHbQVb7QU6yJ7creiO6nlz-F9-t8kQ</recordid><startdate>20140201</startdate><enddate>20140201</enddate><creator>Jia, Peilin</creator><creator>Zhao, Zhongming</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISN</scope><scope>ISR</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20140201</creationdate><title>VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data</title><author>Jia, Peilin ; Zhao, Zhongming</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c633t-1a57bdce8f228a5081922aa725adae3c2201752aec991977bafc9ce62155fa833</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Adenocarcinoma - genetics</topic><topic>Adenocarcinoma of Lung</topic><topic>Algorithms</topic><topic>Biology</topic><topic>Cancer</topic><topic>Complications and side effects</topic><topic>Computational Biology</topic><topic>Computer Science</topic><topic>Consensus Sequence</topic><topic>Databases, Genetic - statistics &amp; numerical data</topic><topic>Development and progression</topic><topic>DNA Mutational Analysis - statistics &amp; numerical data</topic><topic>DNA, Neoplasm - genetics</topic><topic>Gene Frequency</topic><topic>Gene mutations</topic><topic>Gene Regulatory Networks</topic><topic>Genes</topic><topic>Genetic aspects</topic><topic>Genomes</topic><topic>High-Throughput Nucleotide Sequencing - statistics &amp; numerical data</topic><topic>Humans</topic><topic>Lung Neoplasms - genetics</topic><topic>Medicine</topic><topic>Melanoma</topic><topic>Melanoma - genetics</topic><topic>Models, Genetic</topic><topic>Molecular Sequence Annotation - statistics &amp; numerical data</topic><topic>Mutation</topic><topic>Neoplasms - genetics</topic><topic>Oncogenes</topic><topic>Ontology</topic><topic>Proteins</topic><topic>Risk factors</topic><topic>Tumorigenesis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jia, Peilin</creatorcontrib><creatorcontrib>Zhao, Zhongming</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Canada</collection><collection>Gale In Context: Science</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PLoS computational biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jia, Peilin</au><au>Zhao, Zhongming</au><au>Radivojac, Predrag</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data</atitle><jtitle>PLoS computational biology</jtitle><addtitle>PLoS Comput Biol</addtitle><date>2014-02-01</date><risdate>2014</risdate><volume>10</volume><issue>2</issue><spage>e1003460</spage><epage>e1003460</epage><pages>e1003460-e1003460</pages><issn>1553-7358</issn><issn>1553-734X</issn><eissn>1553-7358</eissn><abstract>A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in &gt;300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>24516372</pmid><doi>10.1371/journal.pcbi.1003460</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1553-7358
ispartof PLoS computational biology, 2014-02, Vol.10 (2), p.e1003460-e1003460
issn 1553-7358
1553-734X
1553-7358
language eng
recordid cdi_plos_journals_1507831260
source MEDLINE; DOAJ Directory of Open Access Journals; Public Library of Science (PLoS); EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects Adenocarcinoma - genetics
Adenocarcinoma of Lung
Algorithms
Biology
Cancer
Complications and side effects
Computational Biology
Computer Science
Consensus Sequence
Databases, Genetic - statistics & numerical data
Development and progression
DNA Mutational Analysis - statistics & numerical data
DNA, Neoplasm - genetics
Gene Frequency
Gene mutations
Gene Regulatory Networks
Genes
Genetic aspects
Genomes
High-Throughput Nucleotide Sequencing - statistics & numerical data
Humans
Lung Neoplasms - genetics
Medicine
Melanoma
Melanoma - genetics
Models, Genetic
Molecular Sequence Annotation - statistics & numerical data
Mutation
Neoplasms - genetics
Oncogenes
Ontology
Proteins
Risk factors
Tumorigenesis
title VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T06%3A45%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=VarWalker:%20personalized%20mutation%20network%20analysis%20of%20putative%20cancer%20genes%20from%20next-generation%20sequencing%20data&rft.jtitle=PLoS%20computational%20biology&rft.au=Jia,%20Peilin&rft.date=2014-02-01&rft.volume=10&rft.issue=2&rft.spage=e1003460&rft.epage=e1003460&rft.pages=e1003460-e1003460&rft.issn=1553-7358&rft.eissn=1553-7358&rft_id=info:doi/10.1371/journal.pcbi.1003460&rft_dat=%3Cgale_plos_%3EA369062955%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1499149572&rft_id=info:pmid/24516372&rft_galeid=A369062955&rft_doaj_id=oai_doaj_org_article_ba4de83ddffb425b8ae753e0fae3aa91&rfr_iscdi=true