VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data
A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutatio...
Gespeichert in:
Veröffentlicht in: | PLoS computational biology 2014-02, Vol.10 (2), p.e1003460-e1003460 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | e1003460 |
---|---|
container_issue | 2 |
container_start_page | e1003460 |
container_title | PLoS computational biology |
container_volume | 10 |
creator | Jia, Peilin Zhao, Zhongming |
description | A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in >300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data. |
doi_str_mv | 10.1371/journal.pcbi.1003460 |
format | Article |
fullrecord | <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_1507831260</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A369062955</galeid><doaj_id>oai_doaj_org_article_ba4de83ddffb425b8ae753e0fae3aa91</doaj_id><sourcerecordid>A369062955</sourcerecordid><originalsourceid>FETCH-LOGICAL-c633t-1a57bdce8f228a5081922aa725adae3c2201752aec991977bafc9ce62155fa833</originalsourceid><addsrcrecordid>eNqVkk1v1DAQhiMEomXhHyCIxAUOWfwRxwkHpKriY6UKJD6P1sQZB28Te7GTQvn1eLvbqitxQT7E9jzvm_HMZNljSpaUS_py7efgYFhudGuXlBBeVuROdkyF4IXkor57a3-UPYhxnRhRN9X97IiVglZcsuPMf4PwHYZzDK_yDYbok6X9g10-zhNM1rvc4fTLh_McUuQy2ph7k2-ugheYa3AaQ96jw5ib4MeE_56K7Tns5BF_zui0dX3ewQQPs3sGhoiP9t9F9vXtmy-n74uzj-9Wpydnha44nwoKQradxtowVoMgNW0YA5BMQAfINWOESsEAddPQRsoWjG40Viw92UDN-SJ7uvPdDD6qfa2iooLImlNWkUSsdkTnYa02wY4QLpUHq64ufOgVhMnqAVULZYc17zpj2pKJtgaUgiMxKRWAhiav1_u_ze2IKW83BRgOTA8jzv5Qvb9QvKEVYzIZPN8bBJ_qFSc12qhxGMChn1PeZXpn2QjJEvpsh_aQUrPO-OSot7g64VVDKtakti-y5T-otDocrfYOjU33B4IXB4LETKmVPcwxqtXnT__Bfjhkyx2rg48xoLmpCiVqO8jXzVHbQVb7QU6yJ7creiO6nlz-F9-t8kQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1499149572</pqid></control><display><type>article</type><title>VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Public Library of Science (PLoS)</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Jia, Peilin ; Zhao, Zhongming</creator><contributor>Radivojac, Predrag</contributor><creatorcontrib>Jia, Peilin ; Zhao, Zhongming ; Radivojac, Predrag</creatorcontrib><description>A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in >300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data.</description><identifier>ISSN: 1553-7358</identifier><identifier>ISSN: 1553-734X</identifier><identifier>EISSN: 1553-7358</identifier><identifier>DOI: 10.1371/journal.pcbi.1003460</identifier><identifier>PMID: 24516372</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Adenocarcinoma - genetics ; Adenocarcinoma of Lung ; Algorithms ; Biology ; Cancer ; Complications and side effects ; Computational Biology ; Computer Science ; Consensus Sequence ; Databases, Genetic - statistics & numerical data ; Development and progression ; DNA Mutational Analysis - statistics & numerical data ; DNA, Neoplasm - genetics ; Gene Frequency ; Gene mutations ; Gene Regulatory Networks ; Genes ; Genetic aspects ; Genomes ; High-Throughput Nucleotide Sequencing - statistics & numerical data ; Humans ; Lung Neoplasms - genetics ; Medicine ; Melanoma ; Melanoma - genetics ; Models, Genetic ; Molecular Sequence Annotation - statistics & numerical data ; Mutation ; Neoplasms - genetics ; Oncogenes ; Ontology ; Proteins ; Risk factors ; Tumorigenesis</subject><ispartof>PLoS computational biology, 2014-02, Vol.10 (2), p.e1003460-e1003460</ispartof><rights>COPYRIGHT 2014 Public Library of Science</rights><rights>2014 Jia, Zhao 2014 Jia, Zhao</rights><rights>2014 Jia, Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited: Jia P, Zhao Z (2014) VarWalker: Personalized Mutation Network Analysis of Putative Cancer Genes from Next-Generation Sequencing Data. PLoS Comput Biol 10(2): e1003460. doi:10.1371/journal.pcbi.1003460</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c633t-1a57bdce8f228a5081922aa725adae3c2201752aec991977bafc9ce62155fa833</citedby><cites>FETCH-LOGICAL-c633t-1a57bdce8f228a5081922aa725adae3c2201752aec991977bafc9ce62155fa833</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3916227/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3916227/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,2095,2914,23846,27903,27904,53769,53771,79346,79347</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/24516372$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Radivojac, Predrag</contributor><creatorcontrib>Jia, Peilin</creatorcontrib><creatorcontrib>Zhao, Zhongming</creatorcontrib><title>VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data</title><title>PLoS computational biology</title><addtitle>PLoS Comput Biol</addtitle><description>A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in >300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data.</description><subject>Adenocarcinoma - genetics</subject><subject>Adenocarcinoma of Lung</subject><subject>Algorithms</subject><subject>Biology</subject><subject>Cancer</subject><subject>Complications and side effects</subject><subject>Computational Biology</subject><subject>Computer Science</subject><subject>Consensus Sequence</subject><subject>Databases, Genetic - statistics & numerical data</subject><subject>Development and progression</subject><subject>DNA Mutational Analysis - statistics & numerical data</subject><subject>DNA, Neoplasm - genetics</subject><subject>Gene Frequency</subject><subject>Gene mutations</subject><subject>Gene Regulatory Networks</subject><subject>Genes</subject><subject>Genetic aspects</subject><subject>Genomes</subject><subject>High-Throughput Nucleotide Sequencing - statistics & numerical data</subject><subject>Humans</subject><subject>Lung Neoplasms - genetics</subject><subject>Medicine</subject><subject>Melanoma</subject><subject>Melanoma - genetics</subject><subject>Models, Genetic</subject><subject>Molecular Sequence Annotation - statistics & numerical data</subject><subject>Mutation</subject><subject>Neoplasms - genetics</subject><subject>Oncogenes</subject><subject>Ontology</subject><subject>Proteins</subject><subject>Risk factors</subject><subject>Tumorigenesis</subject><issn>1553-7358</issn><issn>1553-734X</issn><issn>1553-7358</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>DOA</sourceid><recordid>eNqVkk1v1DAQhiMEomXhHyCIxAUOWfwRxwkHpKriY6UKJD6P1sQZB28Te7GTQvn1eLvbqitxQT7E9jzvm_HMZNljSpaUS_py7efgYFhudGuXlBBeVuROdkyF4IXkor57a3-UPYhxnRhRN9X97IiVglZcsuPMf4PwHYZzDK_yDYbok6X9g10-zhNM1rvc4fTLh_McUuQy2ph7k2-ugheYa3AaQ96jw5ib4MeE_56K7Tns5BF_zui0dX3ewQQPs3sGhoiP9t9F9vXtmy-n74uzj-9Wpydnha44nwoKQradxtowVoMgNW0YA5BMQAfINWOESsEAddPQRsoWjG40Viw92UDN-SJ7uvPdDD6qfa2iooLImlNWkUSsdkTnYa02wY4QLpUHq64ufOgVhMnqAVULZYc17zpj2pKJtgaUgiMxKRWAhiav1_u_ze2IKW83BRgOTA8jzv5Qvb9QvKEVYzIZPN8bBJ_qFSc12qhxGMChn1PeZXpn2QjJEvpsh_aQUrPO-OSot7g64VVDKtakti-y5T-otDocrfYOjU33B4IXB4LETKmVPcwxqtXnT__Bfjhkyx2rg48xoLmpCiVqO8jXzVHbQVb7QU6yJ7creiO6nlz-F9-t8kQ</recordid><startdate>20140201</startdate><enddate>20140201</enddate><creator>Jia, Peilin</creator><creator>Zhao, Zhongming</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISN</scope><scope>ISR</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20140201</creationdate><title>VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data</title><author>Jia, Peilin ; Zhao, Zhongming</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c633t-1a57bdce8f228a5081922aa725adae3c2201752aec991977bafc9ce62155fa833</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Adenocarcinoma - genetics</topic><topic>Adenocarcinoma of Lung</topic><topic>Algorithms</topic><topic>Biology</topic><topic>Cancer</topic><topic>Complications and side effects</topic><topic>Computational Biology</topic><topic>Computer Science</topic><topic>Consensus Sequence</topic><topic>Databases, Genetic - statistics & numerical data</topic><topic>Development and progression</topic><topic>DNA Mutational Analysis - statistics & numerical data</topic><topic>DNA, Neoplasm - genetics</topic><topic>Gene Frequency</topic><topic>Gene mutations</topic><topic>Gene Regulatory Networks</topic><topic>Genes</topic><topic>Genetic aspects</topic><topic>Genomes</topic><topic>High-Throughput Nucleotide Sequencing - statistics & numerical data</topic><topic>Humans</topic><topic>Lung Neoplasms - genetics</topic><topic>Medicine</topic><topic>Melanoma</topic><topic>Melanoma - genetics</topic><topic>Models, Genetic</topic><topic>Molecular Sequence Annotation - statistics & numerical data</topic><topic>Mutation</topic><topic>Neoplasms - genetics</topic><topic>Oncogenes</topic><topic>Ontology</topic><topic>Proteins</topic><topic>Risk factors</topic><topic>Tumorigenesis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jia, Peilin</creatorcontrib><creatorcontrib>Zhao, Zhongming</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Canada</collection><collection>Gale In Context: Science</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PLoS computational biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jia, Peilin</au><au>Zhao, Zhongming</au><au>Radivojac, Predrag</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data</atitle><jtitle>PLoS computational biology</jtitle><addtitle>PLoS Comput Biol</addtitle><date>2014-02-01</date><risdate>2014</risdate><volume>10</volume><issue>2</issue><spage>e1003460</spage><epage>e1003460</epage><pages>e1003460-e1003460</pages><issn>1553-7358</issn><issn>1553-734X</issn><eissn>1553-7358</eissn><abstract>A major challenge in interpreting the large volume of mutation data identified by next-generation sequencing (NGS) is to distinguish driver mutations from neutral passenger mutations to facilitate the identification of targetable genes and new drugs. Current approaches are primarily based on mutation frequencies of single-genes, which lack the power to detect infrequently mutated driver genes and ignore functional interconnection and regulation among cancer genes. We propose a novel mutation network method, VarWalker, to prioritize driver genes in large scale cancer mutation data. VarWalker fits generalized additive models for each sample based on sample-specific mutation profiles and builds on the joint frequency of both mutation genes and their close interactors. These interactors are selected and optimized using the Random Walk with Restart algorithm in a protein-protein interaction network. We applied the method in >300 tumor genomes in two large-scale NGS benchmark datasets: 183 lung adenocarcinoma samples and 121 melanoma samples. In each cancer, we derived a consensus mutation subnetwork containing significantly enriched consensus cancer genes and cancer-related functional pathways. These cancer-specific mutation networks were then validated using independent datasets for each cancer. Importantly, VarWalker prioritizes well-known, infrequently mutated genes, which are shown to interact with highly recurrently mutated genes yet have been ignored by conventional single-gene-based approaches. Utilizing VarWalker, we demonstrated that network-assisted approaches can be effectively adapted to facilitate the detection of cancer driver genes in NGS data.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>24516372</pmid><doi>10.1371/journal.pcbi.1003460</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1553-7358 |
ispartof | PLoS computational biology, 2014-02, Vol.10 (2), p.e1003460-e1003460 |
issn | 1553-7358 1553-734X 1553-7358 |
language | eng |
recordid | cdi_plos_journals_1507831260 |
source | MEDLINE; DOAJ Directory of Open Access Journals; Public Library of Science (PLoS); EZB-FREE-00999 freely available EZB journals; PubMed Central |
subjects | Adenocarcinoma - genetics Adenocarcinoma of Lung Algorithms Biology Cancer Complications and side effects Computational Biology Computer Science Consensus Sequence Databases, Genetic - statistics & numerical data Development and progression DNA Mutational Analysis - statistics & numerical data DNA, Neoplasm - genetics Gene Frequency Gene mutations Gene Regulatory Networks Genes Genetic aspects Genomes High-Throughput Nucleotide Sequencing - statistics & numerical data Humans Lung Neoplasms - genetics Medicine Melanoma Melanoma - genetics Models, Genetic Molecular Sequence Annotation - statistics & numerical data Mutation Neoplasms - genetics Oncogenes Ontology Proteins Risk factors Tumorigenesis |
title | VarWalker: personalized mutation network analysis of putative cancer genes from next-generation sequencing data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T06%3A45%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=VarWalker:%20personalized%20mutation%20network%20analysis%20of%20putative%20cancer%20genes%20from%20next-generation%20sequencing%20data&rft.jtitle=PLoS%20computational%20biology&rft.au=Jia,%20Peilin&rft.date=2014-02-01&rft.volume=10&rft.issue=2&rft.spage=e1003460&rft.epage=e1003460&rft.pages=e1003460-e1003460&rft.issn=1553-7358&rft.eissn=1553-7358&rft_id=info:doi/10.1371/journal.pcbi.1003460&rft_dat=%3Cgale_plos_%3EA369062955%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1499149572&rft_id=info:pmid/24516372&rft_galeid=A369062955&rft_doaj_id=oai_doaj_org_article_ba4de83ddffb425b8ae753e0fae3aa91&rfr_iscdi=true |