Spatial distribution of disease-associated variants in three-dimensional structures of protein complexes

Next-generation sequencing enables simultaneous analysis of hundreds of human genomes associated with a particular phenotype, for example, a disease. These genomes naturally contain a lot of sequence variation that ranges from single-nucleotide variants (SNVs) to large-scale structural rearrangement...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Oncogenesis (New York, NY) NY), 2017-09, Vol.6 (9), p.e380-e380
Hauptverfasser:	Gress, A, Ramensky, V, Kalinina, O V
Format:	Artikel
Sprache:	eng
Schlagworte:	631/67/395 631/67/69 Apoptosis Cancer Cell Biology Computer applications Deoxyribonucleic acid Disease DNA DNA-binding protein Genomes Genotype & phenotype Homology Human Genetics Interfaces Internal Medicine Ligands Mathematical models Medicine Medicine & Public Health Mutation Nucleic acids Oncology Original original-article Protein interaction Proteins Spatial distribution
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	e380
container_issue	9
container_start_page	e380
container_title	Oncogenesis (New York, NY)
container_volume	6
creator	Gress, A Ramensky, V Kalinina, O V
description	Next-generation sequencing enables simultaneous analysis of hundreds of human genomes associated with a particular phenotype, for example, a disease. These genomes naturally contain a lot of sequence variation that ranges from single-nucleotide variants (SNVs) to large-scale structural rearrangements. In order to establish a functional connection between genotype and disease-associated phenotypes, one needs to distinguish disease drivers from neutral passenger variants. Functional annotation based on experimental assays is feasible only for a limited number of candidate mutations. Thus alternative computational tools are needed. A possible approach to annotating mutations functionally is to consider their spatial location relative to functionally relevant sites in three-dimensional (3D) structures of the harboring proteins. This is impeded by the lack of available protein 3D structures. Complementing experimentally resolved structures with reliable computational models is an attractive alternative. We developed a structure-based approach to characterizing comprehensive sets of non-synonymous single-nucleotide variants (nsSNVs): associated with cancer, non-cancer diseases and putatively functionally neutral. We searched experimentally resolved protein 3D structures for potential homology-modeling templates for proteins harboring corresponding mutations. We found such templates for all proteins with disease-associated nsSNVs, and 51 and 66% of proteins carrying common polymorphisms and annotated benign variants. Many mutations caused by nsSNVs can be found in protein–protein, protein–nucleic acid or protein–ligand complexes. Correction for the number of available templates per protein reveals that protein–protein interaction interfaces are not enriched in either cancer nsSNVs, or nsSNVs associated with non-cancer diseases. Whereas cancer-associated mutations are enriched in DNA-binding proteins, they are rarely located directly in DNA-interacting interfaces. In contrast, mutations associated with non-cancer diseases are in general rare in DNA-binding proteins, but enriched in DNA-interacting interfaces in these proteins. All disease-associated nsSNVs are overrepresented in ligand-binding pockets, and nsSNVs associated with non-cancer diseases are additionally enriched in protein core, where they probably affect overall protein stability.
doi_str_mv	10.1038/oncsis.2017.79
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5623905</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1942188851</sourcerecordid><originalsourceid>FETCH-LOGICAL-c458t-4036b4acd41d8503f8786dc819e33ad6b6d83467eefccf2364dca29a3c94a553</originalsourceid><addsrcrecordid>eNptkUtr3TAQhU1pSUKabZbF0E03vtHb8qZQQl8QyCLZC11pnKtgW65GDu2_r4zTcFuqjSTmO0czOlV1ScmOEq6v4uQw4I4R2u7a7lV1xqhsm44w8frofFpdID6SsqSiSsqT6pTpTkhG1Vl1uJttDnaofcCcwn7JIU517Nc7WITGIkYXbAZfP9kU7JSxDlOdDwmg8WGECYuiGBT54vKSAFf5nGKGwrk4zgP8BHxbventgHDxvJ9X918-319_a25uv36__nTTOCF1bgThai-s84J6LQnvdauVd5p2wLn1aq-85kK1AL1zPeNKeGdZZ7nrhJWSn1cfN9t52Y_gHUw52cHMKYw2_TLRBvN3ZQoH8xCfjFSMd2Q1-PBskOKPBTCbMaCDYbATxAUN7QRrCdGMFfT9P-hjXFL5i42iWmtJC7XbKJciYoL-pRlKzBqj2WI0a4ym7Yrg3fEIL_if0ApwtQFYStMDpKN3_2_5G_zJrT4</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1942188851</pqid></control><display><type>article</type><title>Spatial distribution of disease-associated variants in three-dimensional structures of protein complexes</title><source>DOAJ Directory of Open Access Journals</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><source>Nature Free</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Gress, A ; Ramensky, V ; Kalinina, O V</creator><creatorcontrib>Gress, A ; Ramensky, V ; Kalinina, O V</creatorcontrib><description>Next-generation sequencing enables simultaneous analysis of hundreds of human genomes associated with a particular phenotype, for example, a disease. These genomes naturally contain a lot of sequence variation that ranges from single-nucleotide variants (SNVs) to large-scale structural rearrangements. In order to establish a functional connection between genotype and disease-associated phenotypes, one needs to distinguish disease drivers from neutral passenger variants. Functional annotation based on experimental assays is feasible only for a limited number of candidate mutations. Thus alternative computational tools are needed. A possible approach to annotating mutations functionally is to consider their spatial location relative to functionally relevant sites in three-dimensional (3D) structures of the harboring proteins. This is impeded by the lack of available protein 3D structures. Complementing experimentally resolved structures with reliable computational models is an attractive alternative. We developed a structure-based approach to characterizing comprehensive sets of non-synonymous single-nucleotide variants (nsSNVs): associated with cancer, non-cancer diseases and putatively functionally neutral. We searched experimentally resolved protein 3D structures for potential homology-modeling templates for proteins harboring corresponding mutations. We found such templates for all proteins with disease-associated nsSNVs, and 51 and 66% of proteins carrying common polymorphisms and annotated benign variants. Many mutations caused by nsSNVs can be found in protein–protein, protein–nucleic acid or protein–ligand complexes. Correction for the number of available templates per protein reveals that protein–protein interaction interfaces are not enriched in either cancer nsSNVs, or nsSNVs associated with non-cancer diseases. Whereas cancer-associated mutations are enriched in DNA-binding proteins, they are rarely located directly in DNA-interacting interfaces. In contrast, mutations associated with non-cancer diseases are in general rare in DNA-binding proteins, but enriched in DNA-interacting interfaces in these proteins. All disease-associated nsSNVs are overrepresented in ligand-binding pockets, and nsSNVs associated with non-cancer diseases are additionally enriched in protein core, where they probably affect overall protein stability.</description><identifier>ISSN: 2157-9024</identifier><identifier>EISSN: 2157-9024</identifier><identifier>DOI: 10.1038/oncsis.2017.79</identifier><identifier>PMID: 28945216</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>631/67/395 ; 631/67/69 ; Apoptosis ; Cancer ; Cell Biology ; Computer applications ; Deoxyribonucleic acid ; Disease ; DNA ; DNA-binding protein ; Genomes ; Genotype & phenotype ; Homology ; Human Genetics ; Interfaces ; Internal Medicine ; Ligands ; Mathematical models ; Medicine ; Medicine & Public Health ; Mutation ; Nucleic acids ; Oncology ; Original ; original-article ; Protein interaction ; Proteins ; Spatial distribution</subject><ispartof>Oncogenesis (New York, NY), 2017-09, Vol.6 (9), p.e380-e380</ispartof><rights>The Author(s) 2017</rights><rights>Copyright Nature Publishing Group Sep 2017</rights><rights>Copyright © 2017 The Author(s) 2017 The Author(s)</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c458t-4036b4acd41d8503f8786dc819e33ad6b6d83467eefccf2364dca29a3c94a553</citedby><cites>FETCH-LOGICAL-c458t-4036b4acd41d8503f8786dc819e33ad6b6d83467eefccf2364dca29a3c94a553</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5623905/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5623905/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,27923,27924,41119,42188,51575,53790,53792</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28945216$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Gress, A</creatorcontrib><creatorcontrib>Ramensky, V</creatorcontrib><creatorcontrib>Kalinina, O V</creatorcontrib><title>Spatial distribution of disease-associated variants in three-dimensional structures of protein complexes</title><title>Oncogenesis (New York, NY)</title><addtitle>Oncogenesis</addtitle><addtitle>Oncogenesis</addtitle><description>Next-generation sequencing enables simultaneous analysis of hundreds of human genomes associated with a particular phenotype, for example, a disease. These genomes naturally contain a lot of sequence variation that ranges from single-nucleotide variants (SNVs) to large-scale structural rearrangements. In order to establish a functional connection between genotype and disease-associated phenotypes, one needs to distinguish disease drivers from neutral passenger variants. Functional annotation based on experimental assays is feasible only for a limited number of candidate mutations. Thus alternative computational tools are needed. A possible approach to annotating mutations functionally is to consider their spatial location relative to functionally relevant sites in three-dimensional (3D) structures of the harboring proteins. This is impeded by the lack of available protein 3D structures. Complementing experimentally resolved structures with reliable computational models is an attractive alternative. We developed a structure-based approach to characterizing comprehensive sets of non-synonymous single-nucleotide variants (nsSNVs): associated with cancer, non-cancer diseases and putatively functionally neutral. We searched experimentally resolved protein 3D structures for potential homology-modeling templates for proteins harboring corresponding mutations. We found such templates for all proteins with disease-associated nsSNVs, and 51 and 66% of proteins carrying common polymorphisms and annotated benign variants. Many mutations caused by nsSNVs can be found in protein–protein, protein–nucleic acid or protein–ligand complexes. Correction for the number of available templates per protein reveals that protein–protein interaction interfaces are not enriched in either cancer nsSNVs, or nsSNVs associated with non-cancer diseases. Whereas cancer-associated mutations are enriched in DNA-binding proteins, they are rarely located directly in DNA-interacting interfaces. In contrast, mutations associated with non-cancer diseases are in general rare in DNA-binding proteins, but enriched in DNA-interacting interfaces in these proteins. All disease-associated nsSNVs are overrepresented in ligand-binding pockets, and nsSNVs associated with non-cancer diseases are additionally enriched in protein core, where they probably affect overall protein stability.</description><subject>631/67/395</subject><subject>631/67/69</subject><subject>Apoptosis</subject><subject>Cancer</subject><subject>Cell Biology</subject><subject>Computer applications</subject><subject>Deoxyribonucleic acid</subject><subject>Disease</subject><subject>DNA</subject><subject>DNA-binding protein</subject><subject>Genomes</subject><subject>Genotype & phenotype</subject><subject>Homology</subject><subject>Human Genetics</subject><subject>Interfaces</subject><subject>Internal Medicine</subject><subject>Ligands</subject><subject>Mathematical models</subject><subject>Medicine</subject><subject>Medicine & Public Health</subject><subject>Mutation</subject><subject>Nucleic acids</subject><subject>Oncology</subject><subject>Original</subject><subject>original-article</subject><subject>Protein interaction</subject><subject>Proteins</subject><subject>Spatial distribution</subject><issn>2157-9024</issn><issn>2157-9024</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNptkUtr3TAQhU1pSUKabZbF0E03vtHb8qZQQl8QyCLZC11pnKtgW65GDu2_r4zTcFuqjSTmO0czOlV1ScmOEq6v4uQw4I4R2u7a7lV1xqhsm44w8frofFpdID6SsqSiSsqT6pTpTkhG1Vl1uJttDnaofcCcwn7JIU517Nc7WITGIkYXbAZfP9kU7JSxDlOdDwmg8WGECYuiGBT54vKSAFf5nGKGwrk4zgP8BHxbventgHDxvJ9X918-319_a25uv36__nTTOCF1bgThai-s84J6LQnvdauVd5p2wLn1aq-85kK1AL1zPeNKeGdZZ7nrhJWSn1cfN9t52Y_gHUw52cHMKYw2_TLRBvN3ZQoH8xCfjFSMd2Q1-PBskOKPBTCbMaCDYbATxAUN7QRrCdGMFfT9P-hjXFL5i42iWmtJC7XbKJciYoL-pRlKzBqj2WI0a4ym7Yrg3fEIL_if0ApwtQFYStMDpKN3_2_5G_zJrT4</recordid><startdate>20170925</startdate><enddate>20170925</enddate><creator>Gress, A</creator><creator>Ramensky, V</creator><creator>Kalinina, O V</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><scope>C6C</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20170925</creationdate><title>Spatial distribution of disease-associated variants in three-dimensional structures of protein complexes</title><author>Gress, A ; Ramensky, V ; Kalinina, O V</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c458t-4036b4acd41d8503f8786dc819e33ad6b6d83467eefccf2364dca29a3c94a553</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>631/67/395</topic><topic>631/67/69</topic><topic>Apoptosis</topic><topic>Cancer</topic><topic>Cell Biology</topic><topic>Computer applications</topic><topic>Deoxyribonucleic acid</topic><topic>Disease</topic><topic>DNA</topic><topic>DNA-binding protein</topic><topic>Genomes</topic><topic>Genotype & phenotype</topic><topic>Homology</topic><topic>Human Genetics</topic><topic>Interfaces</topic><topic>Internal Medicine</topic><topic>Ligands</topic><topic>Mathematical models</topic><topic>Medicine</topic><topic>Medicine & Public Health</topic><topic>Mutation</topic><topic>Nucleic acids</topic><topic>Oncology</topic><topic>Original</topic><topic>original-article</topic><topic>Protein interaction</topic><topic>Proteins</topic><topic>Spatial distribution</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gress, A</creatorcontrib><creatorcontrib>Ramensky, V</creatorcontrib><creatorcontrib>Kalinina, O V</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Oncogenesis (New York, NY)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gress, A</au><au>Ramensky, V</au><au>Kalinina, O V</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Spatial distribution of disease-associated variants in three-dimensional structures of protein complexes</atitle><jtitle>Oncogenesis (New York, NY)</jtitle><stitle>Oncogenesis</stitle><addtitle>Oncogenesis</addtitle><date>2017-09-25</date><risdate>2017</risdate><volume>6</volume><issue>9</issue><spage>e380</spage><epage>e380</epage><pages>e380-e380</pages><issn>2157-9024</issn><eissn>2157-9024</eissn><abstract>Next-generation sequencing enables simultaneous analysis of hundreds of human genomes associated with a particular phenotype, for example, a disease. These genomes naturally contain a lot of sequence variation that ranges from single-nucleotide variants (SNVs) to large-scale structural rearrangements. In order to establish a functional connection between genotype and disease-associated phenotypes, one needs to distinguish disease drivers from neutral passenger variants. Functional annotation based on experimental assays is feasible only for a limited number of candidate mutations. Thus alternative computational tools are needed. A possible approach to annotating mutations functionally is to consider their spatial location relative to functionally relevant sites in three-dimensional (3D) structures of the harboring proteins. This is impeded by the lack of available protein 3D structures. Complementing experimentally resolved structures with reliable computational models is an attractive alternative. We developed a structure-based approach to characterizing comprehensive sets of non-synonymous single-nucleotide variants (nsSNVs): associated with cancer, non-cancer diseases and putatively functionally neutral. We searched experimentally resolved protein 3D structures for potential homology-modeling templates for proteins harboring corresponding mutations. We found such templates for all proteins with disease-associated nsSNVs, and 51 and 66% of proteins carrying common polymorphisms and annotated benign variants. Many mutations caused by nsSNVs can be found in protein–protein, protein–nucleic acid or protein–ligand complexes. Correction for the number of available templates per protein reveals that protein–protein interaction interfaces are not enriched in either cancer nsSNVs, or nsSNVs associated with non-cancer diseases. Whereas cancer-associated mutations are enriched in DNA-binding proteins, they are rarely located directly in DNA-interacting interfaces. In contrast, mutations associated with non-cancer diseases are in general rare in DNA-binding proteins, but enriched in DNA-interacting interfaces in these proteins. All disease-associated nsSNVs are overrepresented in ligand-binding pockets, and nsSNVs associated with non-cancer diseases are additionally enriched in protein core, where they probably affect overall protein stability.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><pmid>28945216</pmid><doi>10.1038/oncsis.2017.79</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2157-9024
ispartof	Oncogenesis (New York, NY), 2017-09, Vol.6 (9), p.e380-e380
issn	2157-9024 2157-9024
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5623905
source	DOAJ Directory of Open Access Journals; PubMed Central Open Access; Springer Nature OA Free Journals; Nature Free; EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects	631/67/395 631/67/69 Apoptosis Cancer Cell Biology Computer applications Deoxyribonucleic acid Disease DNA DNA-binding protein Genomes Genotype & phenotype Homology Human Genetics Interfaces Internal Medicine Ligands Mathematical models Medicine Medicine & Public Health Mutation Nucleic acids Oncology Original original-article Protein interaction Proteins Spatial distribution
title	Spatial distribution of disease-associated variants in three-dimensional structures of protein complexes
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T14%3A17%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Spatial%20distribution%20of%20disease-associated%20variants%20in%20three-dimensional%20structures%20of%20protein%20complexes&rft.jtitle=Oncogenesis%20(New%20York,%20NY)&rft.au=Gress,%20A&rft.date=2017-09-25&rft.volume=6&rft.issue=9&rft.spage=e380&rft.epage=e380&rft.pages=e380-e380&rft.issn=2157-9024&rft.eissn=2157-9024&rft_id=info:doi/10.1038/oncsis.2017.79&rft_dat=%3Cproquest_pubme%3E1942188851%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1942188851&rft_id=info:pmid/28945216&rfr_iscdi=true