Structure‐conditioned amino‐acid couplings: How contact geometry affects pairwise sequence preferences
Relating a protein's sequence to its conformation is a central challenge for both structure prediction and sequence design. Statistical contact potentials, as well as their more descriptive versions that account for side‐chain orientation and other geometric descriptors, have served as simplist...
Gespeichert in:
Veröffentlicht in: | Protein science 2022-04, Vol.31 (4), p.900-917 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 917 |
---|---|
container_issue | 4 |
container_start_page | 900 |
container_title | Protein science |
container_volume | 31 |
creator | Holland, Jack Grigoryan, Gevorg |
description | Relating a protein's sequence to its conformation is a central challenge for both structure prediction and sequence design. Statistical contact potentials, as well as their more descriptive versions that account for side‐chain orientation and other geometric descriptors, have served as simplistic but useful means of representing second‐order contributions in sequence–structure relationships. Here we ask what happens when a pairwise potential is conditioned on the fully defined geometry of interacting backbones fragments. We show that the resulting structure‐conditioned coupling energies more accurately reflect pair preferences as a function of structural contexts. These structure‐conditioned energies more reliably encode native sequence information and more highly correlate with experimentally determined coupling energies. Clustering a database of interaction motifs by structure results in ensembles of similar energies and clustering them by energy results in ensembles of similar structures. By comparing many pairs of interaction motifs and showing that structural similarity and energetic similarity go hand‐in‐hand, we provide a tangible link between modular sequence and structure elements. This link is applicable to structural modeling, and we show that scoring CASP models with structured‐conditioned energies results in substantially higher correlation with structural quality than scoring the same models with a contact potential. We conclude that structure‐conditioned coupling energies are a good way to model the impact of interaction geometry on second‐order sequence preferences. |
doi_str_mv | 10.1002/pro.4280 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8927866</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2639932263</sourcerecordid><originalsourceid>FETCH-LOGICAL-c4380-f509cf8d47a7c514b238febdf23e6522f4e606342706c889c9e958a9f05a95323</originalsourceid><addsrcrecordid>eNp1kd9qFDEUh0NR2rUW-gQy4E1vpmaSTCbxQihFrVBoqQrehWzmZM0yk4xJpsve-Qg-o09i1tb6B7w6OcnHxzn5IXTc4NMGY_JiiuGUEYH30KJhXNZC8k-P0AJL3tSCcnGAnqS0xhizhtB9dEBbzDEhzQKt3-c4mzxH-P71mwm-d9kFD32lR-dDudPG9ZUJ8zQ4v0ovq4uwKa3P2uRqBWGEHLeVthZMTtWkXdy4BFWCLzN4A9UUwULcHdNT9NjqIcHRfT1EH9-8_nB-UV9evX13fnZZG0YFrm2LpbGiZ53uTNuwJaHCwrK3hAJvCbEMOOaUkQ5zI4Q0EmQrtLS41bKlhB6iV3feaV6O0BvwOepBTdGNOm5V0E79_eLdZ7UKt0pI0gnOi-DkXhBDWSNlNbpkYBi0hzAnRTghpGOMtQV9_g-6DnP0Zb1CUSkpKeW30MSQUvmRh2EarHYBlj6oXYAFffbn8A_gr8QKUN8BGzfA9r8idX1z9VP4A5hfqOg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2639932263</pqid></control><display><type>article</type><title>Structure‐conditioned amino‐acid couplings: How contact geometry affects pairwise sequence preferences</title><source>Wiley Online Library - AutoHoldings Journals</source><source>MEDLINE</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Wiley Online Library (Open Access Collection)</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><creator>Holland, Jack ; Grigoryan, Gevorg</creator><creatorcontrib>Holland, Jack ; Grigoryan, Gevorg</creatorcontrib><description>Relating a protein's sequence to its conformation is a central challenge for both structure prediction and sequence design. Statistical contact potentials, as well as their more descriptive versions that account for side‐chain orientation and other geometric descriptors, have served as simplistic but useful means of representing second‐order contributions in sequence–structure relationships. Here we ask what happens when a pairwise potential is conditioned on the fully defined geometry of interacting backbones fragments. We show that the resulting structure‐conditioned coupling energies more accurately reflect pair preferences as a function of structural contexts. These structure‐conditioned energies more reliably encode native sequence information and more highly correlate with experimentally determined coupling energies. Clustering a database of interaction motifs by structure results in ensembles of similar energies and clustering them by energy results in ensembles of similar structures. By comparing many pairs of interaction motifs and showing that structural similarity and energetic similarity go hand‐in‐hand, we provide a tangible link between modular sequence and structure elements. This link is applicable to structural modeling, and we show that scoring CASP models with structured‐conditioned energies results in substantially higher correlation with structural quality than scoring the same models with a contact potential. We conclude that structure‐conditioned coupling energies are a good way to model the impact of interaction geometry on second‐order sequence preferences.</description><identifier>ISSN: 0961-8368</identifier><identifier>EISSN: 1469-896X</identifier><identifier>DOI: 10.1002/pro.4280</identifier><identifier>PMID: 35060221</identifier><language>eng</language><publisher>Hoboken, USA: John Wiley & Sons, Inc</publisher><subject>Amino acid sequence ; Amino Acids - chemistry ; Clustering ; contact potential ; Contact potentials ; coupling energy ; Couplings ; Full‐Length Paper ; Full‐Length Papers ; Geometry ; Models, Molecular ; Modular structures ; Protein Conformation ; sequence–structure relationships ; Similarity ; statistical energy ; structural modeling ; tertiary motifs</subject><ispartof>Protein science, 2022-04, Vol.31 (4), p.900-917</ispartof><rights>2022 The Authors. published by Wiley Periodicals LLC on behalf of The Protein Society.</rights><rights>2022 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society.</rights><rights>2022. This article is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c4380-f509cf8d47a7c514b238febdf23e6522f4e606342706c889c9e958a9f05a95323</citedby><cites>FETCH-LOGICAL-c4380-f509cf8d47a7c514b238febdf23e6522f4e606342706c889c9e958a9f05a95323</cites><orcidid>0000-0003-4645-7666 ; 0000-0001-6682-0659</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8927866/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8927866/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,315,729,782,786,887,1419,1435,27931,27932,45581,45582,46416,46840,53798,53800</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35060221$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Holland, Jack</creatorcontrib><creatorcontrib>Grigoryan, Gevorg</creatorcontrib><title>Structure‐conditioned amino‐acid couplings: How contact geometry affects pairwise sequence preferences</title><title>Protein science</title><addtitle>Protein Sci</addtitle><description>Relating a protein's sequence to its conformation is a central challenge for both structure prediction and sequence design. Statistical contact potentials, as well as their more descriptive versions that account for side‐chain orientation and other geometric descriptors, have served as simplistic but useful means of representing second‐order contributions in sequence–structure relationships. Here we ask what happens when a pairwise potential is conditioned on the fully defined geometry of interacting backbones fragments. We show that the resulting structure‐conditioned coupling energies more accurately reflect pair preferences as a function of structural contexts. These structure‐conditioned energies more reliably encode native sequence information and more highly correlate with experimentally determined coupling energies. Clustering a database of interaction motifs by structure results in ensembles of similar energies and clustering them by energy results in ensembles of similar structures. By comparing many pairs of interaction motifs and showing that structural similarity and energetic similarity go hand‐in‐hand, we provide a tangible link between modular sequence and structure elements. This link is applicable to structural modeling, and we show that scoring CASP models with structured‐conditioned energies results in substantially higher correlation with structural quality than scoring the same models with a contact potential. We conclude that structure‐conditioned coupling energies are a good way to model the impact of interaction geometry on second‐order sequence preferences.</description><subject>Amino acid sequence</subject><subject>Amino Acids - chemistry</subject><subject>Clustering</subject><subject>contact potential</subject><subject>Contact potentials</subject><subject>coupling energy</subject><subject>Couplings</subject><subject>Full‐Length Paper</subject><subject>Full‐Length Papers</subject><subject>Geometry</subject><subject>Models, Molecular</subject><subject>Modular structures</subject><subject>Protein Conformation</subject><subject>sequence–structure relationships</subject><subject>Similarity</subject><subject>statistical energy</subject><subject>structural modeling</subject><subject>tertiary motifs</subject><issn>0961-8368</issn><issn>1469-896X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><sourceid>WIN</sourceid><sourceid>EIF</sourceid><recordid>eNp1kd9qFDEUh0NR2rUW-gQy4E1vpmaSTCbxQihFrVBoqQrehWzmZM0yk4xJpsve-Qg-o09i1tb6B7w6OcnHxzn5IXTc4NMGY_JiiuGUEYH30KJhXNZC8k-P0AJL3tSCcnGAnqS0xhizhtB9dEBbzDEhzQKt3-c4mzxH-P71mwm-d9kFD32lR-dDudPG9ZUJ8zQ4v0ovq4uwKa3P2uRqBWGEHLeVthZMTtWkXdy4BFWCLzN4A9UUwULcHdNT9NjqIcHRfT1EH9-8_nB-UV9evX13fnZZG0YFrm2LpbGiZ53uTNuwJaHCwrK3hAJvCbEMOOaUkQ5zI4Q0EmQrtLS41bKlhB6iV3feaV6O0BvwOepBTdGNOm5V0E79_eLdZ7UKt0pI0gnOi-DkXhBDWSNlNbpkYBi0hzAnRTghpGOMtQV9_g-6DnP0Zb1CUSkpKeW30MSQUvmRh2EarHYBlj6oXYAFffbn8A_gr8QKUN8BGzfA9r8idX1z9VP4A5hfqOg</recordid><startdate>202204</startdate><enddate>202204</enddate><creator>Holland, Jack</creator><creator>Grigoryan, Gevorg</creator><general>John Wiley & Sons, Inc</general><general>Wiley Subscription Services, Inc</general><scope>24P</scope><scope>WIN</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QO</scope><scope>7T5</scope><scope>7TM</scope><scope>7U9</scope><scope>8FD</scope><scope>FR3</scope><scope>H94</scope><scope>K9.</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-4645-7666</orcidid><orcidid>https://orcid.org/0000-0001-6682-0659</orcidid></search><sort><creationdate>202204</creationdate><title>Structure‐conditioned amino‐acid couplings: How contact geometry affects pairwise sequence preferences</title><author>Holland, Jack ; Grigoryan, Gevorg</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c4380-f509cf8d47a7c514b238febdf23e6522f4e606342706c889c9e958a9f05a95323</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Amino acid sequence</topic><topic>Amino Acids - chemistry</topic><topic>Clustering</topic><topic>contact potential</topic><topic>Contact potentials</topic><topic>coupling energy</topic><topic>Couplings</topic><topic>Full‐Length Paper</topic><topic>Full‐Length Papers</topic><topic>Geometry</topic><topic>Models, Molecular</topic><topic>Modular structures</topic><topic>Protein Conformation</topic><topic>sequence–structure relationships</topic><topic>Similarity</topic><topic>statistical energy</topic><topic>structural modeling</topic><topic>tertiary motifs</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Holland, Jack</creatorcontrib><creatorcontrib>Grigoryan, Gevorg</creatorcontrib><collection>Wiley Online Library (Open Access Collection)</collection><collection>Wiley Online Library (Open Access Collection)</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Biotechnology Research Abstracts</collection><collection>Immunology Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Protein science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Holland, Jack</au><au>Grigoryan, Gevorg</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Structure‐conditioned amino‐acid couplings: How contact geometry affects pairwise sequence preferences</atitle><jtitle>Protein science</jtitle><addtitle>Protein Sci</addtitle><date>2022-04</date><risdate>2022</risdate><volume>31</volume><issue>4</issue><spage>900</spage><epage>917</epage><pages>900-917</pages><issn>0961-8368</issn><eissn>1469-896X</eissn><abstract>Relating a protein's sequence to its conformation is a central challenge for both structure prediction and sequence design. Statistical contact potentials, as well as their more descriptive versions that account for side‐chain orientation and other geometric descriptors, have served as simplistic but useful means of representing second‐order contributions in sequence–structure relationships. Here we ask what happens when a pairwise potential is conditioned on the fully defined geometry of interacting backbones fragments. We show that the resulting structure‐conditioned coupling energies more accurately reflect pair preferences as a function of structural contexts. These structure‐conditioned energies more reliably encode native sequence information and more highly correlate with experimentally determined coupling energies. Clustering a database of interaction motifs by structure results in ensembles of similar energies and clustering them by energy results in ensembles of similar structures. By comparing many pairs of interaction motifs and showing that structural similarity and energetic similarity go hand‐in‐hand, we provide a tangible link between modular sequence and structure elements. This link is applicable to structural modeling, and we show that scoring CASP models with structured‐conditioned energies results in substantially higher correlation with structural quality than scoring the same models with a contact potential. We conclude that structure‐conditioned coupling energies are a good way to model the impact of interaction geometry on second‐order sequence preferences.</abstract><cop>Hoboken, USA</cop><pub>John Wiley & Sons, Inc</pub><pmid>35060221</pmid><doi>10.1002/pro.4280</doi><tpages>18</tpages><orcidid>https://orcid.org/0000-0003-4645-7666</orcidid><orcidid>https://orcid.org/0000-0001-6682-0659</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0961-8368 |
ispartof | Protein science, 2022-04, Vol.31 (4), p.900-917 |
issn | 0961-8368 1469-896X |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8927866 |
source | Wiley Online Library - AutoHoldings Journals; MEDLINE; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Wiley Online Library (Open Access Collection); PubMed Central; Free Full-Text Journals in Chemistry |
subjects | Amino acid sequence Amino Acids - chemistry Clustering contact potential Contact potentials coupling energy Couplings Full‐Length Paper Full‐Length Papers Geometry Models, Molecular Modular structures Protein Conformation sequence–structure relationships Similarity statistical energy structural modeling tertiary motifs |
title | Structure‐conditioned amino‐acid couplings: How contact geometry affects pairwise sequence preferences |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-04T05%3A00%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Structure%E2%80%90conditioned%20amino%E2%80%90acid%20couplings:%20How%20contact%20geometry%20affects%20pairwise%20sequence%20preferences&rft.jtitle=Protein%20science&rft.au=Holland,%20Jack&rft.date=2022-04&rft.volume=31&rft.issue=4&rft.spage=900&rft.epage=917&rft.pages=900-917&rft.issn=0961-8368&rft.eissn=1469-896X&rft_id=info:doi/10.1002/pro.4280&rft_dat=%3Cproquest_pubme%3E2639932263%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2639932263&rft_id=info:pmid/35060221&rfr_iscdi=true |