A common root for coevolution and substitution rate variability in protein sequence evolution

We introduce a simple model that describes the average occurrence of point variations in a generic protein sequence. This model is based on the idea that mutations are more likely to be fixed at sites in contact with others that have mutated in the recent past. Therefore, we extend the usual assumpt...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific reports 2019-12, Vol.9 (1), p.18032-10, Article 18032
Hauptverfasser: Rizzato, Francesca, Zamuner, Stefano, Pagnani, Andrea, Laio, Alessandro
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 10
container_issue 1
container_start_page 18032
container_title Scientific reports
container_volume 9
creator Rizzato, Francesca
Zamuner, Stefano
Pagnani, Andrea
Laio, Alessandro
description We introduce a simple model that describes the average occurrence of point variations in a generic protein sequence. This model is based on the idea that mutations are more likely to be fixed at sites in contact with others that have mutated in the recent past. Therefore, we extend the usual assumptions made in protein coevolution by introducing a time dumping on the effect of a substitution on its surrounding and makes correlated substitutions happen in avalanches localized in space and time. The model correctly predicts the average correlation of substitutions as a function of their distance along the sequence. At the same time, it predicts an among-site distribution of the number of substitutions per site highly compatible with a negative binomial, consistently with experimental data. The promising outcomes achieved with this model encourage the application of the same ideas in the field of pairwise and multiple sequence alignment.
doi_str_mv 10.1038/s41598-019-53958-w
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6888882</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2321213644</sourcerecordid><originalsourceid>FETCH-LOGICAL-c474t-a989125960ebf0b4eea0627f066eb07abd37ca9bb9ae9c69b8f2e0b1cdf5ec233</originalsourceid><addsrcrecordid>eNp9UU1PAjEQbYxGCPIHPJhNPK_2Y796MSHEr4TEix5N05ZZLIEttl0I_97iIuLFuUw7896baR9ClwTfEMyqW5-RnFcpJjzNGc-rdHOC-hRneUoZpadH5x4aej_HMXLKM8LPUY-RklPKeB-9jxJtl0vbJM7akNTWxTus7aINJhZlM018q3wwoSs4GSBZS2ekMgsTtolpkpWzAWL28NlCoyE58C_QWS0XHob7PEBvD_ev46d08vL4PB5NUp2VWUglrzihOS8wqBqrDEDigpY1LgpQuJRqykotuVJcAtcFV1VNASuip3UOmjI2QHed7qpVS5hqaIKTC7FyZindVlhpxN9OYz7EzK5FUe2CRoHrvYCz8RE-iLltXRN3FvELCSWsyLKIoh1KO-u9g_owgWCxc0V0rojoivh2RWwi6ep4twPlx4MIYB3Ax1YzA_c7-x_ZL0OpnIE</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2321213644</pqid></control><display><type>article</type><title>A common root for coevolution and substitution rate variability in protein sequence evolution</title><source>MEDLINE</source><source>Nature Free</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>Free Full-Text Journals in Chemistry</source><source>Springer Nature OA Free Journals</source><creator>Rizzato, Francesca ; Zamuner, Stefano ; Pagnani, Andrea ; Laio, Alessandro</creator><creatorcontrib>Rizzato, Francesca ; Zamuner, Stefano ; Pagnani, Andrea ; Laio, Alessandro</creatorcontrib><description>We introduce a simple model that describes the average occurrence of point variations in a generic protein sequence. This model is based on the idea that mutations are more likely to be fixed at sites in contact with others that have mutated in the recent past. Therefore, we extend the usual assumptions made in protein coevolution by introducing a time dumping on the effect of a substitution on its surrounding and makes correlated substitutions happen in avalanches localized in space and time. The model correctly predicts the average correlation of substitutions as a function of their distance along the sequence. At the same time, it predicts an among-site distribution of the number of substitutions per site highly compatible with a negative binomial, consistently with experimental data. The promising outcomes achieved with this model encourage the application of the same ideas in the field of pairwise and multiple sequence alignment.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-019-53958-w</identifier><identifier>PMID: 31792239</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>631/114 ; 631/114/2397 ; 631/181/2474 ; 631/181/2481 ; 639/766/747 ; Amino Acid Sequence - genetics ; Amino Acid Substitution ; Codon - genetics ; Coevolution ; Evolution, Molecular ; Humanities and Social Sciences ; Humans ; Landslides ; Models, Genetic ; multidisciplinary ; Nucleotide sequence ; Point Mutation ; Proteins ; Science ; Science (multidisciplinary) ; Sequence Alignment</subject><ispartof>Scientific reports, 2019-12, Vol.9 (1), p.18032-10, Article 18032</ispartof><rights>The Author(s) 2019</rights><rights>2019. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c474t-a989125960ebf0b4eea0627f066eb07abd37ca9bb9ae9c69b8f2e0b1cdf5ec233</citedby><cites>FETCH-LOGICAL-c474t-a989125960ebf0b4eea0627f066eb07abd37ca9bb9ae9c69b8f2e0b1cdf5ec233</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6888882/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6888882/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,27901,27902,41096,42165,51551,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/31792239$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Rizzato, Francesca</creatorcontrib><creatorcontrib>Zamuner, Stefano</creatorcontrib><creatorcontrib>Pagnani, Andrea</creatorcontrib><creatorcontrib>Laio, Alessandro</creatorcontrib><title>A common root for coevolution and substitution rate variability in protein sequence evolution</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><addtitle>Sci Rep</addtitle><description>We introduce a simple model that describes the average occurrence of point variations in a generic protein sequence. This model is based on the idea that mutations are more likely to be fixed at sites in contact with others that have mutated in the recent past. Therefore, we extend the usual assumptions made in protein coevolution by introducing a time dumping on the effect of a substitution on its surrounding and makes correlated substitutions happen in avalanches localized in space and time. The model correctly predicts the average correlation of substitutions as a function of their distance along the sequence. At the same time, it predicts an among-site distribution of the number of substitutions per site highly compatible with a negative binomial, consistently with experimental data. The promising outcomes achieved with this model encourage the application of the same ideas in the field of pairwise and multiple sequence alignment.</description><subject>631/114</subject><subject>631/114/2397</subject><subject>631/181/2474</subject><subject>631/181/2481</subject><subject>639/766/747</subject><subject>Amino Acid Sequence - genetics</subject><subject>Amino Acid Substitution</subject><subject>Codon - genetics</subject><subject>Coevolution</subject><subject>Evolution, Molecular</subject><subject>Humanities and Social Sciences</subject><subject>Humans</subject><subject>Landslides</subject><subject>Models, Genetic</subject><subject>multidisciplinary</subject><subject>Nucleotide sequence</subject><subject>Point Mutation</subject><subject>Proteins</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Sequence Alignment</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNp9UU1PAjEQbYxGCPIHPJhNPK_2Y796MSHEr4TEix5N05ZZLIEttl0I_97iIuLFuUw7896baR9ClwTfEMyqW5-RnFcpJjzNGc-rdHOC-hRneUoZpadH5x4aej_HMXLKM8LPUY-RklPKeB-9jxJtl0vbJM7akNTWxTus7aINJhZlM018q3wwoSs4GSBZS2ekMgsTtolpkpWzAWL28NlCoyE58C_QWS0XHob7PEBvD_ev46d08vL4PB5NUp2VWUglrzihOS8wqBqrDEDigpY1LgpQuJRqykotuVJcAtcFV1VNASuip3UOmjI2QHed7qpVS5hqaIKTC7FyZindVlhpxN9OYz7EzK5FUe2CRoHrvYCz8RE-iLltXRN3FvELCSWsyLKIoh1KO-u9g_owgWCxc0V0rojoivh2RWwi6ep4twPlx4MIYB3Ax1YzA_c7-x_ZL0OpnIE</recordid><startdate>20191202</startdate><enddate>20191202</enddate><creator>Rizzato, Francesca</creator><creator>Zamuner, Stefano</creator><creator>Pagnani, Andrea</creator><creator>Laio, Alessandro</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><scope>C6C</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>5PM</scope></search><sort><creationdate>20191202</creationdate><title>A common root for coevolution and substitution rate variability in protein sequence evolution</title><author>Rizzato, Francesca ; Zamuner, Stefano ; Pagnani, Andrea ; Laio, Alessandro</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c474t-a989125960ebf0b4eea0627f066eb07abd37ca9bb9ae9c69b8f2e0b1cdf5ec233</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>631/114</topic><topic>631/114/2397</topic><topic>631/181/2474</topic><topic>631/181/2481</topic><topic>639/766/747</topic><topic>Amino Acid Sequence - genetics</topic><topic>Amino Acid Substitution</topic><topic>Codon - genetics</topic><topic>Coevolution</topic><topic>Evolution, Molecular</topic><topic>Humanities and Social Sciences</topic><topic>Humans</topic><topic>Landslides</topic><topic>Models, Genetic</topic><topic>multidisciplinary</topic><topic>Nucleotide sequence</topic><topic>Point Mutation</topic><topic>Proteins</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Sequence Alignment</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rizzato, Francesca</creatorcontrib><creatorcontrib>Zamuner, Stefano</creatorcontrib><creatorcontrib>Pagnani, Andrea</creatorcontrib><creatorcontrib>Laio, Alessandro</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rizzato, Francesca</au><au>Zamuner, Stefano</au><au>Pagnani, Andrea</au><au>Laio, Alessandro</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A common root for coevolution and substitution rate variability in protein sequence evolution</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><addtitle>Sci Rep</addtitle><date>2019-12-02</date><risdate>2019</risdate><volume>9</volume><issue>1</issue><spage>18032</spage><epage>10</epage><pages>18032-10</pages><artnum>18032</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>We introduce a simple model that describes the average occurrence of point variations in a generic protein sequence. This model is based on the idea that mutations are more likely to be fixed at sites in contact with others that have mutated in the recent past. Therefore, we extend the usual assumptions made in protein coevolution by introducing a time dumping on the effect of a substitution on its surrounding and makes correlated substitutions happen in avalanches localized in space and time. The model correctly predicts the average correlation of substitutions as a function of their distance along the sequence. At the same time, it predicts an among-site distribution of the number of substitutions per site highly compatible with a negative binomial, consistently with experimental data. The promising outcomes achieved with this model encourage the application of the same ideas in the field of pairwise and multiple sequence alignment.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><pmid>31792239</pmid><doi>10.1038/s41598-019-53958-w</doi><tpages>10</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2045-2322
ispartof Scientific reports, 2019-12, Vol.9 (1), p.18032-10, Article 18032
issn 2045-2322
2045-2322
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_6888882
source MEDLINE; Nature Free; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Alma/SFX Local Collection; Free Full-Text Journals in Chemistry; Springer Nature OA Free Journals
subjects 631/114
631/114/2397
631/181/2474
631/181/2481
639/766/747
Amino Acid Sequence - genetics
Amino Acid Substitution
Codon - genetics
Coevolution
Evolution, Molecular
Humanities and Social Sciences
Humans
Landslides
Models, Genetic
multidisciplinary
Nucleotide sequence
Point Mutation
Proteins
Science
Science (multidisciplinary)
Sequence Alignment
title A common root for coevolution and substitution rate variability in protein sequence evolution
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-15T18%3A37%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20common%20root%20for%20coevolution%20and%20substitution%20rate%20variability%20in%20protein%20sequence%20evolution&rft.jtitle=Scientific%20reports&rft.au=Rizzato,%20Francesca&rft.date=2019-12-02&rft.volume=9&rft.issue=1&rft.spage=18032&rft.epage=10&rft.pages=18032-10&rft.artnum=18032&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-019-53958-w&rft_dat=%3Cproquest_pubme%3E2321213644%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2321213644&rft_id=info:pmid/31792239&rfr_iscdi=true