Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths

Proteins share similar segments with one another. Such “reused parts”—which have been successfully incorporated into other proteins—are likely to offer an evolutionary advantage over de novo evolved segments, as most of the latter will not even have the capacity to fold. To systematically explore th...

Bibliographic details
Published in: Proceedings of the National Academy of Sciences - PNAS 2017-10, Vol.114 (44), p.11703-11708
Main authors: Nepomnyachiy, Sergey, Ben-Tal, Nir, Kolodny, Rachel
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 11708
container_issue 44
container_start_page 11703
container_title Proceedings of the National Academy of Sciences - PNAS
container_volume 114
creator Nepomnyachiy, Sergey
Ben-Tal, Nir
Kolodny, Rachel
description Proteins share similar segments with one another. Such “reused parts”—which have been successfully incorporated into other proteins—are likely to offer an evolutionary advantage over de novo evolved segments, as most of the latter will not even have the capacity to fold. To systematically explore the evolutionary traces of segment “reuse” across proteins, we developed an automated methodology that identifies reused segments from protein alignments. We search for “themes”—segments of at least 35 residues of similar sequence and structure—reused within representative sets of 15,016 domains [Evolutionary Classification of Protein Domains (ECOD) database] or 20,398 chains [Protein Data Bank (PDB)]. We observe that theme reuse is highly prevalent and that reuse is more extensive when the length threshold for identifying a theme is lower. Structural domains, the best characterized form of reuse in proteins, are just one of many complex and intertwined evolutionary traces. Others include long themes shared among a few proteins, which encompass and overlap with shorter themes that recur in numerous proteins. The observed complexity is consistent with evolution by duplication and divergence, and some of the themes might include descendants of ancestral segments. The observed recursive footprints, where the same amino acid can simultaneously participate in several intertwined themes, could be a useful concept for protein design. Data are available at http://trachel-srv.cs.haifa.ac.il/rachel/ppi/themes/.
doi_str_mv 10.1073/pnas.1707642114
format Article
fullrecord <record><control><sourceid>jstor_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5676897</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>26488852</jstor_id><sourcerecordid>26488852</sourcerecordid><originalsourceid>FETCH-LOGICAL-c443t-aa95adf470faa182291a8fc4e3160e6914c9acae91774e3474a9189e1d66a2013</originalsourceid><addsrcrecordid>eNpdkc1r3DAQxUVpaTZpzz21GHrJxYlGlvVxKZSlH4FAL8lZTO3xxotsuZK9NP99ZTZN2oBgQO83w7x5jL0DfgFcV5fTiOkCNNdKCgD5gm2AWyiVtPwl23AudGmkkCfsNKU959zWhr9mJ8JybSqQG0bbMEyefhd0CH6Z-zBivC-6EOYp9uOcikgHQk9t0Y8Frg_9fepTEbosLSkLUwwzZTXRbqC1JUttf6CYqPA07ua79Ia96tAnevtQz9jt1y832-_l9Y9vV9vP12UjZTWXiLbGtpOad4hghLCApmskVaA4KQuysdggWdA6f0ot0YKxBK1SKDhUZ-zTce60_ByobfI6Eb3LVoZsywXs3f_K2N-5XTi4WmllrM4Dzh8GxPBroTS7oU8NeY8jhSU5sLXOt631in58hu7DEvN5VsqoSnJuRKYuj1QTQ0qRusdlgLs1QrdG6J4izB0f_vXwyP_NLAPvj8A-zSE-6UoaY2pR_QEAK6PK</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1986340082</pqid></control><display><type>article</type><title>Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths</title><source>MEDLINE</source><source>JSTOR Archive Collection A-Z Listing</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>Free Full-Text Journals in Chemistry</source><creator>Nepomnyachiy, Sergey ; Ben-Tal, Nir ; Kolodny, Rachel</creator><creatorcontrib>Nepomnyachiy, Sergey ; Ben-Tal, Nir ; Kolodny, Rachel</creatorcontrib><description>Proteins share similar segments with one another. Such “reused parts”—which have been successfully incorporated into other proteins—are likely to offer an evolutionary advantage over de novo evolved segments, as most of the latter will not even have the capacity to fold. 
To systematically explore the evolutionary traces of segment “reuse” across proteins, we developed an automated methodology that identifies reused segments from protein alignments. We search for “themes”—segments of at least 35 residues of similar sequence and structure—reused within representative sets of 15,016 domains [Evolutionary Classification of Protein Domains (ECOD) database] or 20,398 chains [Protein Data Bank (PDB)]. We observe that theme reuse is highly prevalent and that reuse is more extensive when the length threshold for identifying a theme is lower. Structural domains, the best characterized form of reuse in proteins, are just one of many complex and intertwined evolutionary traces. Others include long themes shared among a few proteins, which encompass and overlap with shorter themes that recur in numerous proteins. The observed complexity is consistent with evolution by duplication and divergence, and some of the themes might include descendants of ancestral segments. The observed recursive footprints, where the same amino acid can simultaneously participate in several intertwined themes, could be a useful concept for protein design. 
Data are available at http://trachel-srv.cs.haifa.ac.il/rachel/ppi/themes/.</description><identifier>ISSN: 0027-8424</identifier><identifier>EISSN: 1091-6490</identifier><identifier>DOI: 10.1073/pnas.1707642114</identifier><identifier>PMID: 29078314</identifier><language>eng</language><publisher>United States: National Academy of Sciences</publisher><subject>Amino Acid Sequence ; Amino acids ; Biological Sciences ; Complexity ; Computational Biology - methods ; Databases, Protein ; Divergence ; Evolution ; Evolution, Molecular ; Footprints ; Models, Genetic ; Protein Conformation ; Proteins ; Proteins - chemistry ; Proteins - genetics ; Reuse ; Segments</subject><ispartof>Proceedings of the National Academy of Sciences - PNAS, 2017-10, Vol.114 (44), p.11703-11708</ispartof><rights>Volumes 1–89 and 106–114, copyright as a collective work only; author(s) retains copyright to individual articles</rights><rights>Copyright © 2017 the Author(s). Published by PNAS.</rights><rights>Copyright National Academy of Sciences Oct 31, 2017</rights><rights>Copyright © 2017 the Author(s). Published by PNAS. 
2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c443t-aa95adf470faa182291a8fc4e3160e6914c9acae91774e3474a9189e1d66a2013</citedby><cites>FETCH-LOGICAL-c443t-aa95adf470faa182291a8fc4e3160e6914c9acae91774e3474a9189e1d66a2013</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/26488852$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/26488852$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>230,314,727,780,784,803,885,27923,27924,53790,53792,58016,58249</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29078314$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Nepomnyachiy, Sergey</creatorcontrib><creatorcontrib>Ben-Tal, Nir</creatorcontrib><creatorcontrib>Kolodny, Rachel</creatorcontrib><title>Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths</title><title>Proceedings of the National Academy of Sciences - PNAS</title><addtitle>Proc Natl Acad Sci U S A</addtitle><description>Proteins share similar segments with one another. Such “reused parts”—which have been successfully incorporated into other proteins—are likely to offer an evolutionary advantage over de novo evolved segments, as most of the latter will not even have the capacity to fold. To systematically explore the evolutionary traces of segment “reuse” across proteins, we developed an automated methodology that identifies reused segments from protein alignments. We search for “themes”—segments of at least 35 residues of similar sequence and structure—reused within representative sets of 15,016 domains [Evolutionary Classification of Protein Domains (ECOD) database] or 20,398 chains [Protein Data Bank (PDB)]. 
We observe that theme reuse is highly prevalent and that reuse is more extensive when the length threshold for identifying a theme is lower. Structural domains, the best characterized form of reuse in proteins, are just one of many complex and intertwined evolutionary traces. Others include long themes shared among a few proteins, which encompass and overlap with shorter themes that recur in numerous proteins. The observed complexity is consistent with evolution by duplication and divergence, and some of the themes might include descendants of ancestral segments. The observed recursive footprints, where the same amino acid can simultaneously participate in several intertwined themes, could be a useful concept for protein design. Data are available at http://trachel-srv.cs.haifa.ac.il/rachel/ppi/themes/.</description><subject>Amino Acid Sequence</subject><subject>Amino acids</subject><subject>Biological Sciences</subject><subject>Complexity</subject><subject>Computational Biology - methods</subject><subject>Databases, Protein</subject><subject>Divergence</subject><subject>Evolution</subject><subject>Evolution, Molecular</subject><subject>Footprints</subject><subject>Models, Genetic</subject><subject>Protein Conformation</subject><subject>Proteins</subject><subject>Proteins - chemistry</subject><subject>Proteins - 
genetics</subject><subject>Reuse</subject><subject>Segments</subject><issn>0027-8424</issn><issn>1091-6490</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpdkc1r3DAQxUVpaTZpzz21GHrJxYlGlvVxKZSlH4FAL8lZTO3xxotsuZK9NP99ZTZN2oBgQO83w7x5jL0DfgFcV5fTiOkCNNdKCgD5gm2AWyiVtPwl23AudGmkkCfsNKU959zWhr9mJ8JybSqQG0bbMEyefhd0CH6Z-zBivC-6EOYp9uOcikgHQk9t0Y8Frg_9fepTEbosLSkLUwwzZTXRbqC1JUttf6CYqPA07ua79Ia96tAnevtQz9jt1y832-_l9Y9vV9vP12UjZTWXiLbGtpOad4hghLCApmskVaA4KQuysdggWdA6f0ot0YKxBK1SKDhUZ-zTce60_ByobfI6Eb3LVoZsywXs3f_K2N-5XTi4WmllrM4Dzh8GxPBroTS7oU8NeY8jhSU5sLXOt631in58hu7DEvN5VsqoSnJuRKYuj1QTQ0qRusdlgLs1QrdG6J4izB0f_vXwyP_NLAPvj8A-zSE-6UoaY2pR_QEAK6PK</recordid><startdate>20171031</startdate><enddate>20171031</enddate><creator>Nepomnyachiy, Sergey</creator><creator>Ben-Tal, Nir</creator><creator>Kolodny, Rachel</creator><general>National Academy of Sciences</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QG</scope><scope>7QL</scope><scope>7QP</scope><scope>7QR</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TK</scope><scope>7TM</scope><scope>7TO</scope><scope>7U9</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>H94</scope><scope>M7N</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20171031</creationdate><title>Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths</title><author>Nepomnyachiy, Sergey ; Ben-Tal, Nir ; Kolodny, 
Rachel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c443t-aa95adf470faa182291a8fc4e3160e6914c9acae91774e3474a9189e1d66a2013</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Amino Acid Sequence</topic><topic>Amino acids</topic><topic>Biological Sciences</topic><topic>Complexity</topic><topic>Computational Biology - methods</topic><topic>Databases, Protein</topic><topic>Divergence</topic><topic>Evolution</topic><topic>Evolution, Molecular</topic><topic>Footprints</topic><topic>Models, Genetic</topic><topic>Protein Conformation</topic><topic>Proteins</topic><topic>Proteins - chemistry</topic><topic>Proteins - genetics</topic><topic>Reuse</topic><topic>Segments</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Nepomnyachiy, Sergey</creatorcontrib><creatorcontrib>Ben-Tal, Nir</creatorcontrib><creatorcontrib>Kolodny, Rachel</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Oncogenes and Growth Factors Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research 
Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Proceedings of the National Academy of Sciences - PNAS</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Nepomnyachiy, Sergey</au><au>Ben-Tal, Nir</au><au>Kolodny, Rachel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths</atitle><jtitle>Proceedings of the National Academy of Sciences - PNAS</jtitle><addtitle>Proc Natl Acad Sci U S A</addtitle><date>2017-10-31</date><risdate>2017</risdate><volume>114</volume><issue>44</issue><spage>11703</spage><epage>11708</epage><pages>11703-11708</pages><issn>0027-8424</issn><eissn>1091-6490</eissn><abstract>Proteins share similar segments with one another. Such “reused parts”—which have been successfully incorporated into other proteins—are likely to offer an evolutionary advantage over de novo evolved segments, as most of the latter will not even have the capacity to fold. To systematically explore the evolutionary traces of segment “reuse” across proteins, we developed an automated methodology that identifies reused segments from protein alignments. We search for “themes”—segments of at least 35 residues of similar sequence and structure—reused within representative sets of 15,016 domains [Evolutionary Classification of Protein Domains (ECOD) database] or 20,398 chains [Protein Data Bank (PDB)]. We observe that theme reuse is highly prevalent and that reuse is more extensive when the length threshold for identifying a theme is lower. 
Structural domains, the best characterized form of reuse in proteins, are just one of many complex and intertwined evolutionary traces. Others include long themes shared among a few proteins, which encompass and overlap with shorter themes that recur in numerous proteins. The observed complexity is consistent with evolution by duplication and divergence, and some of the themes might include descendants of ancestral segments. The observed recursive footprints, where the same amino acid can simultaneously participate in several intertwined themes, could be a useful concept for protein design. Data are available at http://trachel-srv.cs.haifa.ac.il/rachel/ppi/themes/.</abstract><cop>United States</cop><pub>National Academy of Sciences</pub><pmid>29078314</pmid><doi>10.1073/pnas.1707642114</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0027-8424
ispartof Proceedings of the National Academy of Sciences - PNAS, 2017-10, Vol.114 (44), p.11703-11708
issn 0027-8424
1091-6490
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5676897
source MEDLINE; JSTOR Archive Collection A-Z Listing; PubMed Central; Alma/SFX Local Collection; Free Full-Text Journals in Chemistry
subjects Amino Acid Sequence
Amino acids
Biological Sciences
Complexity
Computational Biology - methods
Databases, Protein
Divergence
Evolution
Evolution, Molecular
Footprints
Models, Genetic
Protein Conformation
Proteins
Proteins - chemistry
Proteins - genetics
Reuse
Segments
title Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T20%3A34%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Complex%20evolutionary%20footprints%20revealed%20in%20an%20analysis%20of%20reused%20protein%20segments%20of%20diverse%20lengths&rft.jtitle=Proceedings%20of%20the%20National%20Academy%20of%20Sciences%20-%20PNAS&rft.au=Nepomnyachiy,%20Sergey&rft.date=2017-10-31&rft.volume=114&rft.issue=44&rft.spage=11703&rft.epage=11708&rft.pages=11703-11708&rft.issn=0027-8424&rft.eissn=1091-6490&rft_id=info:doi/10.1073/pnas.1707642114&rft_dat=%3Cjstor_pubme%3E26488852%3C/jstor_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1986340082&rft_id=info:pmid/29078314&rft_jstor_id=26488852&rfr_iscdi=true