Similar protein segments shared between domains of different evolutionary lineages

The emergence of novel proteins, beyond these that can be readily made by duplication and recombination of preexisting domains, is elusive. De novo emergence from random sequences is unlikely because the vast majority of random chains would not even fold, let alone function. An alternative explanati...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Protein science 2022-09, Vol.31 (9), p.e4407-n/a
Hauptverfasser: Qiu, Kaiyu, Ben‐Tal, Nir, Kolodny, Rachel
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page n/a
container_issue 9
container_start_page e4407
container_title Protein science
container_volume 31
creator Qiu, Kaiyu
Ben‐Tal, Nir
Kolodny, Rachel
description The emergence of novel proteins, beyond these that can be readily made by duplication and recombination of preexisting domains, is elusive. De novo emergence from random sequences is unlikely because the vast majority of random chains would not even fold, let alone function. An alternative explanation is that novel proteins emerge by duplication and fusion of pre‐existing polypeptide segments. In this case, traces of such ancient events may remain within contemporary proteins in the form of reused segments. Together with the late Dan Tawfik, we detected such similar segments, far shorter than intact protein domains, which are found in different environments. The detection of these, “bridging themes,” was based on a unique search strategy, where in addition to searching for similarity of shared fragments, so‐called “themes,” we also explicitly searched for cases in which the sequence segments before and after the theme are dissimilar (both in sequence and structure). Here, using a similar strategy, we further expanded the search and discovered almost 500 additional “bridging themes,” linking domains that are often from ancient folds. The themes, of 20 residues or more (average 53), do not retain their structure despite sharing 37% sequence identity on average. Indeed, conformation flexibility may confer an evolutionary advantage, in that it fits in multiple environments. We elaborate on two interesting themes, shared between Rossmann/Trefoil‐Plexin‐like domains and a β‐propeller‐like domain. For a Broad Audience A fundamental question in molecular evolution is how protein domains emerged. Similar segments shared between domains of seemingly distinct origins, may offer clues, as these may be remnants of the evolutionary process through which these domains emerged. However, finding such cases is difficult. Here, we expand the set of such cases which we curated previously, adding segments shared between domains that are considered ancient.
doi_str_mv 10.1002/pro.4407
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9387206</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2707855585</sourcerecordid><originalsourceid>FETCH-LOGICAL-c5047-db88a6bac729310d2c5a0ed69d7795b88e497d743454c9fa1410a9fb4197b0e93</originalsourceid><addsrcrecordid>eNp1kVtrFTEURoMo9lgFf4EEfPFl6k4mk8uLIKVeoFCpCr6FzGTPacpMckxmWvrvzbG1XsCnEPZi5dv5CHnO4IgB8Ne7nI6EAPWAbJiQptFGfntINmAka3Qr9QF5UsolAAjG28fkoJUggEu2Ieefwxwml2lVLBgiLbidMS6FlguX0dMel2vESH2aXYiFppH6MI6YK0TxKk3rElJ0-YZOIaLbYnlKHo1uKvjs7jwkX9-dfDn-0Jyevf94_Pa0GToQqvG91k72blDctAw8HzoH6KXxSpmuDlEY5ZVoRScGMzomGDgz9oIZ1QOa9pC8ufXu1n5GP9RA2U12l8Nc49jkgv17EsOF3aYra1qtOMgqeHUnyOn7imWxcygDTpOLmNZiuQLN60_x_Vsv_0Ev05pjXW9PKd11ne5-C4ecSsk43odhYPdF1Xuy-6Iq-uLP8Pfgr2Yq0NwC12HCm_-K7Kfzs5_CH0n4ng4</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2707855585</pqid></control><display><type>article</type><title>Similar protein segments shared between domains of different evolutionary lineages</title><source>MEDLINE</source><source>Wiley Online Library Free Content</source><source>Access via Wiley Online Library</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><creator>Qiu, Kaiyu ; Ben‐Tal, Nir ; Kolodny, Rachel</creator><creatorcontrib>Qiu, Kaiyu ; Ben‐Tal, Nir ; Kolodny, Rachel</creatorcontrib><description>The emergence of novel proteins, beyond these that can be readily made by duplication and recombination of preexisting domains, is elusive. De novo emergence from random sequences is unlikely because the vast majority of random chains would not even fold, let alone function. An alternative explanation is that novel proteins emerge by duplication and fusion of pre‐existing polypeptide segments. In this case, traces of such ancient events may remain within contemporary proteins in the form of reused segments. Together with the late Dan Tawfik, we detected such similar segments, far shorter than intact protein domains, which are found in different environments. The detection of these, “bridging themes,” was based on a unique search strategy, where in addition to searching for similarity of shared fragments, so‐called “themes,” we also explicitly searched for cases in which the sequence segments before and after the theme are dissimilar (both in sequence and structure). Here, using a similar strategy, we further expanded the search and discovered almost 500 additional “bridging themes,” linking domains that are often from ancient folds. The themes, of 20 residues or more (average 53), do not retain their structure despite sharing 37% sequence identity on average. Indeed, conformation flexibility may confer an evolutionary advantage, in that it fits in multiple environments. We elaborate on two interesting themes, shared between Rossmann/Trefoil‐Plexin‐like domains and a β‐propeller‐like domain. For a Broad Audience A fundamental question in molecular evolution is how protein domains emerged. Similar segments shared between domains of seemingly distinct origins, may offer clues, as these may be remnants of the evolutionary process through which these domains emerged. However, finding such cases is difficult. Here, we expand the set of such cases which we curated previously, adding segments shared between domains that are considered ancient.</description><identifier>ISSN: 0961-8368</identifier><identifier>ISSN: 1469-896X</identifier><identifier>EISSN: 1469-896X</identifier><identifier>DOI: 10.1002/pro.4407</identifier><identifier>PMID: 36040261</identifier><language>eng</language><publisher>Hoboken, USA: John Wiley &amp; Sons, Inc</publisher><subject>Amino Acid Sequence ; ancestral peptides ; bridging themes ; Evolution ; Evolution, Molecular ; Full‐length Paper ; Full‐length Papers ; Molecular evolution ; Peptides - chemistry ; Polypeptides ; Protein Domains ; protein emergence ; protein evolutionary patterns ; protein space ; Protein structure ; Proteins ; Proteins - chemistry ; Proteins - genetics ; Recombination ; Reproduction (copying) ; Search methods ; Segments</subject><ispartof>Protein science, 2022-09, Vol.31 (9), p.e4407-n/a</ispartof><rights>2022 The Authors. published by Wiley Periodicals LLC on behalf of The Protein Society.</rights><rights>2022 The Authors. Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society.</rights><rights>2022. This article is published under http://creativecommons.org/licenses/by-nc/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c5047-db88a6bac729310d2c5a0ed69d7795b88e497d743454c9fa1410a9fb4197b0e93</citedby><cites>FETCH-LOGICAL-c5047-db88a6bac729310d2c5a0ed69d7795b88e497d743454c9fa1410a9fb4197b0e93</cites><orcidid>0000-0001-6901-832X ; 0000-0002-0676-850X ; 0000-0001-8523-1614</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC9387206/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC9387206/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,1417,1433,27924,27925,45574,45575,46409,46833,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36040261$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Qiu, Kaiyu</creatorcontrib><creatorcontrib>Ben‐Tal, Nir</creatorcontrib><creatorcontrib>Kolodny, Rachel</creatorcontrib><title>Similar protein segments shared between domains of different evolutionary lineages</title><title>Protein science</title><addtitle>Protein Sci</addtitle><description>The emergence of novel proteins, beyond these that can be readily made by duplication and recombination of preexisting domains, is elusive. De novo emergence from random sequences is unlikely because the vast majority of random chains would not even fold, let alone function. An alternative explanation is that novel proteins emerge by duplication and fusion of pre‐existing polypeptide segments. In this case, traces of such ancient events may remain within contemporary proteins in the form of reused segments. Together with the late Dan Tawfik, we detected such similar segments, far shorter than intact protein domains, which are found in different environments. The detection of these, “bridging themes,” was based on a unique search strategy, where in addition to searching for similarity of shared fragments, so‐called “themes,” we also explicitly searched for cases in which the sequence segments before and after the theme are dissimilar (both in sequence and structure). Here, using a similar strategy, we further expanded the search and discovered almost 500 additional “bridging themes,” linking domains that are often from ancient folds. The themes, of 20 residues or more (average 53), do not retain their structure despite sharing 37% sequence identity on average. Indeed, conformation flexibility may confer an evolutionary advantage, in that it fits in multiple environments. We elaborate on two interesting themes, shared between Rossmann/Trefoil‐Plexin‐like domains and a β‐propeller‐like domain. For a Broad Audience A fundamental question in molecular evolution is how protein domains emerged. Similar segments shared between domains of seemingly distinct origins, may offer clues, as these may be remnants of the evolutionary process through which these domains emerged. However, finding such cases is difficult. Here, we expand the set of such cases which we curated previously, adding segments shared between domains that are considered ancient.</description><subject>Amino Acid Sequence</subject><subject>ancestral peptides</subject><subject>bridging themes</subject><subject>Evolution</subject><subject>Evolution, Molecular</subject><subject>Full‐length Paper</subject><subject>Full‐length Papers</subject><subject>Molecular evolution</subject><subject>Peptides - chemistry</subject><subject>Polypeptides</subject><subject>Protein Domains</subject><subject>protein emergence</subject><subject>protein evolutionary patterns</subject><subject>protein space</subject><subject>Protein structure</subject><subject>Proteins</subject><subject>Proteins - chemistry</subject><subject>Proteins - genetics</subject><subject>Recombination</subject><subject>Reproduction (copying)</subject><subject>Search methods</subject><subject>Segments</subject><issn>0961-8368</issn><issn>1469-896X</issn><issn>1469-896X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>24P</sourceid><sourceid>WIN</sourceid><sourceid>EIF</sourceid><recordid>eNp1kVtrFTEURoMo9lgFf4EEfPFl6k4mk8uLIKVeoFCpCr6FzGTPacpMckxmWvrvzbG1XsCnEPZi5dv5CHnO4IgB8Ne7nI6EAPWAbJiQptFGfntINmAka3Qr9QF5UsolAAjG28fkoJUggEu2Ieefwxwml2lVLBgiLbidMS6FlguX0dMel2vESH2aXYiFppH6MI6YK0TxKk3rElJ0-YZOIaLbYnlKHo1uKvjs7jwkX9-dfDn-0Jyevf94_Pa0GToQqvG91k72blDctAw8HzoH6KXxSpmuDlEY5ZVoRScGMzomGDgz9oIZ1QOa9pC8ufXu1n5GP9RA2U12l8Nc49jkgv17EsOF3aYra1qtOMgqeHUnyOn7imWxcygDTpOLmNZiuQLN60_x_Vsv_0Ev05pjXW9PKd11ne5-C4ecSsk43odhYPdF1Xuy-6Iq-uLP8Pfgr2Yq0NwC12HCm_-K7Kfzs5_CH0n4ng4</recordid><startdate>202209</startdate><enddate>202209</enddate><creator>Qiu, Kaiyu</creator><creator>Ben‐Tal, Nir</creator><creator>Kolodny, Rachel</creator><general>John Wiley &amp; Sons, Inc</general><general>Wiley Subscription Services, Inc</general><scope>24P</scope><scope>WIN</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QO</scope><scope>7T5</scope><scope>7TM</scope><scope>7U9</scope><scope>8FD</scope><scope>FR3</scope><scope>H94</scope><scope>K9.</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-6901-832X</orcidid><orcidid>https://orcid.org/0000-0002-0676-850X</orcidid><orcidid>https://orcid.org/0000-0001-8523-1614</orcidid></search><sort><creationdate>202209</creationdate><title>Similar protein segments shared between domains of different evolutionary lineages</title><author>Qiu, Kaiyu ; Ben‐Tal, Nir ; Kolodny, Rachel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c5047-db88a6bac729310d2c5a0ed69d7795b88e497d743454c9fa1410a9fb4197b0e93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Amino Acid Sequence</topic><topic>ancestral peptides</topic><topic>bridging themes</topic><topic>Evolution</topic><topic>Evolution, Molecular</topic><topic>Full‐length Paper</topic><topic>Full‐length Papers</topic><topic>Molecular evolution</topic><topic>Peptides - chemistry</topic><topic>Polypeptides</topic><topic>Protein Domains</topic><topic>protein emergence</topic><topic>protein evolutionary patterns</topic><topic>protein space</topic><topic>Protein structure</topic><topic>Proteins</topic><topic>Proteins - chemistry</topic><topic>Proteins - genetics</topic><topic>Recombination</topic><topic>Reproduction (copying)</topic><topic>Search methods</topic><topic>Segments</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Qiu, Kaiyu</creatorcontrib><creatorcontrib>Ben‐Tal, Nir</creatorcontrib><creatorcontrib>Kolodny, Rachel</creatorcontrib><collection>Wiley Online Library (Open Access Collection)</collection><collection>Wiley Online Library Free Content</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Biotechnology Research Abstracts</collection><collection>Immunology Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Protein science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Qiu, Kaiyu</au><au>Ben‐Tal, Nir</au><au>Kolodny, Rachel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Similar protein segments shared between domains of different evolutionary lineages</atitle><jtitle>Protein science</jtitle><addtitle>Protein Sci</addtitle><date>2022-09</date><risdate>2022</risdate><volume>31</volume><issue>9</issue><spage>e4407</spage><epage>n/a</epage><pages>e4407-n/a</pages><issn>0961-8368</issn><issn>1469-896X</issn><eissn>1469-896X</eissn><abstract>The emergence of novel proteins, beyond these that can be readily made by duplication and recombination of preexisting domains, is elusive. De novo emergence from random sequences is unlikely because the vast majority of random chains would not even fold, let alone function. An alternative explanation is that novel proteins emerge by duplication and fusion of pre‐existing polypeptide segments. In this case, traces of such ancient events may remain within contemporary proteins in the form of reused segments. Together with the late Dan Tawfik, we detected such similar segments, far shorter than intact protein domains, which are found in different environments. The detection of these, “bridging themes,” was based on a unique search strategy, where in addition to searching for similarity of shared fragments, so‐called “themes,” we also explicitly searched for cases in which the sequence segments before and after the theme are dissimilar (both in sequence and structure). Here, using a similar strategy, we further expanded the search and discovered almost 500 additional “bridging themes,” linking domains that are often from ancient folds. The themes, of 20 residues or more (average 53), do not retain their structure despite sharing 37% sequence identity on average. Indeed, conformation flexibility may confer an evolutionary advantage, in that it fits in multiple environments. We elaborate on two interesting themes, shared between Rossmann/Trefoil‐Plexin‐like domains and a β‐propeller‐like domain. For a Broad Audience A fundamental question in molecular evolution is how protein domains emerged. Similar segments shared between domains of seemingly distinct origins, may offer clues, as these may be remnants of the evolutionary process through which these domains emerged. However, finding such cases is difficult. Here, we expand the set of such cases which we curated previously, adding segments shared between domains that are considered ancient.</abstract><cop>Hoboken, USA</cop><pub>John Wiley &amp; Sons, Inc</pub><pmid>36040261</pmid><doi>10.1002/pro.4407</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0001-6901-832X</orcidid><orcidid>https://orcid.org/0000-0002-0676-850X</orcidid><orcidid>https://orcid.org/0000-0001-8523-1614</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0961-8368
ispartof Protein science, 2022-09, Vol.31 (9), p.e4407-n/a
issn 0961-8368
1469-896X
1469-896X
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9387206
source MEDLINE; Wiley Online Library Free Content; Access via Wiley Online Library; EZB-FREE-00999 freely available EZB journals; PubMed Central; Free Full-Text Journals in Chemistry
subjects Amino Acid Sequence
ancestral peptides
bridging themes
Evolution
Evolution, Molecular
Full‐length Paper
Full‐length Papers
Molecular evolution
Peptides - chemistry
Polypeptides
Protein Domains
protein emergence
protein evolutionary patterns
protein space
Protein structure
Proteins
Proteins - chemistry
Proteins - genetics
Recombination
Reproduction (copying)
Search methods
Segments
title Similar protein segments shared between domains of different evolutionary lineages
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T05%3A55%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Similar%20protein%20segments%20shared%20between%20domains%20of%20different%20evolutionary%20lineages&rft.jtitle=Protein%20science&rft.au=Qiu,%20Kaiyu&rft.date=2022-09&rft.volume=31&rft.issue=9&rft.spage=e4407&rft.epage=n/a&rft.pages=e4407-n/a&rft.issn=0961-8368&rft.eissn=1469-896X&rft_id=info:doi/10.1002/pro.4407&rft_dat=%3Cproquest_pubme%3E2707855585%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2707855585&rft_id=info:pmid/36040261&rfr_iscdi=true