Coevolution‐derived native and non‐native contacts determine the emergence of a novel fold in a universally conserved family of transcription factors

The NusG protein family is structurally and functionally conserved in all domains of life. Its members directly bind RNA polymerases and regulate transcription processivity and termination. RfaH, a divergent sub‐family in its evolutionary history, is known for displaying distinct features than those...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Protein science 2022-06, Vol.31 (6), p.e4337-n/a
Hauptverfasser: Galaz‐Davison, Pablo, Ferreiro, Diego U., Ramírez‐Sarmiento, César A.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page n/a
container_issue 6
container_start_page e4337
container_title Protein science
container_volume 31
creator Galaz‐Davison, Pablo
Ferreiro, Diego U.
Ramírez‐Sarmiento, César A.
description The NusG protein family is structurally and functionally conserved in all domains of life. Its members directly bind RNA polymerases and regulate transcription processivity and termination. RfaH, a divergent sub‐family in its evolutionary history, is known for displaying distinct features than those in NusG proteins, which allows them to regulate the expression of virulence factors in enterobacteria in a DNA sequence‐dependent manner. A striking feature is its structural interconversion between an active fold, which is the canonical NusG three‐dimensional structure, and an autoinhibited fold, which is distinctively novel. How this novel fold is encoded within RfaH sequence to encode a metamorphic protein remains elusive. In this work, we used publicly available genomic RfaH protein sequences to construct a complete multiple sequence alignment, which was further augmented with metagenomic sequences and curated by predicting their secondary structure propensities using JPred. Coevolving pairs of residues were calculated from these sequences using plmDCA and GREMLIN, which allowed us to detect the enrichment of key metamorphic contacts after sequence filtering. Finally, we combined our coevolutionary predictions with molecular dynamics to demonstrate that these interactions are sufficient to predict the structures of both native folds, where coevolutionary‐derived non‐native contacts may play a key role in achieving the compact RfaH novel fold. All in all, emergent coevolutionary signals found within RfaH sequences encode the autoinhibited and active folds of this protein, shedding light on the key interactions responsible for the action of this metamorphic protein.
doi_str_mv 10.1002/pro.4337
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9123645</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2671996961</sourcerecordid><originalsourceid>FETCH-LOGICAL-c4387-b17485165425fd76be579de52ee2470c3efb585b264c7796e8d68be525c2d5cb3</originalsourceid><addsrcrecordid>eNp1kc1u1DAQxy0EoktB4glQJC5cUhJ_JhcktOJLqlSEQOJmOc6kdeXYi51stTceode-Hk_CZLuUD4mTPTO_-XvGf0Ke1tVJXVX05SbFE86YukdWNZdt2bTy632yqlpZlw2TzRF5lPNlVVW8puwhOWJCMq5ksyI36wjb6OfJxfDj-3UPyW2hL4KZ8CxMwOu-cEjYGCZjp1z0MEEaXYBiuoACRkjnECwUcSgMtmzBF0P0feECxnPA3pSN97tFIUNa3hjM6DCBHVMyIdvkNssUmLdTTPkxeTAYn-HJ4TwmX96--bx-X56evfuwfn1aWs4aVXa14o2opeBUDL2SHQjV9iAoAOWqsgyGTjSio5JbpVoJTS8bhKiwtBe2Y8fk1a3uZu5G6C0EHMfrTXKjSTsdjdN_V4K70Odxq1v8S8kFCrw4CKT4bYY86dFlC96bAHHOmkpVt61ELxB9_g96GecUcL09hatQxn4L2hRzTjDcDVNXevEb46gXvxF99ufwd-AvgxEob4Er52H3XyH98dPZXvAnEUe60A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2671174233</pqid></control><display><type>article</type><title>Coevolution‐derived native and non‐native contacts determine the emergence of a novel fold in a universally conserved family of transcription factors</title><source>Wiley-Blackwell Journals</source><source>Wiley-Blackwell Open Access Backfiles</source><source>MEDLINE</source><source>Full-Text Journals in Chemistry (Open access)</source><source>PubMed Central</source><source>EZB Electronic Journals Library</source><creator>Galaz‐Davison, Pablo ; Ferreiro, Diego U. ; Ramírez‐Sarmiento, César A.</creator><creatorcontrib>Galaz‐Davison, Pablo ; Ferreiro, Diego U. ; Ramírez‐Sarmiento, César A.</creatorcontrib><description>The NusG protein family is structurally and functionally conserved in all domains of life. Its members directly bind RNA polymerases and regulate transcription processivity and termination. RfaH, a divergent sub‐family in its evolutionary history, is known for displaying distinct features than those in NusG proteins, which allows them to regulate the expression of virulence factors in enterobacteria in a DNA sequence‐dependent manner. A striking feature is its structural interconversion between an active fold, which is the canonical NusG three‐dimensional structure, and an autoinhibited fold, which is distinctively novel. How this novel fold is encoded within RfaH sequence to encode a metamorphic protein remains elusive. In this work, we used publicly available genomic RfaH protein sequences to construct a complete multiple sequence alignment, which was further augmented with metagenomic sequences and curated by predicting their secondary structure propensities using JPred. Coevolving pairs of residues were calculated from these sequences using plmDCA and GREMLIN, which allowed us to detect the enrichment of key metamorphic contacts after sequence filtering. Finally, we combined our coevolutionary predictions with molecular dynamics to demonstrate that these interactions are sufficient to predict the structures of both native folds, where coevolutionary‐derived non‐native contacts may play a key role in achieving the compact RfaH novel fold. All in all, emergent coevolutionary signals found within RfaH sequences encode the autoinhibited and active folds of this protein, shedding light on the key interactions responsible for the action of this metamorphic protein.</description><identifier>ISSN: 0961-8368</identifier><identifier>EISSN: 1469-896X</identifier><identifier>DOI: 10.1002/pro.4337</identifier><identifier>PMID: 35634768</identifier><language>eng</language><publisher>Hoboken, USA: John Wiley &amp; Sons, Inc</publisher><subject>Coevolution ; direct coupling analysis ; Divergence ; DNA-Directed RNA Polymerases - chemistry ; Escherichia coli Proteins - chemistry ; evolution ; fold‐switch ; Full‐length Paper ; Full‐length Papers ; Gremlin protein ; Metagenomics ; metamorphic proteins ; Molecular dynamics ; Nucleotide sequence ; NusG protein ; Peptide Elongation Factors - chemistry ; Peptide Elongation Factors - genetics ; Peptide Elongation Factors - metabolism ; Protein folding ; Protein structure ; Proteins ; Secondary structure ; Trans-Activators - chemistry ; transcription factor ; Transcription factors ; Transcription Factors - chemistry ; Transcription termination ; Virulence ; Virulence factors</subject><ispartof>Protein science, 2022-06, Vol.31 (6), p.e4337-n/a</ispartof><rights>2022 The Protein Society.</rights><rights>2022 The Protein Society</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c4387-b17485165425fd76be579de52ee2470c3efb585b264c7796e8d68be525c2d5cb3</citedby><cites>FETCH-LOGICAL-c4387-b17485165425fd76be579de52ee2470c3efb585b264c7796e8d68be525c2d5cb3</cites><orcidid>0000-0003-4647-903X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC9123645/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC9123645/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,1417,1433,27924,27925,45574,45575,46409,46833,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35634768$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Galaz‐Davison, Pablo</creatorcontrib><creatorcontrib>Ferreiro, Diego U.</creatorcontrib><creatorcontrib>Ramírez‐Sarmiento, César A.</creatorcontrib><title>Coevolution‐derived native and non‐native contacts determine the emergence of a novel fold in a universally conserved family of transcription factors</title><title>Protein science</title><addtitle>Protein Sci</addtitle><description>The NusG protein family is structurally and functionally conserved in all domains of life. Its members directly bind RNA polymerases and regulate transcription processivity and termination. RfaH, a divergent sub‐family in its evolutionary history, is known for displaying distinct features than those in NusG proteins, which allows them to regulate the expression of virulence factors in enterobacteria in a DNA sequence‐dependent manner. A striking feature is its structural interconversion between an active fold, which is the canonical NusG three‐dimensional structure, and an autoinhibited fold, which is distinctively novel. How this novel fold is encoded within RfaH sequence to encode a metamorphic protein remains elusive. In this work, we used publicly available genomic RfaH protein sequences to construct a complete multiple sequence alignment, which was further augmented with metagenomic sequences and curated by predicting their secondary structure propensities using JPred. Coevolving pairs of residues were calculated from these sequences using plmDCA and GREMLIN, which allowed us to detect the enrichment of key metamorphic contacts after sequence filtering. Finally, we combined our coevolutionary predictions with molecular dynamics to demonstrate that these interactions are sufficient to predict the structures of both native folds, where coevolutionary‐derived non‐native contacts may play a key role in achieving the compact RfaH novel fold. All in all, emergent coevolutionary signals found within RfaH sequences encode the autoinhibited and active folds of this protein, shedding light on the key interactions responsible for the action of this metamorphic protein.</description><subject>Coevolution</subject><subject>direct coupling analysis</subject><subject>Divergence</subject><subject>DNA-Directed RNA Polymerases - chemistry</subject><subject>Escherichia coli Proteins - chemistry</subject><subject>evolution</subject><subject>fold‐switch</subject><subject>Full‐length Paper</subject><subject>Full‐length Papers</subject><subject>Gremlin protein</subject><subject>Metagenomics</subject><subject>metamorphic proteins</subject><subject>Molecular dynamics</subject><subject>Nucleotide sequence</subject><subject>NusG protein</subject><subject>Peptide Elongation Factors - chemistry</subject><subject>Peptide Elongation Factors - genetics</subject><subject>Peptide Elongation Factors - metabolism</subject><subject>Protein folding</subject><subject>Protein structure</subject><subject>Proteins</subject><subject>Secondary structure</subject><subject>Trans-Activators - chemistry</subject><subject>transcription factor</subject><subject>Transcription factors</subject><subject>Transcription Factors - chemistry</subject><subject>Transcription termination</subject><subject>Virulence</subject><subject>Virulence factors</subject><issn>0961-8368</issn><issn>1469-896X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp1kc1u1DAQxy0EoktB4glQJC5cUhJ_JhcktOJLqlSEQOJmOc6kdeXYi51stTceode-Hk_CZLuUD4mTPTO_-XvGf0Ke1tVJXVX05SbFE86YukdWNZdt2bTy632yqlpZlw2TzRF5lPNlVVW8puwhOWJCMq5ksyI36wjb6OfJxfDj-3UPyW2hL4KZ8CxMwOu-cEjYGCZjp1z0MEEaXYBiuoACRkjnECwUcSgMtmzBF0P0feECxnPA3pSN97tFIUNa3hjM6DCBHVMyIdvkNssUmLdTTPkxeTAYn-HJ4TwmX96--bx-X56evfuwfn1aWs4aVXa14o2opeBUDL2SHQjV9iAoAOWqsgyGTjSio5JbpVoJTS8bhKiwtBe2Y8fk1a3uZu5G6C0EHMfrTXKjSTsdjdN_V4K70Odxq1v8S8kFCrw4CKT4bYY86dFlC96bAHHOmkpVt61ELxB9_g96GecUcL09hatQxn4L2hRzTjDcDVNXevEb46gXvxF99ufwd-AvgxEob4Er52H3XyH98dPZXvAnEUe60A</recordid><startdate>202206</startdate><enddate>202206</enddate><creator>Galaz‐Davison, Pablo</creator><creator>Ferreiro, Diego U.</creator><creator>Ramírez‐Sarmiento, César A.</creator><general>John Wiley &amp; Sons, Inc</general><general>Wiley Subscription Services, Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QO</scope><scope>7T5</scope><scope>7TM</scope><scope>7U9</scope><scope>8FD</scope><scope>FR3</scope><scope>H94</scope><scope>K9.</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-4647-903X</orcidid></search><sort><creationdate>202206</creationdate><title>Coevolution‐derived native and non‐native contacts determine the emergence of a novel fold in a universally conserved family of transcription factors</title><author>Galaz‐Davison, Pablo ; Ferreiro, Diego U. ; Ramírez‐Sarmiento, César A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c4387-b17485165425fd76be579de52ee2470c3efb585b264c7796e8d68be525c2d5cb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Coevolution</topic><topic>direct coupling analysis</topic><topic>Divergence</topic><topic>DNA-Directed RNA Polymerases - chemistry</topic><topic>Escherichia coli Proteins - chemistry</topic><topic>evolution</topic><topic>fold‐switch</topic><topic>Full‐length Paper</topic><topic>Full‐length Papers</topic><topic>Gremlin protein</topic><topic>Metagenomics</topic><topic>metamorphic proteins</topic><topic>Molecular dynamics</topic><topic>Nucleotide sequence</topic><topic>NusG protein</topic><topic>Peptide Elongation Factors - chemistry</topic><topic>Peptide Elongation Factors - genetics</topic><topic>Peptide Elongation Factors - metabolism</topic><topic>Protein folding</topic><topic>Protein structure</topic><topic>Proteins</topic><topic>Secondary structure</topic><topic>Trans-Activators - chemistry</topic><topic>transcription factor</topic><topic>Transcription factors</topic><topic>Transcription Factors - chemistry</topic><topic>Transcription termination</topic><topic>Virulence</topic><topic>Virulence factors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Galaz‐Davison, Pablo</creatorcontrib><creatorcontrib>Ferreiro, Diego U.</creatorcontrib><creatorcontrib>Ramírez‐Sarmiento, César A.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Biotechnology Research Abstracts</collection><collection>Immunology Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Protein science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Galaz‐Davison, Pablo</au><au>Ferreiro, Diego U.</au><au>Ramírez‐Sarmiento, César A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Coevolution‐derived native and non‐native contacts determine the emergence of a novel fold in a universally conserved family of transcription factors</atitle><jtitle>Protein science</jtitle><addtitle>Protein Sci</addtitle><date>2022-06</date><risdate>2022</risdate><volume>31</volume><issue>6</issue><spage>e4337</spage><epage>n/a</epage><pages>e4337-n/a</pages><issn>0961-8368</issn><eissn>1469-896X</eissn><abstract>The NusG protein family is structurally and functionally conserved in all domains of life. Its members directly bind RNA polymerases and regulate transcription processivity and termination. RfaH, a divergent sub‐family in its evolutionary history, is known for displaying distinct features than those in NusG proteins, which allows them to regulate the expression of virulence factors in enterobacteria in a DNA sequence‐dependent manner. A striking feature is its structural interconversion between an active fold, which is the canonical NusG three‐dimensional structure, and an autoinhibited fold, which is distinctively novel. How this novel fold is encoded within RfaH sequence to encode a metamorphic protein remains elusive. In this work, we used publicly available genomic RfaH protein sequences to construct a complete multiple sequence alignment, which was further augmented with metagenomic sequences and curated by predicting their secondary structure propensities using JPred. Coevolving pairs of residues were calculated from these sequences using plmDCA and GREMLIN, which allowed us to detect the enrichment of key metamorphic contacts after sequence filtering. Finally, we combined our coevolutionary predictions with molecular dynamics to demonstrate that these interactions are sufficient to predict the structures of both native folds, where coevolutionary‐derived non‐native contacts may play a key role in achieving the compact RfaH novel fold. All in all, emergent coevolutionary signals found within RfaH sequences encode the autoinhibited and active folds of this protein, shedding light on the key interactions responsible for the action of this metamorphic protein.</abstract><cop>Hoboken, USA</cop><pub>John Wiley &amp; Sons, Inc</pub><pmid>35634768</pmid><doi>10.1002/pro.4337</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0003-4647-903X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0961-8368
ispartof Protein science, 2022-06, Vol.31 (6), p.e4337-n/a
issn 0961-8368
1469-896X
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9123645
source Wiley-Blackwell Journals; Wiley-Blackwell Open Access Backfiles; MEDLINE; Full-Text Journals in Chemistry (Open access); PubMed Central; EZB Electronic Journals Library
subjects Coevolution
direct coupling analysis
Divergence
DNA-Directed RNA Polymerases - chemistry
Escherichia coli Proteins - chemistry
evolution
fold‐switch
Full‐length Paper
Full‐length Papers
Gremlin protein
Metagenomics
metamorphic proteins
Molecular dynamics
Nucleotide sequence
NusG protein
Peptide Elongation Factors - chemistry
Peptide Elongation Factors - genetics
Peptide Elongation Factors - metabolism
Protein folding
Protein structure
Proteins
Secondary structure
Trans-Activators - chemistry
transcription factor
Transcription factors
Transcription Factors - chemistry
Transcription termination
Virulence
Virulence factors
title Coevolution‐derived native and non‐native contacts determine the emergence of a novel fold in a universally conserved family of transcription factors
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T23%3A38%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Coevolution%E2%80%90derived%20native%20and%20non%E2%80%90native%20contacts%20determine%20the%20emergence%20of%20a%20novel%20fold%20in%20a%20universally%20conserved%20family%20of%20transcription%20factors&rft.jtitle=Protein%20science&rft.au=Galaz%E2%80%90Davison,%20Pablo&rft.date=2022-06&rft.volume=31&rft.issue=6&rft.spage=e4337&rft.epage=n/a&rft.pages=e4337-n/a&rft.issn=0961-8368&rft.eissn=1469-896X&rft_id=info:doi/10.1002/pro.4337&rft_dat=%3Cproquest_pubme%3E2671996961%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2671174233&rft_id=info:pmid/35634768&rfr_iscdi=true