Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lantibiotics

Microbial natural products constitute a wide variety of chemical compounds, many of which can have antibiotic, antiviral, or anticancer properties that make them interesting for clinical purposes. Natural product classes include polyketides (PKs), nonribosomal peptides (NRPs), and ribosomally synthesiz...

Detailed description

Bibliographic details
Published in: PLoS biology 2020-12, Vol.18 (12)
Main authors: Kloosterman, Alexander M, Cimermancic, Peter, Elsayed, Somayah S, Du, Chao, Hadjithomas, Michalis, Donia, Mohamed S, Fischbach, Michael A, van Wezel, Gilles P, Medema, Marnix H, Roberts, Roland G
Format: Article
Language: English
Subjects:
Online access: Full text
container_end_page
container_issue 12
container_start_page
container_title PLoS biology
container_volume 18
creator Kloosterman, Alexander M
Cimermancic, Peter
Elsayed, Somayah S
Du, Chao
Hadjithomas, Michalis
Donia, Mohamed S
Fischbach, Michael A
van Wezel, Gilles P
Medema, Marnix H
Roberts, Roland G
description Microbial natural products constitute a wide variety of chemical compounds, many of which can have antibiotic, antiviral, or anticancer properties that make them interesting for clinical purposes. Natural product classes include polyketides (PKs), nonribosomal peptides (NRPs), and ribosomally synthesized and post-translationally modified peptides (RiPPs). While variants of biosynthetic gene clusters (BGCs) for known classes of natural products are easy to identify in genome sequences, BGCs for new compound classes escape attention. In particular, evidence is accumulating that for RiPPs, subclasses known thus far may only represent the tip of an iceberg. Here, we present decRiPPter (Data-driven Exploratory Class-independent RiPP TrackER), a RiPP genome mining algorithm aimed at the discovery of novel RiPP classes. DecRiPPter combines a Support Vector Machine (SVM) that identifies candidate RiPP precursors with pan-genomic analyses to identify which of these are encoded within operon-like structures that are part of the accessory genome of a genus. Subsequently, it prioritizes such regions based on the presence of new enzymology and based on patterns of gene cluster and precursor peptide conservation across species. We then applied decRiPPter to mine 1,295 Streptomyces genomes, which led to the identification of 42 new candidate RiPP families that could not be found by existing programs. One of these was studied further and elucidated as a representative of a novel subfamily of lanthipeptides, which we designate class V. The 2D structure of the new RiPP, which we name pristinin A3 (1), was solved using nuclear magnetic resonance (NMR), tandem mass spectrometry (MS/MS) data, and chemical labeling. Two previously unidentified modifying enzymes are proposed to create the hallmark lanthionine bridges. Taken together, our work highlights how novel natural product families can be discovered by methods going beyond sequence similarity searches to integrate multiple pathway discovery criteria.
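The filtering idea summarized in the description (a precursor classifier combined with a pan-genomic accessory-genome check on operon-like gene runs) can be illustrated with a small Python sketch. This is not decRiPPter's code: the `Gene` fields, the `precursor_score` value standing in for an SVM output, and the cutoffs `score_cutoff`, `core_cutoff`, and `max_gap` are all hypothetical choices made only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    name: str
    start: int
    end: int
    strand: str
    ortholog_group: str
    precursor_score: float  # hypothetical classifier score in [0, 1]


def conservation(group_to_genomes, n_genomes):
    """Fraction of genomes in the genus that carry each ortholog group."""
    return {g: len(members) / n_genomes for g, members in group_to_genomes.items()}


def operon_like_runs(genes, max_gap=100):
    """Group neighbouring same-strand genes separated by at most max_gap bp."""
    runs, current = [], []
    for gene in sorted(genes, key=lambda g: g.start):
        same_run = (
            current
            and gene.strand == current[-1].strand
            and gene.start - current[-1].end <= max_gap
        )
        if same_run:
            current.append(gene)
        else:
            if current:
                runs.append(current)
            current = [gene]
    if current:
        runs.append(current)
    return runs


def candidate_regions(genes, group_to_genomes, n_genomes,
                      score_cutoff=0.9, core_cutoff=0.5, max_gap=100):
    """Keep runs that contain a putative precursor and are poorly conserved."""
    frac = conservation(group_to_genomes, n_genomes)
    hits = []
    for run in operon_like_runs(genes, max_gap):
        has_precursor = any(g.precursor_score >= score_cutoff for g in run)
        mean_frac = sum(frac.get(g.ortholog_group, 0.0) for g in run) / len(run)
        if has_precursor and mean_frac < core_cutoff:
            hits.append([g.name for g in run])
    return hits


# Toy data (invented): one conserved run and one accessory run with a precursor.
genes = [
    Gene("coreA", 0, 1000, "+", "OG1", 0.01),
    Gene("coreB", 1050, 2000, "+", "OG2", 0.02),
    Gene("preA", 5000, 5150, "-", "OG3", 0.95),
    Gene("modB", 5200, 6200, "-", "OG4", 0.05),
]
groups = {
    "OG1": {"g1", "g2", "g3", "g4"},
    "OG2": {"g1", "g2", "g3", "g4"},
    "OG3": {"g1"},
    "OG4": {"g1", "g2"},
}
regions = candidate_regions(genes, groups, n_genomes=4)
print(regions)
```

In this toy input only the `preA`/`modB` run passes both filters; the enzymology- and conservation-based prioritization mentioned in the description is not modelled here.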
doi_str_mv 10.1371/journal.pbio.3001026
format Article
fulltext fulltext
identifier ISSN: 1544-9173
ispartof PLoS biology, 2020-12, Vol.18 (12)
issn 1544-9173
language eng
recordid cdi_gale_infotracmisc_A650815180
source DOAJ Directory of Open Access Journals; Public Library of Science (PLoS) Journals Open Access; EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects Antibiotics
Chemical properties
Genetic aspects
Genomics
Identification and classification
Innovations
Machine learning
Methods
Natural products
Peptides
Synthesis
title Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lantibiotics
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T21%3A11%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Expansion%20of%20RiPP%20biosynthetic%20space%20through%20integration%20of%20pan-genomics%20and%20machine%20learning%20uncovers%20a%20novel%20class%20of%20lantibiotics&rft.jtitle=PLoS%20biology&rft.au=Kloosterman,%20Alexander%20M&rft.date=2020-12-22&rft.volume=18&rft.issue=12&rft.issn=1544-9173&rft_id=info:doi/10.1371/journal.pbio.3001026&rft_dat=%3Cgale%3EA650815180%3C/gale%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_galeid=A650815180&rfr_iscdi=true