Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains

Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged strain SS1 and its parent PMSS1 to...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:mBio 2017-02, Vol.8 (1)
Hauptverfasser: Draper, Jenny L, Hansen, Lori M, Bernick, David L, Abedrabbo, Samar, Underwood, Jason G, Kong, Nguyet, Huang, Bihua C, Weis, Allison M, Weimer, Bart C, van Vliet, Arnoud H M, Pourmand, Nader, Solnick, Jay V, Karplus, Kevin, Ottemann, Karen M
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page
container_title mBio
container_volume 8
creator Draper, Jenny L
Hansen, Lori M
Bernick, David L
Abedrabbo, Samar
Underwood, Jason G
Kong, Nguyet
Huang, Bihua C
Weis, Allison M
Weimer, Bart C
van Vliet, Arnoud H M
Pourmand, Nader
Solnick, Jay V
Karplus, Kevin
Ottemann, Karen M
description Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these isolates, including movement of the transposable element IS , large and small inversions, multiple single nucleotide polymorphisms, and variation in copy number. The gene was found as 1 to 4 tandem copies located off the island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in , which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains. Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish "the genome" of a bacterial strain. Variability is usually reduced ("only sequence from a single colony"), ignored ("just publish the consensus"), or placed in the "too-hard" basket ("analysis of raw read data is more robust"). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on
doi_str_mv 10.1128/mBio.02321-16
format Article
fullrecord <record><control><sourceid>pubmed_cross</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5358919</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>28223462</sourcerecordid><originalsourceid>FETCH-LOGICAL-c387t-cd96998bdee316cc0d93c6221041671b6f38a2d4432da2b3928cdb0848ecb0993</originalsourceid><addsrcrecordid>eNpVkMtOwzAQRS0EolXpki3yD6R47MS1WSDxaotUiUXpho3lOJPWKI-SpEX9exIKFcxmZjR37pUOIZfARgBcXef3vhwxLjgEIE9In0PEgnEEcNrNEgIOXPfIsK7fWVtCgBLsnPS44lyEkvfJ28RmmXV7Wqa0WSNdFv5ji3SKRZnjDV1guxUO6aPfYVX7Zk8_fbP2BV34YpUhnWHmXRlb12BFN_usrDxdNJX1RX1BzlKb1Tj86QOynDy9PsyC-cv0-eFuHjihxk3gEi21VnGCKEA6xxItnOQcWAhyDLFMhbI8CUPBE8tjoblyScxUqNDFTGsxILcH3802zjFxWLT5mdlUPrfV3pTWm_-Xwq_NqtyZSERKQ2cQHAxcVdZ1henxF5jpOJuOs_nmbEC2-qu_gUf1L1XxBctpesc</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains</title><source>American Society for Microbiology</source><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>PubMed Central Open Access</source><creator>Draper, Jenny L ; Hansen, Lori M ; Bernick, David L ; Abedrabbo, Samar ; Underwood, Jason G ; Kong, Nguyet ; Huang, Bihua C ; Weis, Allison M ; Weimer, Bart C ; van Vliet, Arnoud H M ; Pourmand, Nader ; Solnick, Jay V ; Karplus, Kevin ; Ottemann, Karen M</creator><creatorcontrib>Draper, Jenny L ; Hansen, Lori M ; Bernick, David L ; Abedrabbo, Samar ; Underwood, Jason G ; Kong, Nguyet ; Huang, Bihua C ; Weis, Allison M ; Weimer, Bart C ; van Vliet, Arnoud H M ; Pourmand, Nader ; Solnick, Jay V ; Karplus, Kevin ; Ottemann, Karen M</creatorcontrib><description>Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these isolates, including movement of the transposable element IS , large and small inversions, multiple single nucleotide polymorphisms, and variation in copy number. The gene was found as 1 to 4 tandem copies located off the island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in , which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains. Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish "the genome" of a bacterial strain. Variability is usually reduced ("only sequence from a single colony"), ignored ("just publish the consensus"), or placed in the "too-hard" basket ("analysis of raw read data is more robust"). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as "the genome" of a bacterial strain may be misleading.</description><identifier>ISSN: 2161-2129</identifier><identifier>EISSN: 2150-7511</identifier><identifier>DOI: 10.1128/mBio.02321-16</identifier><identifier>PMID: 28223462</identifier><language>eng</language><publisher>United States: American Society for Microbiology</publisher><subject>Animals ; Genetic Variation ; Genome, Bacterial ; Helicobacter pylori - genetics ; High-Throughput Nucleotide Sequencing ; Mice ; Mutation</subject><ispartof>mBio, 2017-02, Vol.8 (1)</ispartof><rights>Copyright © 2017 Draper et al.</rights><rights>Copyright © 2017 Draper et al. 2017 Draper et al.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c387t-cd96998bdee316cc0d93c6221041671b6f38a2d4432da2b3928cdb0848ecb0993</citedby><cites>FETCH-LOGICAL-c387t-cd96998bdee316cc0d93c6221041671b6f38a2d4432da2b3928cdb0848ecb0993</cites><orcidid>0000-0002-7471-1978 ; 0000-0003-0203-1305 ; 0000-0002-6916-6442 ; 0000-0001-6265-7401</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5358919/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5358919/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,3175,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28223462$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Draper, Jenny L</creatorcontrib><creatorcontrib>Hansen, Lori M</creatorcontrib><creatorcontrib>Bernick, David L</creatorcontrib><creatorcontrib>Abedrabbo, Samar</creatorcontrib><creatorcontrib>Underwood, Jason G</creatorcontrib><creatorcontrib>Kong, Nguyet</creatorcontrib><creatorcontrib>Huang, Bihua C</creatorcontrib><creatorcontrib>Weis, Allison M</creatorcontrib><creatorcontrib>Weimer, Bart C</creatorcontrib><creatorcontrib>van Vliet, Arnoud H M</creatorcontrib><creatorcontrib>Pourmand, Nader</creatorcontrib><creatorcontrib>Solnick, Jay V</creatorcontrib><creatorcontrib>Karplus, Kevin</creatorcontrib><creatorcontrib>Ottemann, Karen M</creatorcontrib><title>Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains</title><title>mBio</title><addtitle>mBio</addtitle><description>Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these isolates, including movement of the transposable element IS , large and small inversions, multiple single nucleotide polymorphisms, and variation in copy number. The gene was found as 1 to 4 tandem copies located off the island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in , which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains. Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish "the genome" of a bacterial strain. Variability is usually reduced ("only sequence from a single colony"), ignored ("just publish the consensus"), or placed in the "too-hard" basket ("analysis of raw read data is more robust"). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as "the genome" of a bacterial strain may be misleading.</description><subject>Animals</subject><subject>Genetic Variation</subject><subject>Genome, Bacterial</subject><subject>Helicobacter pylori - genetics</subject><subject>High-Throughput Nucleotide Sequencing</subject><subject>Mice</subject><subject>Mutation</subject><issn>2161-2129</issn><issn>2150-7511</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVkMtOwzAQRS0EolXpki3yD6R47MS1WSDxaotUiUXpho3lOJPWKI-SpEX9exIKFcxmZjR37pUOIZfARgBcXef3vhwxLjgEIE9In0PEgnEEcNrNEgIOXPfIsK7fWVtCgBLsnPS44lyEkvfJ28RmmXV7Wqa0WSNdFv5ji3SKRZnjDV1guxUO6aPfYVX7Zk8_fbP2BV34YpUhnWHmXRlb12BFN_usrDxdNJX1RX1BzlKb1Tj86QOynDy9PsyC-cv0-eFuHjihxk3gEi21VnGCKEA6xxItnOQcWAhyDLFMhbI8CUPBE8tjoblyScxUqNDFTGsxILcH3802zjFxWLT5mdlUPrfV3pTWm_-Xwq_NqtyZSERKQ2cQHAxcVdZ1henxF5jpOJuOs_nmbEC2-qu_gUf1L1XxBctpesc</recordid><startdate>20170221</startdate><enddate>20170221</enddate><creator>Draper, Jenny L</creator><creator>Hansen, Lori M</creator><creator>Bernick, David L</creator><creator>Abedrabbo, Samar</creator><creator>Underwood, Jason G</creator><creator>Kong, Nguyet</creator><creator>Huang, Bihua C</creator><creator>Weis, Allison M</creator><creator>Weimer, Bart C</creator><creator>van Vliet, Arnoud H M</creator><creator>Pourmand, Nader</creator><creator>Solnick, Jay V</creator><creator>Karplus, Kevin</creator><creator>Ottemann, Karen M</creator><general>American Society for Microbiology</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-7471-1978</orcidid><orcidid>https://orcid.org/0000-0003-0203-1305</orcidid><orcidid>https://orcid.org/0000-0002-6916-6442</orcidid><orcidid>https://orcid.org/0000-0001-6265-7401</orcidid></search><sort><creationdate>20170221</creationdate><title>Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains</title><author>Draper, Jenny L ; Hansen, Lori M ; Bernick, David L ; Abedrabbo, Samar ; Underwood, Jason G ; Kong, Nguyet ; Huang, Bihua C ; Weis, Allison M ; Weimer, Bart C ; van Vliet, Arnoud H M ; Pourmand, Nader ; Solnick, Jay V ; Karplus, Kevin ; Ottemann, Karen M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c387t-cd96998bdee316cc0d93c6221041671b6f38a2d4432da2b3928cdb0848ecb0993</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Animals</topic><topic>Genetic Variation</topic><topic>Genome, Bacterial</topic><topic>Helicobacter pylori - genetics</topic><topic>High-Throughput Nucleotide Sequencing</topic><topic>Mice</topic><topic>Mutation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Draper, Jenny L</creatorcontrib><creatorcontrib>Hansen, Lori M</creatorcontrib><creatorcontrib>Bernick, David L</creatorcontrib><creatorcontrib>Abedrabbo, Samar</creatorcontrib><creatorcontrib>Underwood, Jason G</creatorcontrib><creatorcontrib>Kong, Nguyet</creatorcontrib><creatorcontrib>Huang, Bihua C</creatorcontrib><creatorcontrib>Weis, Allison M</creatorcontrib><creatorcontrib>Weimer, Bart C</creatorcontrib><creatorcontrib>van Vliet, Arnoud H M</creatorcontrib><creatorcontrib>Pourmand, Nader</creatorcontrib><creatorcontrib>Solnick, Jay V</creatorcontrib><creatorcontrib>Karplus, Kevin</creatorcontrib><creatorcontrib>Ottemann, Karen M</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>mBio</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Draper, Jenny L</au><au>Hansen, Lori M</au><au>Bernick, David L</au><au>Abedrabbo, Samar</au><au>Underwood, Jason G</au><au>Kong, Nguyet</au><au>Huang, Bihua C</au><au>Weis, Allison M</au><au>Weimer, Bart C</au><au>van Vliet, Arnoud H M</au><au>Pourmand, Nader</au><au>Solnick, Jay V</au><au>Karplus, Kevin</au><au>Ottemann, Karen M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains</atitle><jtitle>mBio</jtitle><addtitle>mBio</addtitle><date>2017-02-21</date><risdate>2017</risdate><volume>8</volume><issue>1</issue><issn>2161-2129</issn><eissn>2150-7511</eissn><abstract>Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these isolates, including movement of the transposable element IS , large and small inversions, multiple single nucleotide polymorphisms, and variation in copy number. The gene was found as 1 to 4 tandem copies located off the island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in , which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains. Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish "the genome" of a bacterial strain. Variability is usually reduced ("only sequence from a single colony"), ignored ("just publish the consensus"), or placed in the "too-hard" basket ("analysis of raw read data is more robust"). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as "the genome" of a bacterial strain may be misleading.</abstract><cop>United States</cop><pub>American Society for Microbiology</pub><pmid>28223462</pmid><doi>10.1128/mBio.02321-16</doi><orcidid>https://orcid.org/0000-0002-7471-1978</orcidid><orcidid>https://orcid.org/0000-0003-0203-1305</orcidid><orcidid>https://orcid.org/0000-0002-6916-6442</orcidid><orcidid>https://orcid.org/0000-0001-6265-7401</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2161-2129
ispartof mBio, 2017-02, Vol.8 (1)
issn 2161-2129
2150-7511
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5358919
source American Society for Microbiology; MEDLINE; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; PubMed Central Open Access
subjects Animals
Genetic Variation
Genome, Bacterial
Helicobacter pylori - genetics
High-Throughput Nucleotide Sequencing
Mice
Mutation
title Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T22%3A51%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pubmed_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Fallacy%20of%20the%20Unique%20Genome:%20Sequence%20Diversity%20within%20Single%20Helicobacter%20pylori%20Strains&rft.jtitle=mBio&rft.au=Draper,%20Jenny%20L&rft.date=2017-02-21&rft.volume=8&rft.issue=1&rft.issn=2161-2129&rft.eissn=2150-7511&rft_id=info:doi/10.1128/mBio.02321-16&rft_dat=%3Cpubmed_cross%3E28223462%3C/pubmed_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/28223462&rfr_iscdi=true