Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains
Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged strain SS1 and its parent PMSS1 to...
Gespeichert in:
Veröffentlicht in: | mBio 2017-02, Vol.8 (1) |
---|---|
Hauptverfasser: | , , , , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 1 |
container_start_page | |
container_title | mBio |
container_volume | 8 |
creator | Draper, Jenny L Hansen, Lori M Bernick, David L Abedrabbo, Samar Underwood, Jason G Kong, Nguyet Huang, Bihua C Weis, Allison M Weimer, Bart C van Vliet, Arnoud H M Pourmand, Nader Solnick, Jay V Karplus, Kevin Ottemann, Karen M |
description | Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged
strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these
isolates, including movement of the transposable element IS
, large and small inversions, multiple single nucleotide polymorphisms, and variation in
copy number. The
gene was found as 1 to 4 tandem copies located off the
island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in
, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.
Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish "the genome" of a bacterial strain. Variability is usually reduced ("only sequence from a single colony"), ignored ("just publish the consensus"), or placed in the "too-hard" basket ("analysis of raw read data is more robust"). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen
model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on |
doi_str_mv | 10.1128/mBio.02321-16 |
format | Article |
fullrecord | <record><control><sourceid>pubmed_cross</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5358919</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>28223462</sourcerecordid><originalsourceid>FETCH-LOGICAL-c387t-cd96998bdee316cc0d93c6221041671b6f38a2d4432da2b3928cdb0848ecb0993</originalsourceid><addsrcrecordid>eNpVkMtOwzAQRS0EolXpki3yD6R47MS1WSDxaotUiUXpho3lOJPWKI-SpEX9exIKFcxmZjR37pUOIZfARgBcXef3vhwxLjgEIE9In0PEgnEEcNrNEgIOXPfIsK7fWVtCgBLsnPS44lyEkvfJ28RmmXV7Wqa0WSNdFv5ji3SKRZnjDV1guxUO6aPfYVX7Zk8_fbP2BV34YpUhnWHmXRlb12BFN_usrDxdNJX1RX1BzlKb1Tj86QOynDy9PsyC-cv0-eFuHjihxk3gEi21VnGCKEA6xxItnOQcWAhyDLFMhbI8CUPBE8tjoblyScxUqNDFTGsxILcH3802zjFxWLT5mdlUPrfV3pTWm_-Xwq_NqtyZSERKQ2cQHAxcVdZ1henxF5jpOJuOs_nmbEC2-qu_gUf1L1XxBctpesc</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains</title><source>American Society for Microbiology</source><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>PubMed Central Open Access</source><creator>Draper, Jenny L ; Hansen, Lori M ; Bernick, David L ; Abedrabbo, Samar ; Underwood, Jason G ; Kong, Nguyet ; Huang, Bihua C ; Weis, Allison M ; Weimer, Bart C ; van Vliet, Arnoud H M ; Pourmand, Nader ; Solnick, Jay V ; Karplus, Kevin ; Ottemann, Karen M</creator><creatorcontrib>Draper, Jenny L ; Hansen, Lori M ; Bernick, David L ; Abedrabbo, Samar ; Underwood, Jason G ; Kong, Nguyet ; Huang, Bihua C ; Weis, Allison M ; Weimer, Bart C ; van Vliet, Arnoud H M ; Pourmand, Nader ; Solnick, Jay V ; Karplus, Kevin ; Ottemann, Karen M</creatorcontrib><description>Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged
strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these
isolates, including movement of the transposable element IS
, large and small inversions, multiple single nucleotide polymorphisms, and variation in
copy number. The
gene was found as 1 to 4 tandem copies located off the
island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in
, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.
Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish "the genome" of a bacterial strain. Variability is usually reduced ("only sequence from a single colony"), ignored ("just publish the consensus"), or placed in the "too-hard" basket ("analysis of raw read data is more robust"). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen
model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as "the genome" of a bacterial strain may be misleading.</description><identifier>ISSN: 2161-2129</identifier><identifier>EISSN: 2150-7511</identifier><identifier>DOI: 10.1128/mBio.02321-16</identifier><identifier>PMID: 28223462</identifier><language>eng</language><publisher>United States: American Society for Microbiology</publisher><subject>Animals ; Genetic Variation ; Genome, Bacterial ; Helicobacter pylori - genetics ; High-Throughput Nucleotide Sequencing ; Mice ; Mutation</subject><ispartof>mBio, 2017-02, Vol.8 (1)</ispartof><rights>Copyright © 2017 Draper et al.</rights><rights>Copyright © 2017 Draper et al. 2017 Draper et al.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c387t-cd96998bdee316cc0d93c6221041671b6f38a2d4432da2b3928cdb0848ecb0993</citedby><cites>FETCH-LOGICAL-c387t-cd96998bdee316cc0d93c6221041671b6f38a2d4432da2b3928cdb0848ecb0993</cites><orcidid>0000-0002-7471-1978 ; 0000-0003-0203-1305 ; 0000-0002-6916-6442 ; 0000-0001-6265-7401</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5358919/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5358919/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,3175,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28223462$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Draper, Jenny L</creatorcontrib><creatorcontrib>Hansen, Lori M</creatorcontrib><creatorcontrib>Bernick, David L</creatorcontrib><creatorcontrib>Abedrabbo, Samar</creatorcontrib><creatorcontrib>Underwood, Jason G</creatorcontrib><creatorcontrib>Kong, Nguyet</creatorcontrib><creatorcontrib>Huang, Bihua C</creatorcontrib><creatorcontrib>Weis, Allison M</creatorcontrib><creatorcontrib>Weimer, Bart C</creatorcontrib><creatorcontrib>van Vliet, Arnoud H M</creatorcontrib><creatorcontrib>Pourmand, Nader</creatorcontrib><creatorcontrib>Solnick, Jay V</creatorcontrib><creatorcontrib>Karplus, Kevin</creatorcontrib><creatorcontrib>Ottemann, Karen M</creatorcontrib><title>Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains</title><title>mBio</title><addtitle>mBio</addtitle><description>Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged
strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these
isolates, including movement of the transposable element IS
, large and small inversions, multiple single nucleotide polymorphisms, and variation in
copy number. The
gene was found as 1 to 4 tandem copies located off the
island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in
, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.
Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish "the genome" of a bacterial strain. Variability is usually reduced ("only sequence from a single colony"), ignored ("just publish the consensus"), or placed in the "too-hard" basket ("analysis of raw read data is more robust"). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen
model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as "the genome" of a bacterial strain may be misleading.</description><subject>Animals</subject><subject>Genetic Variation</subject><subject>Genome, Bacterial</subject><subject>Helicobacter pylori - genetics</subject><subject>High-Throughput Nucleotide Sequencing</subject><subject>Mice</subject><subject>Mutation</subject><issn>2161-2129</issn><issn>2150-7511</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVkMtOwzAQRS0EolXpki3yD6R47MS1WSDxaotUiUXpho3lOJPWKI-SpEX9exIKFcxmZjR37pUOIZfARgBcXef3vhwxLjgEIE9In0PEgnEEcNrNEgIOXPfIsK7fWVtCgBLsnPS44lyEkvfJ28RmmXV7Wqa0WSNdFv5ji3SKRZnjDV1guxUO6aPfYVX7Zk8_fbP2BV34YpUhnWHmXRlb12BFN_usrDxdNJX1RX1BzlKb1Tj86QOynDy9PsyC-cv0-eFuHjihxk3gEi21VnGCKEA6xxItnOQcWAhyDLFMhbI8CUPBE8tjoblyScxUqNDFTGsxILcH3802zjFxWLT5mdlUPrfV3pTWm_-Xwq_NqtyZSERKQ2cQHAxcVdZ1henxF5jpOJuOs_nmbEC2-qu_gUf1L1XxBctpesc</recordid><startdate>20170221</startdate><enddate>20170221</enddate><creator>Draper, Jenny L</creator><creator>Hansen, Lori M</creator><creator>Bernick, David L</creator><creator>Abedrabbo, Samar</creator><creator>Underwood, Jason G</creator><creator>Kong, Nguyet</creator><creator>Huang, Bihua C</creator><creator>Weis, Allison M</creator><creator>Weimer, Bart C</creator><creator>van Vliet, Arnoud H M</creator><creator>Pourmand, Nader</creator><creator>Solnick, Jay V</creator><creator>Karplus, Kevin</creator><creator>Ottemann, Karen M</creator><general>American Society for Microbiology</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-7471-1978</orcidid><orcidid>https://orcid.org/0000-0003-0203-1305</orcidid><orcidid>https://orcid.org/0000-0002-6916-6442</orcidid><orcidid>https://orcid.org/0000-0001-6265-7401</orcidid></search><sort><creationdate>20170221</creationdate><title>Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains</title><author>Draper, Jenny L ; Hansen, Lori M ; Bernick, David L ; Abedrabbo, Samar ; Underwood, Jason G ; Kong, Nguyet ; Huang, Bihua C ; Weis, Allison M ; Weimer, Bart C ; van Vliet, Arnoud H M ; Pourmand, Nader ; Solnick, Jay V ; Karplus, Kevin ; Ottemann, Karen M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c387t-cd96998bdee316cc0d93c6221041671b6f38a2d4432da2b3928cdb0848ecb0993</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Animals</topic><topic>Genetic Variation</topic><topic>Genome, Bacterial</topic><topic>Helicobacter pylori - genetics</topic><topic>High-Throughput Nucleotide Sequencing</topic><topic>Mice</topic><topic>Mutation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Draper, Jenny L</creatorcontrib><creatorcontrib>Hansen, Lori M</creatorcontrib><creatorcontrib>Bernick, David L</creatorcontrib><creatorcontrib>Abedrabbo, Samar</creatorcontrib><creatorcontrib>Underwood, Jason G</creatorcontrib><creatorcontrib>Kong, Nguyet</creatorcontrib><creatorcontrib>Huang, Bihua C</creatorcontrib><creatorcontrib>Weis, Allison M</creatorcontrib><creatorcontrib>Weimer, Bart C</creatorcontrib><creatorcontrib>van Vliet, Arnoud H M</creatorcontrib><creatorcontrib>Pourmand, Nader</creatorcontrib><creatorcontrib>Solnick, Jay V</creatorcontrib><creatorcontrib>Karplus, Kevin</creatorcontrib><creatorcontrib>Ottemann, Karen M</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>mBio</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Draper, Jenny L</au><au>Hansen, Lori M</au><au>Bernick, David L</au><au>Abedrabbo, Samar</au><au>Underwood, Jason G</au><au>Kong, Nguyet</au><au>Huang, Bihua C</au><au>Weis, Allison M</au><au>Weimer, Bart C</au><au>van Vliet, Arnoud H M</au><au>Pourmand, Nader</au><au>Solnick, Jay V</au><au>Karplus, Kevin</au><au>Ottemann, Karen M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains</atitle><jtitle>mBio</jtitle><addtitle>mBio</addtitle><date>2017-02-21</date><risdate>2017</risdate><volume>8</volume><issue>1</issue><issn>2161-2129</issn><eissn>2150-7511</eissn><abstract>Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged
strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these
isolates, including movement of the transposable element IS
, large and small inversions, multiple single nucleotide polymorphisms, and variation in
copy number. The
gene was found as 1 to 4 tandem copies located off the
island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in
, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.
Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish "the genome" of a bacterial strain. Variability is usually reduced ("only sequence from a single colony"), ignored ("just publish the consensus"), or placed in the "too-hard" basket ("analysis of raw read data is more robust"). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen
model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as "the genome" of a bacterial strain may be misleading.</abstract><cop>United States</cop><pub>American Society for Microbiology</pub><pmid>28223462</pmid><doi>10.1128/mBio.02321-16</doi><orcidid>https://orcid.org/0000-0002-7471-1978</orcidid><orcidid>https://orcid.org/0000-0003-0203-1305</orcidid><orcidid>https://orcid.org/0000-0002-6916-6442</orcidid><orcidid>https://orcid.org/0000-0001-6265-7401</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2161-2129 |
ispartof | mBio, 2017-02, Vol.8 (1) |
issn | 2161-2129 2150-7511 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5358919 |
source | American Society for Microbiology; MEDLINE; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; PubMed Central Open Access |
subjects | Animals Genetic Variation Genome, Bacterial Helicobacter pylori - genetics High-Throughput Nucleotide Sequencing Mice Mutation |
title | Fallacy of the Unique Genome: Sequence Diversity within Single Helicobacter pylori Strains |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T22%3A51%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pubmed_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Fallacy%20of%20the%20Unique%20Genome:%20Sequence%20Diversity%20within%20Single%20Helicobacter%20pylori%20Strains&rft.jtitle=mBio&rft.au=Draper,%20Jenny%20L&rft.date=2017-02-21&rft.volume=8&rft.issue=1&rft.issn=2161-2129&rft.eissn=2150-7511&rft_id=info:doi/10.1128/mBio.02321-16&rft_dat=%3Cpubmed_cross%3E28223462%3C/pubmed_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/28223462&rfr_iscdi=true |