An Open-Source Program (Haplo-ST) for Whole-Genome Sequence Typing Shows Extensive Diversity among Listeria monocytogenes Isolates in Outdoor Environments and Poultry Processing Plants
A reliable and standardized classification of is important for accurate strain identification during outbreak investigations. Current whole-genome sequencing (WGS)-based approaches for strain characterization are either difficult to standardize, rendering them less suitable for data exchange, or are...
Gespeichert in:
Veröffentlicht in: | Applied and environmental microbiology 2020-12, Vol.87 (1), p.1 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A reliable and standardized classification of
is important for accurate strain identification during outbreak investigations. Current whole-genome sequencing (WGS)-based approaches for strain characterization are either difficult to standardize, rendering them less suitable for data exchange, or are not freely available. Thus, we developed a portable and open-source tool, Haplo-ST, to improve standardization and provide maximum discriminatory potential to WGS data tied to a multilocus sequence typing (MLST) framework. Haplo-ST performs whole-genome MLST (wgMLST) for
while allowing for data exchangeability worldwide. This tool takes in (i) raw WGS reads as input, (ii) cleans the raw data according to user-specified parameters, (iii) assembles genes across loci by mapping to genes from reference strains, and (iv) assigns allelic profiles to assembled genes and provides a wgMLST subtyping for each isolate. Data exchangeability relies on the tool assigning allelic profiles based on a centralized nomenclature defined by the widely used BIGSdb-
database. Tests of Haplo-ST's performance with simulated reads from
reference strains demonstrated high sensitivity (97.5%), and coverage depths of ≥20× were found to be sufficient for wgMLST profiling. We then used Haplo-ST to characterize and differentiate between two groups of
isolates derived from the natural environment and poultry processing plants. Phylogenetic reconstruction identified lineages within each group, and no lineage specificity was observed with isolate phenotypes (transient versus persistent) or origins. Genetic differentiation analyses between isolate groups identified 21 significantly differentiated loci, potentially enriched for adaptation and persistence of
within poultry processing plants.
We have developed an open-source tool (https://github.com/swarnalilouha/Haplo-ST) that provides allele-based subtyping of
isolates at the whole-genome level. Along with allelic profiles, this tool also generates allele sequences and identifies paralogs, which is useful for phylogenetic tree reconstruction and deciphering relationships between closely related isolates. More broadly, Haplo-ST is flexible and can be adapted to characterize the genome of any haploid organism simply by installing an organism-specific gene database. Haplo-ST also allows for scalable subtyping of isolates; fewer reference genes can be used for low-resolution typing, whereas higher resolution can be achieved by increasing the number of |
---|---|
ISSN: | 0099-2240 1098-5336 |
DOI: | 10.1128/AEM.02248-20 |