Deep-Learning Resources for Studying Glycan-Mediated Host-Microbe Interactions

Glycans, the most diverse biopolymer, are shaped by evolutionary pressures stemming from host-microbe interactions. Here, we present machine learning and bioinformatics methods to leverage the evolutionary information present in glycans to gain insights into how pathogens and commensals interact wit...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Cell host & microbe 2021-01, Vol.29 (1), p.132-144.e3
Hauptverfasser: Bojar, Daniel, Powers, Rani K., Camacho, Diogo M., Collins, James J.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Glycans, the most diverse biopolymer, are shaped by evolutionary pressures stemming from host-microbe interactions. Here, we present machine learning and bioinformatics methods to leverage the evolutionary information present in glycans to gain insights into how pathogens and commensals interact with hosts. By using techniques from natural language processing, we develop deep-learning models for glycans that are trained on a curated dataset of 19,299 unique glycans and can be used to study and predict glycan functions. We show that these models can be utilized to predict glycan immunogenicity and the pathogenicity of bacterial strains, as well as investigate glycan-mediated immune evasion via molecular mimicry. We also develop glycan-alignment methods and use these to analyze virulence-determining glycan motifs in the capsular polysaccharides of bacterial pathogens. These resources enable one to identify and study glycan motifs involved in immunogenicity, pathogenicity, molecular mimicry, and immune evasion, expanding our understanding of host-microbe interactions. [Display omitted] •Glycan-focused language models can be used for sequence-to-function models•Information in glycans predicts immunogenicity, pathogenicity, and taxonomic origin•Glycan alignments shed light into bacterial virulence Bojar et al. present a workflow that combines machine learning and bioinformatics techniques to analyze the prominent role of glycans in host-microbe interactions. The herein developed glycan-focused language models and alignments allow for the prediction and analysis of glycan immunogenicity, association with pathogenicity, and taxonomic classification.
ISSN:1931-3128
1934-6069
DOI:10.1016/j.chom.2020.10.004