An informatic workflow for the enhanced annotation of excretory/secretory proteins of Haemonchus contortus

Major advances in genomic and associated technologies have demanded reliable bioinformatic tools and workflows for the annotation of genes and their products via comparative analyses using well-curated reference data sets, accessible in public repositories. However, the accurate in silico annotation...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computational and structural biotechnology journal 2023-01, Vol.21, p.2696-2704
Hauptverfasser: Zheng, Yuanting, Young, Neil D., Song, Jiangning, Chang, Bill C.H., Gasser, Robin B.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Major advances in genomic and associated technologies have demanded reliable bioinformatic tools and workflows for the annotation of genes and their products via comparative analyses using well-curated reference data sets, accessible in public repositories. However, the accurate in silico annotation of molecules (proteins) encoded in organisms (e.g., multicellular parasites) which are evolutionarily distant from those for which these extensive reference data sets are available, including invertebrate model organisms (e.g., Caenorhabditis elegans – free-living nematode, and Drosophila melanogaster – the vinegar fly) and vertebrate species (e.g., Homo sapiens and Mus musculus), remains a major challenge. Here, we constructed an informatic workflow for the enhanced annotation of biologically-important, excretory/secretory (ES) proteins (“secretome”) encoded in the genome of a parasitic roundworm, called Haemonchus contortus (commonly known as the barber’s pole worm). We critically evaluated the performance of five distinct methods, refined some of them, and then combined the use of all five methods to comprehensively annotate ES proteins, according to gene ontology, biological pathways and/or metabolic (enzymatic) processes. Then, using optimised parameter settings, we applied this workflow to comprehensively annotate 2591 of all 3353 proteins (77.3%) in the secretome of H. contortus. This result is a substantial improvement (10–25%) over previous annotations using individual, “off-the-shelf” algorithms and default settings, indicating the ready applicability of the present, refined workflow to gene/protein sequence data sets from a wide range of organisms in the Tree-of-Life. [Display omitted] •We established an integrated bioinformatic workflow for the annotation of proteins that outperforms previous pipelines.•This workflow was applied to annotate biologically relevant, excretory/secretory proteins of Haemonchus contortus.•This integrative workflow will have broad applicability for protein annotation in parasites.
ISSN:2001-0370
2001-0370
DOI:10.1016/j.csbj.2023.03.025