Validation of bioinformatic approaches for predicting allergen cross reactivity
Part of the allergenicity assessment of newly expressed proteins in genetically engineered food crops involves an assessment of potential cross-reactivity with known allergens. Bioinformatic approaches are used to evaluate the amino acid sequence identity or similarity between newly expressed protei...
Gespeichert in:
Veröffentlicht in: | Food and chemical toxicology 2019-10, Vol.132, p.110656-110656, Article 110656 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Part of the allergenicity assessment of newly expressed proteins in genetically engineered food crops involves an assessment of potential cross-reactivity with known allergens. Bioinformatic approaches are used to evaluate the amino acid sequence identity or similarity between newly expressed proteins and the sequences of known allergens. To be useful, such approaches must be sensitive to detecting cross-reactive potential, but also capable of excluding low-risk sequences. One difficulty in comparing the effectiveness of different bioinformatic approaches has been the lack of a standardized validation and evaluation method. Here, we propose a standardized method for evaluating the sensitivity of different bioinformatic algorithms using a comprehensive database of known allergen sequences. We combine this with a previously described method for evaluating selectivity using sequences from a crop not known to commonly cause food allergy (e.g. maize) to compare the standard “>35% identity-criterion over sliding-window of ≥80 amino acids” bioinformatic approach with the previously described “one-to-one (1:1) FASTA” similarity approach using an E-value threshold of 1E-9. Results confirm the superiority of the 1:1 FASTA approach for selectively detecting cross-reactive allergens. The validation methods described here can be applied to other algorithms to select even better fit-for-purpose approaches for evaluating cross-reactive risk.
•Bioinformatic approaches are used, in part, for predicting protein allergenic risk.•Standard validation procedures for evaluating bioinformatic methods are lacking.•Sensitivity is evaluated here using a comprehensive allergen database.•Selectivity is evaluated here using maize protein sequences.•Sequence similarity rather than identity is better for identifying true allergens. |
---|---|
ISSN: | 0278-6915 1873-6351 |
DOI: | 10.1016/j.fct.2019.110656 |