TUnA: an uncertainty-aware transformer model for sequence-based protein–protein interaction prediction

Abstract Protein–protein interactions (PPIs) are important for many biological processes, but predicting them from sequence data remains challenging. Existing deep learning models often cannot generalize to proteins not present in the training set and do not provide uncertainty estimates for their p...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Briefings in bioinformatics 2024-07, Vol.25 (5)
Hauptverfasser:	Ko, Young Su, Parkinson, Jonathan, Liu, Cong, Wang, Wei
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Amino acid sequence Biological activity Computational Biology - methods Deep Learning Estimates Gaussian process Predictions Problem Solving Protocol Protein interaction Protein Interaction Mapping - methods Proteins Proteins - chemistry Proteins - metabolism State-of-the-art reviews Transformers Uncertainty
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Abstract Protein–protein interactions (PPIs) are important for many biological processes, but predicting them from sequence data remains challenging. Existing deep learning models often cannot generalize to proteins not present in the training set and do not provide uncertainty estimates for their predictions. To address these limitations, we present TUnA, a Transformer-based uncertainty-aware model for PPI prediction. TUnA uses ESM-2 embeddings with Transformer encoders and incorporates a Spectral-normalized Neural Gaussian Process. TUnA achieves state-of-the-art performance and, importantly, evaluates uncertainty for unseen sequences. We demonstrate that TUnA’s uncertainty estimates can effectively identify the most reliable predictions, significantly reducing false positives. This capability is crucial in bridging the gap between computational predictions and experimental validation.
ISSN:	1467-5463 1477-4054 1477-4054
DOI:	10.1093/bib/bbae359