Global protein-protein interaction networks in yeast saccharomyces cerevisiae and helicobacter pylori
Understanding many biological processes relies heavily on accurately predicting protein-protein interactions (PPIs). In this study, we propose a novel method for predicting PPIs that is based on LogitBoost with a binary bat feature selection algorithm. Our approach involves the extraction of an init...
Gespeichert in:
Veröffentlicht in: | Talanta (Oxford) 2023-12, Vol.265, p.124836, Article 124836 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Understanding many biological processes relies heavily on accurately predicting protein-protein interactions (PPIs). In this study, we propose a novel method for predicting PPIs that is based on LogitBoost with a binary bat feature selection algorithm. Our approach involves the extraction of an initial feature vector by combining pseudo amino acid composition (PseAAC), pseudo-position-specific scoring matrix (PsePSSM), reduced sequence and index-vectors (RSIV), and autocorrelation descriptor (AD). Subsequently, a binary bat algorithm is applied to eliminate redundant features, and the resulting optimal features are fed into the LogitBoost classifier for the identification of PPIs. To evaluate the proposed method, we test it on two databases, Saccharomyces cerevisiae and Helicobacter pylori, using 10-fold cross-validation, and achieve accuracies of 94.39% and 97.89%, respectively. Our results showcase the significant potential of our pipeline in accurately predicting protein-protein interactions (PPIs), thereby offering a valuable resource to the scientific research community.
[Display omitted]
•We extract and fused PsePSSM, PseAAC, AD and RSIV to enrich sequence for obtaining effective features.•We used binary bat meta-hurestic algorithm for removing unreleavant redundant features and saving predicting time.•In addition to using several classifiers, LogitBoost ensemble method is used to increase the accuracy of our model.•We show that fusion features outperform than others. Prediction accuracy does not decrease despite feature reduction. |
---|---|
ISSN: | 0039-9140 1873-3573 1873-3573 |
DOI: | 10.1016/j.talanta.2023.124836 |