Global protein-protein interaction networks in yeast saccharomyces cerevisiae and helicobacter pylori

Understanding many biological processes relies heavily on accurately predicting protein-protein interactions (PPIs). In this study, we propose a novel method for predicting PPIs that is based on LogitBoost with a binary bat feature selection algorithm. Our approach involves the extraction of an init...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Talanta (Oxford) 2023-12, Vol.265, p.124836, Article 124836
Hauptverfasser: Zandi, Farzad, Mansouri, Parvaneh, Goodarzi, Mohammad
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Understanding many biological processes relies heavily on accurately predicting protein-protein interactions (PPIs). In this study, we propose a novel method for predicting PPIs that is based on LogitBoost with a binary bat feature selection algorithm. Our approach involves the extraction of an initial feature vector by combining pseudo amino acid composition (PseAAC), pseudo-position-specific scoring matrix (PsePSSM), reduced sequence and index-vectors (RSIV), and autocorrelation descriptor (AD). Subsequently, a binary bat algorithm is applied to eliminate redundant features, and the resulting optimal features are fed into the LogitBoost classifier for the identification of PPIs. To evaluate the proposed method, we test it on two databases, Saccharomyces cerevisiae and Helicobacter pylori, using 10-fold cross-validation, and achieve accuracies of 94.39% and 97.89%, respectively. Our results showcase the significant potential of our pipeline in accurately predicting protein-protein interactions (PPIs), thereby offering a valuable resource to the scientific research community. [Display omitted] •We extract and fused PsePSSM, PseAAC, AD and RSIV to enrich sequence for obtaining effective features.•We used binary bat meta-hurestic algorithm for removing unreleavant redundant features and saving predicting time.•In addition to using several classifiers, LogitBoost ensemble method is used to increase the accuracy of our model.•We show that fusion features outperform than others. Prediction accuracy does not decrease despite feature reduction.
ISSN:0039-9140
1873-3573
1873-3573
DOI:10.1016/j.talanta.2023.124836