THGB: predicting ligand-receptor interactions by combining tree boosting and histogram-based gradient boosting

Ligand-receptor interaction (LRI) prediction has great significance in biological and medical research and facilitates to infer and analyze cell-to-cell communication. However, wet experiments for new LRI discovery are costly and time-consuming. Here, we propose a computational model called THGB to...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific reports 2024-11, Vol.14 (1), p.29604-14
Hauptverfasser: Zhou, Liqian, Song, Jiao, Li, Zejun, Hu, Yingxi, Guo, Wenyan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Ligand-receptor interaction (LRI) prediction has great significance in biological and medical research and facilitates to infer and analyze cell-to-cell communication. However, wet experiments for new LRI discovery are costly and time-consuming. Here, we propose a computational model called THGB to uncover new LRIs. THGB first extracts feature information of Ligand-Receptor (LR) pairs using iFeature. Next, it adopts a tree boosting model to obtain representative LR features. Finally, it devises the histogram-based gradient boosting model to capture high-quality LRIs. To assess the THGB performance, we compared it with three new LRI prediction models (i.e., CellEnBoost, CellGiQ, and CellComNet) and one classical protein-protein interaction inference model PIPR. The results demonstrated that THGB achieved the best overall predictions in terms of six evaluation indictors (i.e., precision, recall, accuracy, F1-score, AUC, and AUPR). To measure the effect of LR feature selection on the prediction, THGB was compared with four feature selection methods (i.e., PCA, NMF, LLE, and TSVD). The results showed that the tree boosting model was more appropriate to select representative LR features and improve LRI prediction. We also conducted ablation study and found that THGB with feature selection outperformed THGB without feature selection. We hope that THGB is a useful tool to find new LRIs and further infer cell-to-cell communication.
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-024-78954-7