NetAUC: A network-based multi-biomarker identification method by AUC optimization

•Propose a novel method for identifying biomarkers by AUC optimization model.•Combine gene expression and topological information from protein-protein interaction network to construct the integrated network.•Introduce the label propagation algorithm to highlight the important genes.•Introduce the sm...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Methods (San Diego, Calif.) Calif.), 2022-02, Vol.198, p.56-64
Hauptverfasser: Li, Xing-Yi, Xiang, Ju, Wu, Fang-Xiang, Li, Min
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Propose a novel method for identifying biomarkers by AUC optimization model.•Combine gene expression and topological information from protein-protein interaction network to construct the integrated network.•Introduce the label propagation algorithm to highlight the important genes.•Introduce the smooth hinge loss function into AUC optimization model. Complex diseases are caused by a variety of factors, and their diagnosis, treatment and prognosis are usually difficult. Proteins play an indispensable role in living organisms and perform specific biological functions by interacting with other proteins or biomolecules, their dysfunction may lead to diseases, it is a natural way to mine disease-related biomarkers from protein-protein interaction network. AUC, the area under the receiver operating characteristics (ROC) curve, is regarded as a gold standard to evaluate the effectiveness of a binary classifier, which measures the classification ability of an algorithm under arbitrary distribution or any misclassification cost. In this study, we have proposed a network-based multi-biomarker identification method by AUC optimization (NetAUC), which integrates gene expression and the network information to identify biomarkers for the complex disease analysis. The main purpose is to optimize two objectives simultaneously: maximizing AUC and minimizing the number of selected features. We have applied NetAUC to two types of disease analysis: 1) prognosis of breast cancer, 2) classification of similar diseases. The results show that NetAUC can identify a small panel of disease-related biomarkers which have the powerful classification ability and the functional interpretability.
ISSN:1046-2023
1095-9130
DOI:10.1016/j.ymeth.2021.08.001