PrGeFNE: Predicting disease-related genes by fast network embedding
•Propose a novel method for predicting disease genes by fast network embedding.•Integrate multiple types of related associations into a heterogeneous network.•Dual-layer heterogeneous network is reconstructed.•Integration of different data enhances ability of disease-gene prediction.•Develop a web t...
Gespeichert in:
Veröffentlicht in: | Methods (San Diego, Calif.) Calif.), 2021-08, Vol.192, p.3-12 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •Propose a novel method for predicting disease genes by fast network embedding.•Integrate multiple types of related associations into a heterogeneous network.•Dual-layer heterogeneous network is reconstructed.•Integration of different data enhances ability of disease-gene prediction.•Develop a web tool for candidate genes of diseases and enrichment analysis.
Identifying disease-related genes is of importance for understanding of molecule mechanisms of diseases, as well as diagnosis and treatment of diseases. Many computational methods have been proposed to predict disease-related genes, but how to make full use of multi-source biological data to enhance the ability of disease-gene prediction is still challenging. In this paper, we proposed a novel method for predicting disease-related genes by using fast network embedding (PrGeFNE), which can integrate multiple types of associations related to diseases and genes. Specifically, we first constructed a heterogeneous network by using phenotype-disease, disease-gene, protein-protein and gene-GO associations; and low-dimensional representation of nodes is extracted from the network by using a fast network embedding algorithm. Then, a dual-layer heterogeneous network was reconstructed by using the low-dimensional representation, and a network propagation was applied to the dual-layer heterogeneous network to predict disease-related genes. Through cross-validation and newly added-association validation, we displayed the important roles of different types of association data in enhancing the ability of disease-gene prediction, and confirmed the excellent performance of PrGeFNE by comparing to state-of-the-art algorithms. Furthermore, we developed a web tool that can facilitate researchers to search for candidate genes of different diseases predicted by PrGeFNE, along with the enrichment analysis of GO and pathway on candidate gene set. This may be useful for investigation of diseases’ molecular mechanisms as well as their experimental validations. The web tool is available at http://bioinformatics.csu.edu.cn/prgefne/. |
---|---|
ISSN: | 1046-2023 1095-9130 |
DOI: | 10.1016/j.ymeth.2020.06.015 |