Identification of essential proteins based on a new combination of topological and biological features in weighted protein–protein interaction networks

Bibliographic details
Published in:	IET systems biology 2018-12, Vol.12 (6), p.247-257
Main authors:	Elahi, Abdolkarim; Babamir, Seyed Morteza
Format:	Article
Language:	English
Subjects:	biological features; biology computing; drugs; Gene Ontology; incomplete PPI networks; low‐connectivity essential proteins; Models, Biological; network topology‐based centrality measures; ontologies (artificial intelligence); protein complexes; Protein Interaction Mapping; proteins; Research Article; Saccharomyces cerevisiae - genetics; Saccharomyces cerevisiae - metabolism; topological centrality measures; topology; weighted centrality measures; weighted protein–protein interaction networks; YDIP‐W dataset; YHQ‐W dataset; YMBD‐W dataset; YMIPS‐W dataset
Online access:	Full text

Description
Abstract:	The identification of essential proteins in protein–protein interaction (PPI) networks is important for understanding the process of cellular life and useful in diagnosis and drug design. Network topology-based centrality measures are sensitive to noise in the network. Moreover, these measures cannot detect low-connectivity essential proteins. The authors propose a new method that combines topological centrality measures with biological features, based on statistical analyses of essential proteins and protein complexes. Because PPI networks are incomplete, they face the challenge of false-positive interactions. To remove these interactions, the PPI networks are weighted by gene ontology. Furthermore, the authors use a combination of classifiers, including the newly proposed measures and traditional weighted centrality measures, to improve the precision of identification. This combination is evaluated using a logistic regression model in terms of significance levels. The proposed method has been implemented and compared with both previous and more recent efficient computational methods using six statistical standards. The results show that the proposed method identifies essential proteins more precisely than the previous methods. This level of precision was obtained on four different data sets: YHQ-W, YMBD-W, YDIP-W and YMIPS-W.
ISSN:	1751-8849, 1751-8857
DOI:	10.1049/iet-syb.2018.5024
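
The abstract mentions weighting a PPI network to suppress false-positive interactions and then ranking proteins with weighted centrality measures. The following is a minimal sketch of that general idea only, not the authors' method: the toy edge list, the weights (standing in for gene-ontology-based similarity scores), the threshold of 0.2, and all function names are assumptions made for illustration. It uses weighted degree, one of the simplest weighted centrality measures.

```python
from collections import defaultdict

# Hypothetical toy network: (protein_a, protein_b, weight).
# The weights stand in for gene-ontology-based similarity scores;
# none of these values come from the article or its data sets.
EDGES = [
    ("P1", "P2", 0.9),
    ("P1", "P3", 0.8),
    ("P2", "P3", 0.7),
    ("P3", "P4", 0.1),  # low weight: treated here as a likely false positive
    ("P4", "P5", 0.6),
]


def filter_edges(edges, threshold):
    """Keep only interactions whose weight reaches the (assumed) threshold."""
    return [(a, b, w) for a, b, w in edges if w >= threshold]


def weighted_degree(edges):
    """Sum the weights of all edges incident to each protein."""
    scores = defaultdict(float)
    for a, b, w in edges:
        scores[a] += w
        scores[b] += w
    return dict(scores)


def rank_proteins(edges, threshold=0.2):
    """Rank proteins by weighted degree after dropping low-weight edges."""
    scores = weighted_degree(filter_edges(edges, threshold))
    # Sort by descending score; break ties by protein name for determinism.
    return sorted(scores, key=lambda p: (-scores[p], p))


ranking = rank_proteins(EDGES)
print(ranking)
```

In this sketch the top-ranked proteins would be the candidates for essentiality. According to the abstract, the published method goes further by combining several such measures with biological features and evaluating the combination with a logistic regression model.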