Identification of SNARE Proteins Through a Novel Hybrid Model

SNARE proteins are a large family of membrane fusion proteins. As a lot of human diseases are related to the SNARE proteins, they have attracted people to study them. Traditionally, the SNARE proteins can be identified through bioinformatics techniques, which are expensive and time-consuming. Some r...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2020, Vol.8, p.117877-117887
1. Verfasser: Li, Guilin
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:SNARE proteins are a large family of membrane fusion proteins. As a lot of human diseases are related to the SNARE proteins, they have attracted people to study them. Traditionally, the SNARE proteins can be identified through bioinformatics techniques, which are expensive and time-consuming. Some researchers attempt to identify the SNARE proteins by the machine learning algorithms. A deep learning model called SNARE-CNN is proposed to predict SNARE proteins. A 2D convolutional neural network is constructed and the Position-Specific Scoring Matrix (PSSM) profile is used to distinguish the SNARE proteins from the other kinds of proteins. Although the SNARE-CNN can achieve high accuracy, the performance of the model still has room to improve. In this paper, a novel hybrid model, that combines the random forest algorithm with the oversampling filter and 188D feature extraction method, is proposed. By trying different combinations of feature extraction methods, filtering methods and classification algorithms, the hybrid model, we proposed, can achieve the best performance among all combinations. Experiments show that the performance of our hybrid model is better than that of the SNARE-CNN model.
ISSN:2169-3536
2169-3536
DOI:10.1109/ACCESS.2020.3004446