Computational identification of ubiquitination sites in Arabidopsis thaliana using convolutional neural networks
Published in: | Plant molecular biology 2021-04, Vol.105 (6), p.601-610 |
---|---|
Main authors: | , , , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Summary: | Key message
We developed two CNNs for predicting ubiquitination sites in Arabidopsis thaliana, demonstrated their competitive performance, analyzed amino acid physicochemical properties and the CNN structures, and predicted ubiquitination sites in Arabidopsis.
As an important posttranslational protein modification, ubiquitination plays critical roles in plant physiology, including growth and development, responses to biotic and abiotic stress, and metabolism. Many ubiquitination site prediction models have been developed for human, mouse, and yeast, but few exist for the plant Arabidopsis thaliana. We therefore propose two convolutional neural network (CNN) based models for predicting ubiquitination sites in A. thaliana. The two models reach AUC (area under the ROC curve) values of 0.924 and 0.913, respectively, in five-fold cross-validation, and 0.921 and 0.914, respectively, in an independent test, outperforming other models. We analyze in depth the amino acid physicochemical properties of the sequence regions neighboring the ubiquitination sites, and study the influence of the CNN structure on prediction performance. Potential ubiquitination sites across the Arabidopsis proteome are predicted using the two CNN models. The source code, training and test datasets, and predicted ubiquitination sites in the Arabidopsis proteome are available on GitHub (http://github.com/nongdaxiaofeng/CNNAthUbi) for interested users. |
---|---|
ISSN: | 0167-4412 1573-5028 |
DOI: | 10.1007/s11103-020-01112-w |
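
The summary reports model quality as AUC (area under the ROC curve). As a minimal sketch of what that metric measures, the snippet below computes AUC from binary labels and prediction scores using the rank-sum (Mann-Whitney U) formulation. It is an illustration only, not the authors' evaluation code: the function name `roc_auc` and the toy labels and scores are assumptions made for this example.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via the rank-sum (Mann-Whitney U) statistic.

    labels: iterable of 0/1 (1 = positive class, e.g. a ubiquitination site)
    scores: iterable of predicted scores, higher = more likely positive
    Tied scores receive the average of the ranks they span.
    """
    pairs = sorted(zip(scores, labels))  # ascending by score
    n = len(pairs)
    ranks = [0.0] * n

    # Assign 1-based ranks, averaging over runs of tied scores.
    i = 0
    while i < n:
        j = i
        while j + 1 < n and pairs[j + 1][0] == pairs[i][0]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[k] = average_rank
        i = j + 1

    n_pos = sum(1 for _, label in pairs if label == 1)
    n_neg = n - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative")

    rank_sum_pos = sum(r for r, (_, label) in zip(ranks, pairs) if label == 1)
    u_statistic = rank_sum_pos - n_pos * (n_pos + 1) / 2
    return u_statistic / (n_pos * n_neg)


if __name__ == "__main__":
    # Hypothetical toy data, not taken from the paper.
    toy_labels = [0, 0, 1, 1]
    toy_scores = [0.1, 0.4, 0.35, 0.8]
    print(roc_auc(toy_labels, toy_scores))  # 0.75
```

Read this way, an AUC of 0.924 means that a randomly chosen true site receives a higher score than a randomly chosen non-site about 92% of the time, with ties counted as half.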