GR-m6A: Prediction of N6-methyladenosine sites in mammals with molecular graph and residual network

RNA N6-methyladenine (m6A), which is produced by the methylation of the N6 position of eukaryotic adenine, is a relatively common post-transcriptional modification on the surface of the molecule, which frequently plays a crucial role in biological processes. Biological experimental methods to identi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers in biology and medicine 2023-09, Vol.163, p.107202-107202, Article 107202
Hauptverfasser: Qiu, Shi, Liu, Renxin, Liang, Ying
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:RNA N6-methyladenine (m6A), which is produced by the methylation of the N6 position of eukaryotic adenine, is a relatively common post-transcriptional modification on the surface of the molecule, which frequently plays a crucial role in biological processes. Biological experimental methods to identify m6A have been studied and implemented in recent years, but they cannot be promoted widely due to drawbacks such as the time and cost of reagents and equipment. Therefore, researchers have proposed computational strategies for identifying m6A sites, but these strategies do not account for the mechanism of methylation occurrence or the structure of RNA molecules. This study, therefore, proposed a novel deep learning model for predicting m6A sites, GR-m6A, which predicts m6A sites by extracting features from the physicochemical properties and spatial structure of molecules via residual networks. In GR-m6A, each RNA base string is represented by SMILES as two matrices comprising topology structural information and node attributes with molecular physicochemical characteristics. The feature encoding matrix was then obtained by fusing the topology matrix and the node matrix in accordance with the graphical convolutional network principle. Correspondingly, the more discriminative features were extracted from the encoding matrix using the residual neural network and predicted using a multilayer perceptron. As evident from the 5-fold cross-validation and independent validation, the GR-m6A model outperformed other existing methods. Thus, we hope that GR-m6A can aid researchers in predicting mammalian m6A loci. The source code and database are available at https://github.com/YingLiangjxau/GR-m6A. •GR-m6A uses the spatial structure and physicochemical properties of the molecule.•GR-m6A uses the SMILES string to translate the structural information into ASCII.•GR-m6A extracts the feature through residual neural network.•GR-m6A outperforms contemporaneous methods in most cases in both validations.
ISSN:0010-4825
1879-0534
DOI:10.1016/j.compbiomed.2023.107202