AliNA – a deep learning program for RNA secondary structure prediction

Nowadays there are numerous discovered natural RNA variations participating in different cellular processes and artificial RNA, e. g., aptamers, riboswitches. One of the required tasks in the investigation of their functions and mechanism of influence on cells and interaction with targets is the pre...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Molecular informatics 2023-12, Vol.42 (12), p.e202300113-n/a
Hauptverfasser: Nasaev, Shamsudin S., Mukanov, Artem R., Kuznetsov, Ivan I., Veselovsky, Alexander V.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Nowadays there are numerous discovered natural RNA variations participating in different cellular processes and artificial RNA, e. g., aptamers, riboswitches. One of the required tasks in the investigation of their functions and mechanism of influence on cells and interaction with targets is the prediction of RNA secondary structures. The classic thermodynamic‐based prediction algorithms do not consider the specificity of biological folding and deep learning methods that were designed to resolve this issue suffer from homology‐based methods problems. Herein, we present a method for RNA secondary structure prediction based on deep learning – AliNA (ALIgned Nucleic Acids). Our method successfully predicts secondary structures for non‐homologous to train‐data RNA families thanks to usage of the data augmentation techniques. Augmentation extends existing datasets with easily‐accessible simulated data. The proposed method shows a high quality of prediction across different benchmarks including pseudoknots. The method is available on GitHub for free (https://github.com/Arty40m/AliNA).
ISSN:1868-1743
1868-1751
1868-1751
DOI:10.1002/minf.202300113