A Graph-Based Approach for the DNA Word Design Problem

The aim of this paper is to improve the best known solution of an important problem, the DNA Word Design problem, which has its roots in Bioinformatics and Coding Theory. The problem is to design DNA codes that satisfy some combinatorial constraints. The constraints considered are: minimum Hamming d...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE/ACM transactions on computational biology and bioinformatics 2021-11, Vol.18 (6), p.2747-2752
Hauptverfasser: Luncasu, Victor, Raschip, Madalina
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The aim of this paper is to improve the best known solution of an important problem, the DNA Word Design problem, which has its roots in Bioinformatics and Coding Theory. The problem is to design DNA codes that satisfy some combinatorial constraints. The constraints considered are: minimum Hamming distance, fixed GC content and the reverse complement Hamming distance. The problem is modeled as a maximum independent set problem. Existing complex approaches for the maximum independent set problem, suitable for large graphs, were tested. In order to tackle large instances, libraries for external memory computations and sampling techniques were investigated. Eventually, we succeed in finding good lower bounds for the instances that were analyzed.
ISSN:1545-5963
1557-9964
DOI:10.1109/TCBB.2020.3008346