Trace Reconstruction Problems in Computational Biology

The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the co...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2020-10
Hauptverfasser: Bhardwaj, Vinnu, Pevzner, Pavel A, Rashtchian, Cyrus, Safonova, Yana
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Bhardwaj, Vinnu
Pevzner, Pavel A
Rashtchian, Cyrus
Safonova, Yana
description The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the context of two rapidly developing research areas: immunogenomics and DNA data storage. In immunogenomics, traces correspond to mutated copies of genes, with mutations generated naturally by the adaptive immune system. In DNA data storage, traces correspond to noisy copies of DNA molecules that encode digital data, with errors being artifacts of the data retrieval process. In this paper, we introduce several new trace generation models and open questions relevant to trace reconstruction for immunogenomics and DNA data storage, survey theoretical results on trace reconstruction, and highlight their connections to computational biology. Throughout, we discuss the applicability and shortcomings of known solutions and suggest future research directions.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2450885096</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2450885096</sourcerecordid><originalsourceid>FETCH-proquest_journals_24508850963</originalsourceid><addsrcrecordid>eNqNi0ELwiAYQCUIGrX_IHQemE6za6PoGLH7cGLhcPuWnx769xX0Azo9eLy3IAUXYlfpmvMVKREHxhhXey6lKIhqo7GO3pyFCVPMNnmY6DVCH9yI1E-0gXHOyXy9CfToIcDjtSHLuwnoyh_XZHs-tc2lmiM8s8PUDZDjZ8CO15JpLdlBif-qNwSqNdo</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2450885096</pqid></control><display><type>article</type><title>Trace Reconstruction Problems in Computational Biology</title><source>Free E- Journals</source><creator>Bhardwaj, Vinnu ; Pevzner, Pavel A ; Rashtchian, Cyrus ; Safonova, Yana</creator><creatorcontrib>Bhardwaj, Vinnu ; Pevzner, Pavel A ; Rashtchian, Cyrus ; Safonova, Yana</creatorcontrib><description>The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the context of two rapidly developing research areas: immunogenomics and DNA data storage. In immunogenomics, traces correspond to mutated copies of genes, with mutations generated naturally by the adaptive immune system. In DNA data storage, traces correspond to noisy copies of DNA molecules that encode digital data, with errors being artifacts of the data retrieval process. In this paper, we introduce several new trace generation models and open questions relevant to trace reconstruction for immunogenomics and DNA data storage, survey theoretical results on trace reconstruction, and highlight their connections to computational biology. Throughout, we discuss the applicability and shortcomings of known solutions and suggest future research directions.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Adaptive systems ; Biology ; Data retrieval ; Data storage ; Deoxyribonucleic acid ; Digital data ; DNA ; Immune system ; Mutation ; Reconstruction</subject><ispartof>arXiv.org, 2020-10</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Bhardwaj, Vinnu</creatorcontrib><creatorcontrib>Pevzner, Pavel A</creatorcontrib><creatorcontrib>Rashtchian, Cyrus</creatorcontrib><creatorcontrib>Safonova, Yana</creatorcontrib><title>Trace Reconstruction Problems in Computational Biology</title><title>arXiv.org</title><description>The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the context of two rapidly developing research areas: immunogenomics and DNA data storage. In immunogenomics, traces correspond to mutated copies of genes, with mutations generated naturally by the adaptive immune system. In DNA data storage, traces correspond to noisy copies of DNA molecules that encode digital data, with errors being artifacts of the data retrieval process. In this paper, we introduce several new trace generation models and open questions relevant to trace reconstruction for immunogenomics and DNA data storage, survey theoretical results on trace reconstruction, and highlight their connections to computational biology. Throughout, we discuss the applicability and shortcomings of known solutions and suggest future research directions.</description><subject>Adaptive systems</subject><subject>Biology</subject><subject>Data retrieval</subject><subject>Data storage</subject><subject>Deoxyribonucleic acid</subject><subject>Digital data</subject><subject>DNA</subject><subject>Immune system</subject><subject>Mutation</subject><subject>Reconstruction</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNi0ELwiAYQCUIGrX_IHQemE6za6PoGLH7cGLhcPuWnx769xX0Azo9eLy3IAUXYlfpmvMVKREHxhhXey6lKIhqo7GO3pyFCVPMNnmY6DVCH9yI1E-0gXHOyXy9CfToIcDjtSHLuwnoyh_XZHs-tc2lmiM8s8PUDZDjZ8CO15JpLdlBif-qNwSqNdo</recordid><startdate>20201012</startdate><enddate>20201012</enddate><creator>Bhardwaj, Vinnu</creator><creator>Pevzner, Pavel A</creator><creator>Rashtchian, Cyrus</creator><creator>Safonova, Yana</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20201012</creationdate><title>Trace Reconstruction Problems in Computational Biology</title><author>Bhardwaj, Vinnu ; Pevzner, Pavel A ; Rashtchian, Cyrus ; Safonova, Yana</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_24508850963</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Adaptive systems</topic><topic>Biology</topic><topic>Data retrieval</topic><topic>Data storage</topic><topic>Deoxyribonucleic acid</topic><topic>Digital data</topic><topic>DNA</topic><topic>Immune system</topic><topic>Mutation</topic><topic>Reconstruction</topic><toplevel>online_resources</toplevel><creatorcontrib>Bhardwaj, Vinnu</creatorcontrib><creatorcontrib>Pevzner, Pavel A</creatorcontrib><creatorcontrib>Rashtchian, Cyrus</creatorcontrib><creatorcontrib>Safonova, Yana</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bhardwaj, Vinnu</au><au>Pevzner, Pavel A</au><au>Rashtchian, Cyrus</au><au>Safonova, Yana</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Trace Reconstruction Problems in Computational Biology</atitle><jtitle>arXiv.org</jtitle><date>2020-10-12</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the context of two rapidly developing research areas: immunogenomics and DNA data storage. In immunogenomics, traces correspond to mutated copies of genes, with mutations generated naturally by the adaptive immune system. In DNA data storage, traces correspond to noisy copies of DNA molecules that encode digital data, with errors being artifacts of the data retrieval process. In this paper, we introduce several new trace generation models and open questions relevant to trace reconstruction for immunogenomics and DNA data storage, survey theoretical results on trace reconstruction, and highlight their connections to computational biology. Throughout, we discuss the applicability and shortcomings of known solutions and suggest future research directions.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2020-10
issn 2331-8422
language eng
recordid cdi_proquest_journals_2450885096
source Free E- Journals
subjects Adaptive systems
Biology
Data retrieval
Data storage
Deoxyribonucleic acid
Digital data
DNA
Immune system
Mutation
Reconstruction
title Trace Reconstruction Problems in Computational Biology
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T15%3A00%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Trace%20Reconstruction%20Problems%20in%20Computational%20Biology&rft.jtitle=arXiv.org&rft.au=Bhardwaj,%20Vinnu&rft.date=2020-10-12&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2450885096%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2450885096&rft_id=info:pmid/&rfr_iscdi=true