Trace Reconstruction Problems in Computational Biology
The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the co...
Published in: | arXiv.org, 2020-10 |
---|---|
Main authors: | Bhardwaj, Vinnu ; Pevzner, Pavel A ; Rashtchian, Cyrus ; Safonova, Yana |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Tags: | |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Bhardwaj, Vinnu ; Pevzner, Pavel A ; Rashtchian, Cyrus ; Safonova, Yana |
description | The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the context of two rapidly developing research areas: immunogenomics and DNA data storage. In immunogenomics, traces correspond to mutated copies of genes, with mutations generated naturally by the adaptive immune system. In DNA data storage, traces correspond to noisy copies of DNA molecules that encode digital data, with errors being artifacts of the data retrieval process. In this paper, we introduce several new trace generation models and open questions relevant to trace reconstruction for immunogenomics and DNA data storage, survey theoretical results on trace reconstruction, and highlight their connections to computational biology. Throughout, we discuss the applicability and shortcomings of known solutions and suggest future research directions. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2450885096</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2450885096</sourcerecordid><originalsourceid>FETCH-proquest_journals_24508850963</originalsourceid><addsrcrecordid>eNqNi0ELwiAYQCUIGrX_IHQemE6za6PoGLH7cGLhcPuWnx769xX0Azo9eLy3IAUXYlfpmvMVKREHxhhXey6lKIhqo7GO3pyFCVPMNnmY6DVCH9yI1E-0gXHOyXy9CfToIcDjtSHLuwnoyh_XZHs-tc2lmiM8s8PUDZDjZ8CO15JpLdlBif-qNwSqNdo</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2450885096</pqid></control><display><type>article</type><title>Trace Reconstruction Problems in Computational Biology</title><source>Free E- Journals</source><creator>Bhardwaj, Vinnu ; Pevzner, Pavel A ; Rashtchian, Cyrus ; Safonova, Yana</creator><creatorcontrib>Bhardwaj, Vinnu ; Pevzner, Pavel A ; Rashtchian, Cyrus ; Safonova, Yana</creatorcontrib><description>The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the context of two rapidly developing research areas: immunogenomics and DNA data storage. In immunogenomics, traces correspond to mutated copies of genes, with mutations generated naturally by the adaptive immune system. In DNA data storage, traces correspond to noisy copies of DNA molecules that encode digital data, with errors being artifacts of the data retrieval process. In this paper, we introduce several new trace generation models and open questions relevant to trace reconstruction for immunogenomics and DNA data storage, survey theoretical results on trace reconstruction, and highlight their connections to computational biology. 
Throughout, we discuss the applicability and shortcomings of known solutions and suggest future research directions.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Adaptive systems ; Biology ; Data retrieval ; Data storage ; Deoxyribonucleic acid ; Digital data ; DNA ; Immune system ; Mutation ; Reconstruction</subject><ispartof>arXiv.org, 2020-10</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Bhardwaj, Vinnu</creatorcontrib><creatorcontrib>Pevzner, Pavel A</creatorcontrib><creatorcontrib>Rashtchian, Cyrus</creatorcontrib><creatorcontrib>Safonova, Yana</creatorcontrib><title>Trace Reconstruction Problems in Computational Biology</title><title>arXiv.org</title><description>The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the context of two rapidly developing research areas: immunogenomics and DNA data storage. In immunogenomics, traces correspond to mutated copies of genes, with mutations generated naturally by the adaptive immune system. In DNA data storage, traces correspond to noisy copies of DNA molecules that encode digital data, with errors being artifacts of the data retrieval process. 
In this paper, we introduce several new trace generation models and open questions relevant to trace reconstruction for immunogenomics and DNA data storage, survey theoretical results on trace reconstruction, and highlight their connections to computational biology. Throughout, we discuss the applicability and shortcomings of known solutions and suggest future research directions.</description><subject>Adaptive systems</subject><subject>Biology</subject><subject>Data retrieval</subject><subject>Data storage</subject><subject>Deoxyribonucleic acid</subject><subject>Digital data</subject><subject>DNA</subject><subject>Immune system</subject><subject>Mutation</subject><subject>Reconstruction</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNi0ELwiAYQCUIGrX_IHQemE6za6PoGLH7cGLhcPuWnx769xX0Azo9eLy3IAUXYlfpmvMVKREHxhhXey6lKIhqo7GO3pyFCVPMNnmY6DVCH9yI1E-0gXHOyXy9CfToIcDjtSHLuwnoyh_XZHs-tc2lmiM8s8PUDZDjZ8CO15JpLdlBif-qNwSqNdo</recordid><startdate>20201012</startdate><enddate>20201012</enddate><creator>Bhardwaj, Vinnu</creator><creator>Pevzner, Pavel A</creator><creator>Rashtchian, Cyrus</creator><creator>Safonova, Yana</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20201012</creationdate><title>Trace Reconstruction Problems in Computational Biology</title><author>Bhardwaj, Vinnu ; Pevzner, Pavel A ; Rashtchian, Cyrus ; Safonova, 
Yana</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_24508850963</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Adaptive systems</topic><topic>Biology</topic><topic>Data retrieval</topic><topic>Data storage</topic><topic>Deoxyribonucleic acid</topic><topic>Digital data</topic><topic>DNA</topic><topic>Immune system</topic><topic>Mutation</topic><topic>Reconstruction</topic><toplevel>online_resources</toplevel><creatorcontrib>Bhardwaj, Vinnu</creatorcontrib><creatorcontrib>Pevzner, Pavel A</creatorcontrib><creatorcontrib>Rashtchian, Cyrus</creatorcontrib><creatorcontrib>Safonova, Yana</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bhardwaj, Vinnu</au><au>Pevzner, Pavel A</au><au>Rashtchian, Cyrus</au><au>Safonova, 
Yana</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Trace Reconstruction Problems in Computational Biology</atitle><jtitle>arXiv.org</jtitle><date>2020-10-12</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>The problem of reconstructing a string from its error-prone copies, the trace reconstruction problem, was introduced by Vladimir Levenshtein two decades ago. While there has been considerable theoretical work on trace reconstruction, practical solutions have only recently started to emerge in the context of two rapidly developing research areas: immunogenomics and DNA data storage. In immunogenomics, traces correspond to mutated copies of genes, with mutations generated naturally by the adaptive immune system. In DNA data storage, traces correspond to noisy copies of DNA molecules that encode digital data, with errors being artifacts of the data retrieval process. In this paper, we introduce several new trace generation models and open questions relevant to trace reconstruction for immunogenomics and DNA data storage, survey theoretical results on trace reconstruction, and highlight their connections to computational biology. Throughout, we discuss the applicability and shortcomings of known solutions and suggest future research directions.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2020-10 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2450885096 |
source | Free E- Journals |
subjects | Adaptive systems ; Biology ; Data retrieval ; Data storage ; Deoxyribonucleic acid ; Digital data ; DNA ; Immune system ; Mutation ; Reconstruction |
title | Trace Reconstruction Problems in Computational Biology |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T15%3A00%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Trace%20Reconstruction%20Problems%20in%20Computational%20Biology&rft.jtitle=arXiv.org&rft.au=Bhardwaj,%20Vinnu&rft.date=2020-10-12&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2450885096%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2450885096&rft_id=info:pmid/&rfr_iscdi=true |