Parallel molecular computation on digital data stored in DNA

DNA is an incredibly dense storage medium for digital data. However, computing on the stored information is expensive and slow, requiring rounds of sequencing, in silico computation, and DNA synthesis. Prior work on accessing and modifying data using DNA hybridization or enzymatic reactions had limi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings of the National Academy of Sciences - PNAS 2023-09, Vol.120 (37), p.e2217330120
Hauptverfasser: Wang, Boya, Wang, Siyuan Stella, Chalk, Cameron, Ellington, Andrew D., Soloveichik, David
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 37
container_start_page e2217330120
container_title Proceedings of the National Academy of Sciences - PNAS
container_volume 120
creator Wang, Boya
Wang, Siyuan Stella
Chalk, Cameron
Ellington, Andrew D.
Soloveichik, David
description DNA is an incredibly dense storage medium for digital data. However, computing on the stored information is expensive and slow, requiring rounds of sequencing, in silico computation, and DNA synthesis. Prior work on accessing and modifying data using DNA hybridization or enzymatic reactions had limited computation capabilities. Inspired by the computational power of “DNA strand displacement,” we augment DNA storage with “in-memory” molecular computation using strand displacement reactions to algorithmically modify data in a parallel manner. We show programs for binary counting and Turing universal cellular automaton Rule 110, the latter of which is, in principle, capable of implementing any computer algorithm. Information is stored in the nicks of DNA, and a secondary sequence-level encoding allows high-throughput sequencing-based readout. We conducted multiple rounds of computation on 4-bit data registers, as well as random access of data (selective access and erasure). We demonstrate that large strand displacement cascades with 244 distinct strand exchanges (sequential and in parallel) can use naturally occurring DNA sequence from M13 bacteriophage without stringent sequence design, which has the potential to improve the scale of computation and decrease cost. Our work merges DNA storage and DNA computing, setting the foundation of entirely molecular algorithms for parallel manipulation of digital information preserved in DNA.
doi_str_mv 10.1073/pnas.2217330120
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10500265</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2861644315</sourcerecordid><originalsourceid>FETCH-LOGICAL-c399t-fe6f74848ae2b830c146de3d53c84a180d5257065be70d7b76e747e40ae624533</originalsourceid><addsrcrecordid>eNpdkc1Lw0AQxRdRbP04ew148RI7-70BQUr9hKIe9Lxsk6lGNtm6mwj-96a0KAoDc5gfjzfvEXJC4ZyC5pNV69I5Y1RzDpTBDhlTKGiuRAG7ZAzAdG4EEyNykNI7ABTSwD4Zca1UwQ0bk4snF5336LMmeCx772JWhmbVd66rQ5sNU9Wvded8VrnOZakLEausbrOrh-kR2Vs6n_B4uw_Jy8318-wunz_e3s-m87zkRdHlS1RLLYwwDtnCcCipUBXySvLSCEcNVJJJDUouUEOlF1qhFhoFOFRMSM4PyeVGd9UvGqxKbLvBtF3FunHxywZX27-Xtn6zr-HTUpBDBkoOCmdbhRg-ekydbepUoveuxdAny4yiSghO1-jpP_Q99LEd_ltTEoRiBQzUZEOVMaQUcfnjhoJdV2PX1djfavg34_N_TA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2865046290</pqid></control><display><type>article</type><title>Parallel molecular computation on digital data stored in DNA</title><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>Free Full-Text Journals in Chemistry</source><creator>Wang, Boya ; Wang, Siyuan Stella ; Chalk, Cameron ; Ellington, Andrew D. ; Soloveichik, David</creator><creatorcontrib>Wang, Boya ; Wang, Siyuan Stella ; Chalk, Cameron ; Ellington, Andrew D. ; Soloveichik, David</creatorcontrib><description>DNA is an incredibly dense storage medium for digital data. However, computing on the stored information is expensive and slow, requiring rounds of sequencing, in silico computation, and DNA synthesis. Prior work on accessing and modifying data using DNA hybridization or enzymatic reactions had limited computation capabilities. Inspired by the computational power of “DNA strand displacement,” we augment DNA storage with “in-memory” molecular computation using strand displacement reactions to algorithmically modify data in a parallel manner. We show programs for binary counting and Turing universal cellular automaton Rule 110, the latter of which is, in principle, capable of implementing any computer algorithm. Information is stored in the nicks of DNA, and a secondary sequence-level encoding allows high-throughput sequencing-based readout. We conducted multiple rounds of computation on 4-bit data registers, as well as random access of data (selective access and erasure). We demonstrate that large strand displacement cascades with 244 distinct strand exchanges (sequential and in parallel) can use naturally occurring DNA sequence from M13 bacteriophage without stringent sequence design, which has the potential to improve the scale of computation and decrease cost. Our work merges DNA storage and DNA computing, setting the foundation of entirely molecular algorithms for parallel manipulation of digital information preserved in DNA.</description><identifier>ISSN: 0027-8424</identifier><identifier>ISSN: 1091-6490</identifier><identifier>EISSN: 1091-6490</identifier><identifier>DOI: 10.1073/pnas.2217330120</identifier><identifier>PMID: 37669382</identifier><language>eng</language><publisher>Washington: National Academy of Sciences</publisher><subject>Algorithms ; Biological Sciences ; Cellular automata ; Deoxyribonucleic acid ; Digital data ; DNA ; DNA biosynthesis ; Gene sequencing ; Hybridization ; Next-generation sequencing ; Nucleotide sequence ; Phages ; Physical Sciences ; Random access</subject><ispartof>Proceedings of the National Academy of Sciences - PNAS, 2023-09, Vol.120 (37), p.e2217330120</ispartof><rights>Copyright National Academy of Sciences Sep 12, 2023</rights><rights>Copyright © 2023 the Author(s). Published by PNAS. 2023</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c399t-fe6f74848ae2b830c146de3d53c84a180d5257065be70d7b76e747e40ae624533</citedby><cites>FETCH-LOGICAL-c399t-fe6f74848ae2b830c146de3d53c84a180d5257065be70d7b76e747e40ae624533</cites><orcidid>0000-0003-1898-4063 ; 0000-0002-2585-4120 ; 0000-0001-8249-6553</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10500265/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC10500265/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids></links><search><creatorcontrib>Wang, Boya</creatorcontrib><creatorcontrib>Wang, Siyuan Stella</creatorcontrib><creatorcontrib>Chalk, Cameron</creatorcontrib><creatorcontrib>Ellington, Andrew D.</creatorcontrib><creatorcontrib>Soloveichik, David</creatorcontrib><title>Parallel molecular computation on digital data stored in DNA</title><title>Proceedings of the National Academy of Sciences - PNAS</title><description>DNA is an incredibly dense storage medium for digital data. However, computing on the stored information is expensive and slow, requiring rounds of sequencing, in silico computation, and DNA synthesis. Prior work on accessing and modifying data using DNA hybridization or enzymatic reactions had limited computation capabilities. Inspired by the computational power of “DNA strand displacement,” we augment DNA storage with “in-memory” molecular computation using strand displacement reactions to algorithmically modify data in a parallel manner. We show programs for binary counting and Turing universal cellular automaton Rule 110, the latter of which is, in principle, capable of implementing any computer algorithm. Information is stored in the nicks of DNA, and a secondary sequence-level encoding allows high-throughput sequencing-based readout. We conducted multiple rounds of computation on 4-bit data registers, as well as random access of data (selective access and erasure). We demonstrate that large strand displacement cascades with 244 distinct strand exchanges (sequential and in parallel) can use naturally occurring DNA sequence from M13 bacteriophage without stringent sequence design, which has the potential to improve the scale of computation and decrease cost. Our work merges DNA storage and DNA computing, setting the foundation of entirely molecular algorithms for parallel manipulation of digital information preserved in DNA.</description><subject>Algorithms</subject><subject>Biological Sciences</subject><subject>Cellular automata</subject><subject>Deoxyribonucleic acid</subject><subject>Digital data</subject><subject>DNA</subject><subject>DNA biosynthesis</subject><subject>Gene sequencing</subject><subject>Hybridization</subject><subject>Next-generation sequencing</subject><subject>Nucleotide sequence</subject><subject>Phages</subject><subject>Physical Sciences</subject><subject>Random access</subject><issn>0027-8424</issn><issn>1091-6490</issn><issn>1091-6490</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpdkc1Lw0AQxRdRbP04ew148RI7-70BQUr9hKIe9Lxsk6lGNtm6mwj-96a0KAoDc5gfjzfvEXJC4ZyC5pNV69I5Y1RzDpTBDhlTKGiuRAG7ZAzAdG4EEyNykNI7ABTSwD4Zca1UwQ0bk4snF5336LMmeCx772JWhmbVd66rQ5sNU9Wvded8VrnOZakLEausbrOrh-kR2Vs6n_B4uw_Jy8318-wunz_e3s-m87zkRdHlS1RLLYwwDtnCcCipUBXySvLSCEcNVJJJDUouUEOlF1qhFhoFOFRMSM4PyeVGd9UvGqxKbLvBtF3FunHxywZX27-Xtn6zr-HTUpBDBkoOCmdbhRg-ekydbepUoveuxdAny4yiSghO1-jpP_Q99LEd_ltTEoRiBQzUZEOVMaQUcfnjhoJdV2PX1djfavg34_N_TA</recordid><startdate>20230912</startdate><enddate>20230912</enddate><creator>Wang, Boya</creator><creator>Wang, Siyuan Stella</creator><creator>Chalk, Cameron</creator><creator>Ellington, Andrew D.</creator><creator>Soloveichik, David</creator><general>National Academy of Sciences</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7QG</scope><scope>7QL</scope><scope>7QP</scope><scope>7QR</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TK</scope><scope>7TM</scope><scope>7TO</scope><scope>7U9</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>H94</scope><scope>M7N</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-1898-4063</orcidid><orcidid>https://orcid.org/0000-0002-2585-4120</orcidid><orcidid>https://orcid.org/0000-0001-8249-6553</orcidid></search><sort><creationdate>20230912</creationdate><title>Parallel molecular computation on digital data stored in DNA</title><author>Wang, Boya ; Wang, Siyuan Stella ; Chalk, Cameron ; Ellington, Andrew D. ; Soloveichik, David</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c399t-fe6f74848ae2b830c146de3d53c84a180d5257065be70d7b76e747e40ae624533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Biological Sciences</topic><topic>Cellular automata</topic><topic>Deoxyribonucleic acid</topic><topic>Digital data</topic><topic>DNA</topic><topic>DNA biosynthesis</topic><topic>Gene sequencing</topic><topic>Hybridization</topic><topic>Next-generation sequencing</topic><topic>Nucleotide sequence</topic><topic>Phages</topic><topic>Physical Sciences</topic><topic>Random access</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, Boya</creatorcontrib><creatorcontrib>Wang, Siyuan Stella</creatorcontrib><creatorcontrib>Chalk, Cameron</creatorcontrib><creatorcontrib>Ellington, Andrew D.</creatorcontrib><creatorcontrib>Soloveichik, David</creatorcontrib><collection>CrossRef</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Oncogenes and Growth Factors Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Proceedings of the National Academy of Sciences - PNAS</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Boya</au><au>Wang, Siyuan Stella</au><au>Chalk, Cameron</au><au>Ellington, Andrew D.</au><au>Soloveichik, David</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Parallel molecular computation on digital data stored in DNA</atitle><jtitle>Proceedings of the National Academy of Sciences - PNAS</jtitle><date>2023-09-12</date><risdate>2023</risdate><volume>120</volume><issue>37</issue><spage>e2217330120</spage><pages>e2217330120-</pages><issn>0027-8424</issn><issn>1091-6490</issn><eissn>1091-6490</eissn><abstract>DNA is an incredibly dense storage medium for digital data. However, computing on the stored information is expensive and slow, requiring rounds of sequencing, in silico computation, and DNA synthesis. Prior work on accessing and modifying data using DNA hybridization or enzymatic reactions had limited computation capabilities. Inspired by the computational power of “DNA strand displacement,” we augment DNA storage with “in-memory” molecular computation using strand displacement reactions to algorithmically modify data in a parallel manner. We show programs for binary counting and Turing universal cellular automaton Rule 110, the latter of which is, in principle, capable of implementing any computer algorithm. Information is stored in the nicks of DNA, and a secondary sequence-level encoding allows high-throughput sequencing-based readout. We conducted multiple rounds of computation on 4-bit data registers, as well as random access of data (selective access and erasure). We demonstrate that large strand displacement cascades with 244 distinct strand exchanges (sequential and in parallel) can use naturally occurring DNA sequence from M13 bacteriophage without stringent sequence design, which has the potential to improve the scale of computation and decrease cost. Our work merges DNA storage and DNA computing, setting the foundation of entirely molecular algorithms for parallel manipulation of digital information preserved in DNA.</abstract><cop>Washington</cop><pub>National Academy of Sciences</pub><pmid>37669382</pmid><doi>10.1073/pnas.2217330120</doi><orcidid>https://orcid.org/0000-0003-1898-4063</orcidid><orcidid>https://orcid.org/0000-0002-2585-4120</orcidid><orcidid>https://orcid.org/0000-0001-8249-6553</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0027-8424
ispartof Proceedings of the National Academy of Sciences - PNAS, 2023-09, Vol.120 (37), p.e2217330120
issn 0027-8424
1091-6490
1091-6490
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10500265
source PubMed Central; Alma/SFX Local Collection; Free Full-Text Journals in Chemistry
subjects Algorithms
Biological Sciences
Cellular automata
Deoxyribonucleic acid
Digital data
DNA
DNA biosynthesis
Gene sequencing
Hybridization
Next-generation sequencing
Nucleotide sequence
Phages
Physical Sciences
Random access
title Parallel molecular computation on digital data stored in DNA
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T11%3A15%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Parallel%20molecular%20computation%20on%20digital%20data%20stored%20in%20DNA&rft.jtitle=Proceedings%20of%20the%20National%20Academy%20of%20Sciences%20-%20PNAS&rft.au=Wang,%20Boya&rft.date=2023-09-12&rft.volume=120&rft.issue=37&rft.spage=e2217330120&rft.pages=e2217330120-&rft.issn=0027-8424&rft.eissn=1091-6490&rft_id=info:doi/10.1073/pnas.2217330120&rft_dat=%3Cproquest_pubme%3E2861644315%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2865046290&rft_id=info:pmid/37669382&rfr_iscdi=true