Molecular-level similarity search brings computing to DNA data storage
Callista Bee,
Yuan-Jyue Chen,
Melissa Queen,
David Ward,
Xiaomeng Liu,
Lee Organick,
Georg Seelig,
Karin Strauss () and
Luis Ceze ()
Additional contact information
Callista Bee: University of Washington
Yuan-Jyue Chen: Microsoft Research
Melissa Queen: University of Washington
David Ward: University of Washington
Xiaomeng Liu: University of Washington
Lee Organick: University of Washington
Georg Seelig: University of Washington
Karin Strauss: Microsoft Research
Luis Ceze: University of Washington
Nature Communications, 2021, vol. 12, issue 1, 1-9
Abstract:
Abstract As global demand for digital storage capacity grows, storage technologies based on synthetic DNA have emerged as a dense and durable alternative to traditional media. Existing approaches leverage robust error correcting codes and precise molecular mechanisms to reliably retrieve specific files from large databases. Typically, files are retrieved using a pre-specified key, analogous to a filename. However, these approaches lack the ability to perform more complex computations over the stored data, such as similarity search: e.g., finding images that look similar to an image of interest without prior knowledge of their file names. Here we demonstrate a technique for executing similarity search over a DNA-based database of 1.6 million images. Queries are implemented as hybridization probes, and a key step in our approach was to learn an image-to-sequence encoding ensuring that queries preferentially bind to targets representing visually similar images. Experimental results show that our molecular implementation performs comparably to state-of-the-art in silico algorithms for similarity search.
Date: 2021
References: Add references at CitEc
Citations: View citations in EconPapers (3)
Downloads: (external link)
https://www.nature.com/articles/s41467-021-24991-z Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:12:y:2021:i:1:d:10.1038_s41467-021-24991-z
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-021-24991-z
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().