{"id":721579,"date":"2021-01-27T18:55:36","date_gmt":"2021-01-28T02:55:36","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-research-item&p=721579"},"modified":"2021-01-27T18:55:36","modified_gmt":"2021-01-28T02:55:36","slug":"an-empirical-comparison-of-preservation-methods-for-synthetic-dna-data-storage","status":"publish","type":"msr-research-item","link":"https:\/\/www.microsoft.com\/en-us\/research\/publication\/an-empirical-comparison-of-preservation-methods-for-synthetic-dna-data-storage\/","title":{"rendered":"An Empirical Comparison of Preservation Methods for Synthetic DNA Data Storage"},"content":{"rendered":"
Synthetic DNA has recently risen as a viable alternative for long\u2010term digital data storage. To ensure that information is safely recovered after storage, it is essential to appropriately preserve the physical DNA molecules encoding the data. While preservation of biological DNA has been studied previously, synthetic DNA differs in that it is typically much shorter in length, it has different sequence profiles with fewer, if any, repeats (or homopolymers), and it has different contaminants. In this paper, nine different methods used to preserve data files encoded in synthetic DNA are evaluated by accelerated aging of nearly 29 000 DNA sequences. In addition to a molecular count comparison, the DNA is also sequenced and analyzed after aging. These findings show that errors and erasures are stochastic and show no practical distribution difference between preservation methods. Finally, the physical density of these methods is compared and a stability versus density trade\u2010offs discussion provided.<\/p>\n","protected":false},"excerpt":{"rendered":"
Synthetic DNA has recently risen as a viable alternative for long\u2010term digital data storage. To ensure that information is safely recovered after storage, it is essential to appropriately preserve the physical DNA molecules encoding the data. While preservation of biological DNA has been studied previously, synthetic DNA differs in that it is typically much shorter […]<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"msr-content-type":[3],"msr-research-highlight":[],"research-area":[13552,13553,13547],"msr-publication-type":[193715],"msr-product-type":[],"msr-focus-area":[],"msr-platform":[],"msr-download-source":[],"msr-locale":[268875],"msr-post-option":[],"msr-field-of-study":[251170],"msr-conference":[],"msr-journal":[],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-721579","msr-research-item","type-msr-research-item","status-publish","hentry","msr-research-area-hardware-devices","msr-research-area-medical-health-genomics","msr-research-area-systems-and-networking","msr-locale-en_us","msr-field-of-study-dna-data-storage"],"msr_publishername":"","msr_edition":"","msr_affiliation":"","msr_published_date":"2021-1","msr_host":"","msr_duration":"","msr_version":"","msr_speaker":"","msr_other_contributors":"","msr_booktitle":"","msr_pages_string":"","msr_chapter":"","msr_isbn":"","msr_journal":"Small Methods","msr_volume":"","msr_number":"","msr_editors":"","msr_series":"","msr_issue":"","msr_organization":"","msr_how_published":"","msr_notes":"","msr_highlight_text":"","msr_release_tracker_id":"","msr_original_fields_of_study":"","msr_download_urls":"","msr_external_url":"","msr_secondary_video_url":"","msr_longbiography":"","msr_microsoftintellectualproperty":1,"msr_main_download":"","msr_publicationurl":"","msr_doi":"","msr_publication_uploader":[{"type":"url","viewUrl":"false","id":"false","title":"http:\/\/Synthetic%20DNA%20has%20recently%20risen%20as%20a%20viable%20alternative%20for%20long\u2010term%20digital%20data%20storage.%20To%20ensure%20that%20information%20is%20safely%20recovered%20after%20storage,%20it%20is%20essential%20to%20appropriately%20preserve%20the%20physical%20DNA%20molecules%20encoding%20the%20data.%20While%20preservation%20of%20biological%20DNA%20has%20been%20studied%20previously,%20synthetic%20DNA%20differs%20in%20that%20it%20is%20typically%20much%20shorter%20in%20length,%20it%20has%20different%20sequence%20profiles%20with%20fewer,%20if%20any,%20repeats%20(or%20homopolymers),%20and%20it%20has%20different%20contaminants.%20In%20this%20paper,%20nine%20different%20methods%20used%20to%20preserve%20data%20files%20encoded%20in%20synthetic%20DNA%20are%20evaluated%20by%20accelerated%20aging%20of%20nearly%2029%20000%20DNA%20sequences.%20In%20addition%20to%20a%20molecular%20count%20comparison,%20the%20DNA%20is%20also%20sequenced%20and%20analyzed%20after%20aging.%20These%20findings%20show%20that%20errors%20and%20erasures%20are%20stochastic%20and%20show%20no%20practical%20distribution%20difference%20between%20preservation%20methods.%20Finally,%20the%20physical%20density%20of%20these%20methods%20is%20compared%20and%20a%20stability%20versus%20density%20trade\u2010offs%20discussion%20provided.","label_id":"243109","label":0}],"msr_related_uploader":"","msr_attachments":[],"msr-author-ordering":[{"type":"text","value":"Lee Organick","user_id":0,"rest_url":false},{"type":"user_nicename","value":"Bichlien Nguyen","user_id":35942,"rest_url":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/microsoft-research\/v1\/researchers?person=Bichlien Nguyen"},{"type":"text","value":"Rachel McAmis","user_id":0,"rest_url":false},{"type":"text","value":"Weida D. Chen","user_id":0,"rest_url":false},{"type":"text","value":"A. Xavier Kohll","user_id":0,"rest_url":false},{"type":"text","value":"Siena Dumas Ang","user_id":0,"rest_url":false},{"type":"text","value":"Robert Grass","user_id":0,"rest_url":false},{"type":"text","value":"Luis Ceze","user_id":0,"rest_url":false},{"type":"user_nicename","value":"Karin Strauss","user_id":32587,"rest_url":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/microsoft-research\/v1\/researchers?person=Karin Strauss"}],"msr_impact_theme":[],"msr_research_lab":[199565],"msr_event":[],"msr_group":[901101],"msr_project":[212072],"publication":[],"video":[],"download":[],"msr_publication_type":"article","related_content":{"projects":[{"ID":212072,"post_title":"DNA Storage","post_name":"dna-storage","post_type":"msr-project","post_date":"2015-01-01 00:00:45","post_modified":"2022-12-22 16:36:38","post_status":"publish","permalink":"https:\/\/www.microsoft.com\/en-us\/research\/project\/dna-storage\/","post_excerpt":"Transitioned | This project enables molecular-level data storage into DNA molecules by leveraging biotechnology advances in synthesizing, manipulating and sequencing DNA to develop archival storage. Microsoft and University of Washington researchers are collaborating to use DNA as a high density, durable and easy-to-manipulate storage medium.","_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/212072"}]}}]},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item\/721579"}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-research-item"}],"version-history":[{"count":1,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item\/721579\/revisions"}],"predecessor-version":[{"id":721582,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-research-item\/721579\/revisions\/721582"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=721579"}],"wp:term":[{"taxonomy":"msr-content-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-content-type?post=721579"},{"taxonomy":"msr-research-highlight","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-research-highlight?post=721579"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=721579"},{"taxonomy":"msr-publication-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-publication-type?post=721579"},{"taxonomy":"msr-product-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-product-type?post=721579"},{"taxonomy":"msr-focus-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-focus-area?post=721579"},{"taxonomy":"msr-platform","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-platform?post=721579"},{"taxonomy":"msr-download-source","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-download-source?post=721579"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=721579"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=721579"},{"taxonomy":"msr-field-of-study","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-field-of-study?post=721579"},{"taxonomy":"msr-conference","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-conference?post=721579"},{"taxonomy":"msr-journal","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-journal?post=721579"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=721579"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=721579"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}