{"id":978618,"date":"2023-10-26T09:00:00","date_gmt":"2023-10-26T16:00:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=978618"},"modified":"2023-10-25T07:21:43","modified_gmt":"2023-10-25T14:21:43","slug":"project-silica-sustainable-cloud-archival-storage-in-glass","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/project-silica-sustainable-cloud-archival-storage-in-glass\/","title":{"rendered":"Project Silica: Sustainable cloud archival storage in glass"},"content":{"rendered":"\n

This research paper was presented at the <\/em><\/strong>29th<\/sup> ACM Symposium on Operating Systems Principles<\/em><\/strong> (opens in new tab)<\/span><\/a> (SOSP 2023), the premier forum for the theory and practice of computer systems software.<\/em><\/strong><\/p>\n\n\n\n

\"SOSP<\/figure>\n\n\n\n

Data growth demands a sustainable archival solution<\/h2>\n\n\n\n

For millennia, data has woven itself into every facet of our lives, from business and academia to personal spheres. Our production of data is staggering, encompassing personal photos, medical records, financial data, scientific insights, and more. By 2025, it’s estimated that we will generate a massive 175 zettabytes of data annually. Amidst this deluge, a substantial portion is vital for preserving our collective heritage and personal histories.  <\/p>\n\n\n\n

Presently, magnetic technologies like tape and hard disk drives provide the most economical storage, but they come with limitations. Magnetic media lacks the longevity and durability essential for enduring archival storage, requiring data to be periodically migrated to new media\u2014for hard disk drives, this is every five years, for magnetic tape, it\u2019s around ten. Moreover, ensuring data longevity on magnetic media requires regular \u201cscrubbing,\u201d a process involving reading data to identify corruption and fixing any errors. This leads to substantial energy consumption. We need a sustainable solution, one that ensures the preservation of our digital heritage without imposing an ongoing environmental and financial burden.<\/p>\n\n\n\n

Project Silica: Sustainable and durable cloud archival storage<\/h2>\n\n\n\n

Our paper, \u201cProject Silica: Towards Sustainable Cloud Archival Storage in Glass, (opens in new tab)<\/span><\/a>\u201d presented at SOSP 2023 (opens in new tab)<\/span><\/a>, describes Project Silica, a cloud-based storage system underpinned by quartz glass. This type of glass is a durable, chemically inert, and resilient low-cost media, impervious to electromagnetic interference. With data\u2019s lifespan lasting thousands of years, quartz glass is ideal for archival storage, offering a sustainable solution and eliminating the need for periodic data refreshes.<\/p>\n\n\n\n

Writing, reading, and decoding data<\/h2>\n\n\n\n

Ultrafast femtosecond lasers enable the writing process. Data is written inside a square glass platter similar in size to a DVD through voxels<\/em>, permanent modifications to the physical structure of the glass made using femtosecond-scale laser pulses. Voxels encode multiple bits of data and are written in 2D layers across the XY plane. Hundreds of these layers are then stacked in the Z axis. To achieve high write throughput, we rapidly scan the laser pulses across the length of the media using a scanner similar to that used in barcode readers. <\/p>\n\n\n\n

To read data, we employ polarization microscopy to image the platter. The read drive scans sectors in a single swift Z-pattern, and the resulting images are processed for decoding. Different read drive options offer varying throughput, balancing cost and performance.<\/p>\n\n\n\n

Data decoding relies on ML models that analyze images captured by the read drive, accurately converting signals from analog to digital. The glass library design includes independent read, write, and storage racks. Platters are stored in power-free storage racks and moved by free-roaming shuttles, ensuring minimal resource consumption for passive storage, as shown in Video 1. A one-way system between write racks and the rest of the library ensures that a written platter cannot be over-written under any circumstances, enforcing data integrity.<\/p>\n\n\n\n

\n