{"id":244409,"date":"2011-02-14T15:00:14","date_gmt":"2011-02-14T23:00:14","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=244409"},"modified":"2016-12-08T06:26:02","modified_gmt":"2016-12-08T14:26:02","slug":"faster-servers-services-flashstore","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/faster-servers-services-flashstore\/","title":{"rendered":"Faster Servers, Services with FlashStore"},"content":{"rendered":"<p><em>By Doug Gantenbein<\/em><\/p>\n<p>Memory has its faults\u2014and not only the human variety.<\/p>\n<p>Hard drives, for instance, can hold terabytes cheaply. But they\u2019re slow. Random-access memory (RAM) is fast but expensive, and data in RAM disappear the instant the power goes off. Flash memory is faster than hard drives and cheaper than RAM, and it retains information. But the way it handles data writes handicaps its usefulness in server environments.<\/p>\n<p>Still, the distinct advantages of flash\u2014particularly its ability to retain data when power is off, as well as its speed\u2014led two Microsoft Research scientists, <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/sudipta\/\">Sudipta Sengupta<\/a> and <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/jinl\/\">Jin Li<\/a>, to develop what they call FlashStore. It\u2019s a flash-based \u201cbridge\u201d between RAM and a hard drive that helps to overcome the handicaps of each while maximizing flash memory\u2019s capabilities and minimizing its weaknesses.<\/p>\n<p>FlashStore operates as a key-value store and uses a \u201ckey\u201d to access the \u201cvalue\u201d associated with each piece of a data record. It supports the operations of read, write, update, and deletion of such data records. 
A third collaborator on FlashStore, Biplob Debnath, was a research intern at Microsoft and now works for EMC, a network-storage and data-recovery firm.<\/p>\n<p>FlashStore has potentially significant usefulness across a range of computing applications, from server farms to cloud applications. It already has shown it can speed up online gaming for <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.xbox.com\/en-US\/live\/\" target=\"_blank\">Xbox LIVE<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> players, as well as data-intensive server applications.<\/p>\n<div class=\"imageFloatLeft\">\n<div id=\"attachment_199432\" style=\"width: 310px\" class=\"wp-caption alignleft\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-199432\" class=\"wp-image-199432 size-medium\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/02\/Headshot__0037_cropped_sudipta-sengupta-300x300.jpg\" alt=\"Sudipta Sengupta\" width=\"300\" height=\"300\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/02\/Headshot__0037_cropped_sudipta-sengupta-300x300.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/02\/Headshot__0037_cropped_sudipta-sengupta-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/02\/Headshot__0037_cropped_sudipta-sengupta-180x180.jpg 180w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/02\/Headshot__0037_cropped_sudipta-sengupta.jpg 360w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><p id=\"caption-attachment-199432\" class=\"wp-caption-text\">Sudipta Sengupta, Principal Research Scientist<\/p><\/div>\n<p>Flash memory sits conveniently in the huge gap between RAM and a hard disk in terms of both cost and performance. 
With its properties of low power consumption, physical ruggedness, and small size, flash has enabled new experiences with many consumer electronic devices for more than a decade. But it is only recently that flash is seeing widespread adoption in desktop and server applications, in the form of solid-state drives. New applications of flash involve different storage-access patterns from those in typical consumer devices and, because of flash\u2019s device properties, pose new challenges to its ability to deliver sustained high throughput and low latency.<\/p>\n<p>\u201cFlash is great technology, but it requires intelligent software to make the best use of it,\u201d says Sengupta, a researcher at <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/lab\/microsoft-research-redmond\/\">Microsoft Research Redmond<\/a>. \u201cThat\u2019s where our work comes in.\u201d Sengupta and Li have written several papers for technical conferences on intelligent software design for utilizing flash memory, one of which is <i><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/flashstore-high-throughput-persistent-key-value-store\/\">FlashStore: High Throughput Persistent Key-Value Store<\/a><\/i><em>, which appeared in the 36th International Conference on Very Large Data Bases in September 2010.<\/em><\/p>\n<h1>Peculiarities of Flash Memory<\/h1>\n<p>In FlashStore, flash \u201csits\u201d between a hard drive and RAM, acting as a high-speed holding area for frequently used data. FlashStore works by overcoming flash memory\u2019s drawbacks. These relate to the way in which flash stores and manages data. In flash memory, data is not as easily overwritten as it is on a hard drive. Flash memory starts in an erased state, then collects data in page-sized units at a time, where a page can vary in size from 512 bytes to eight kilobytes, depending on the device.<\/p>\n<p>But if you need to erase data, you can\u2019t do it a page at a time. 
Flash erases data only in larger units called erase blocks, each of which typically consists of 32 to 64 data pages. Li compares the way flash works to a line of water buckets.<\/p>\n<div class=\"imageFloatLeft\">\n<div id=\"attachment_303515\" style=\"width: 243px\" class=\"wp-caption alignleft\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-303515\" class=\"wp-image-303515 size-medium\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/10\/Jin-Li-233x300.png\" alt=\"Jin Li\" width=\"233\" height=\"300\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/10\/Jin-Li-233x300.png 233w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/10\/Jin-Li.png 310w\" sizes=\"auto, (max-width: 233px) 100vw, 233px\" \/><p id=\"caption-attachment-303515\" class=\"wp-caption-text\">Jin Li, Partner Researcher Manager of the Cloud Computing and Storage (CCS) group in Microsoft Research \u2013 Technologies.<\/p><\/div>\n<p>\u201cThe write operation is like pouring water into a series of buckets,\u201d says Li, a principal researcher with Microsoft Research Redmond\u2019s <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/communication-and-collaboration-systems\/\">Communication and Collaboration Systems<\/a> group. \u201cThe cheapest flash devices are designed so that if you need to erase a bank\u2014the block\u2014you have to drain the water from all the buckets. You can\u2019t drain just one bucket.\u201d<\/p>\n<p>To handle data writes and reads, the memory controller in a flash-memory device uses a mapping system called the Flash Translation Layer (FTL), which maps the logical page address of the incoming data to the actual physical page location of the data in flash memory. 
If a page of data is rewritten, it is written to a new location with an updated FTL entry.<\/p>\n<p>During a process known in computing circles as \u201cgarbage collection,\u201d the flash controller takes valid data from a memory block, as well as new data, and writes both to a new memory block. The freed memory block then can be erased for storing new data\u2014in effect, draining all the buckets in one memory block. But this process isn\u2019t efficient, in large part because of the requirement to reclaim space from pages that are rendered invalid by such operations. In time, repeated erase cycles also degrade the life expectancy of each flash memory block.<\/p>\n<p>Flash also writes only to pages\u2014nothing smaller. A write that is smaller than the page size, typically 512 bytes to eight kilobytes, makes poor use of that page. Any attempt to write a small chunk of data within a page means the controller will have to move the existing data to another page, along with the new data.<\/p>\n<p>Flash works fine on consumer electronic devices such as cameras and digital music players, where each photo or music file is at least megabytes in size, and data are written to flash mostly sequentially.<\/p>\n<p>But flash performance drops drastically when data writes are random, which is the case for many desktop computing and server applications. Even today\u2019s high-performance, flash-based disk-replacement devices experience sharply higher access latencies and lower throughput when performing random write operations.<\/p>\n<p>Then there is the cost aspect of flash memory, which is about 10 times more expensive than hard disk per GB\u2014and about 10 times less expensive than RAM. 
Flash makes most sense for applications that can identify and exploit the sweet spot between cost and performance.<\/p>\n<h1>FlashStore Efficiency<\/h1>\n<p>To make flash work efficiently enough to be more attractive on the price-performance tradeoff for heavy-duty server and cloud computing, FlashStore\u2019s creators had to create a more flash-friendly data structure while avoiding random data writes to flash. They did that by giving FlashStore three important design features.<\/p>\n<p>First, FlashStore introduces flash as a cache in the memory hierarchy between RAM and hard disk so that the flash memory holds a \u201cworking set\u201d of data\u2014information a computer is most apt to need. FlashStore tracks data use so that, as data becomes less frequently accessed, it eventually is sent back to the hard drive so that new, fresh data can take its place. That makes better use of flash while still getting much faster retrievals than are possible with a hard drive.<\/p>\n<p>Second, FlashStore is designed to eliminate random writes. It organizes data in a log structure on flash so that new data sent to flash does not lead to random writes and, hence, is not subject to garbage collection by the device. FlashStore aggregates small writes from the application into a write buffer in memory and writes to flash when there is enough data to fill a flash page. By making writes sequential to flash, FlashStore makes much better use of flash-device architecture. If the application needs strict data-durability guarantees, FlashStore can aggregate writes in memory up to a timeout interval provided by the application or when a flash page worth of data is amassed, whichever comes first.<\/p>\n<p>Finally, FlashStore is designed to use RAM efficiently by employing a specialized RAM index to access the data on flash. 
It uses a hash-table-based index in RAM that is designed to save space along both the vertical and horizontal dimensions\u2014the number of slots and the size of each slot, respectively. To reduce the number of slots, it uses a variant of cuckoo hashing that achieves high hash-table load factors while keeping lookup times fast. To reduce the size of each slot, it stores a compact key signature of about two bytes in each slot instead of the full key, which could be tens of bytes or larger, depending on the application. Each slot also stores a pointer, typically four bytes, that points to the full key-value pair record on flash. During key access, the pointer to flash is followed only if the signature in the hash-table slot matches that of the key being looked up. These techniques keep the RAM usage frugal\u2014at about six bytes per key-value pair, independent of the key-value pair size\u2014and key-access times fast, involving at most one flash read per lookup on average.<\/p>\n<p><em>In continuing work on a system called\u00a0<\/em><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/skimpystash-ram-space-skimpy-key-value-store-on-flash\/\">SkimpyStash<\/a><em>, a project that will be described in a research paper to be delivered during the 2011 conference of the Association for Computing Machinery\u2019s Special Interest Group on Management of Data, the Microsoft Research scientists have reduced the memory requirements of FlashStore even more, about six-fold, to about one byte per key-value pair. SkimpyStash reduces memory usage by trading off key-access time: a lookup may require multiple flash reads, which is still orders of magnitude faster than hard-disk lookup times. The tradeoff between memory usage and key-lookup time in SkimpyStash is adjustable and can be set by the application using a simple parameter.<\/em><\/p>\n<h1>FlashStore Performance<\/h1>\n<p>FlashStore\u2019s results are impressive. 
Server systems using FlashStore can show data throughput as much as several tens of times higher than that of an ordinary RAM\/hard-drive configuration running existing software that is not flash-aware, such as a key-value store like Berkeley DB. When compared with simple hard-disk replacement with flash but no changes in software, as in using a flash-unaware key-value store, FlashStore demonstrated a several-fold improvement in performance\u2014underscoring the impact of using flash-aware data structures and algorithms in FlashStore. It could enable the creation of much less expensive and more power-efficient computing designs. In some cases, Sengupta says, system designers buy arrays of hard disks that remain only partially filled with data, because they need additional disk heads for higher input\/output (IO) operations per second and not for disk space. FlashStore can be used to absorb the IO-intensive operations at the flash layer, while leaving the hard drive to handle operations involving large data sizes that amortize disk-seek time overheads.<\/p>\n<p>\u201cYou could replace 10 to 20 hard drives with one flash drive using FlashStore for such applications,\u201d he says. \u201cThat gives you a capital-expenditure savings, power savings, and operational-expenditure savings, as well, and you also get much faster throughput: an all-win situation on the three metrics of price, power, and performance.\u201d<\/p>\n<h1>Applications That Benefit<\/h1>\n<p>FlashStore is showing its potential in practical applications. Xbox LIVE Primetime <i>1 vs. 100<\/i>, for example, operates in a challenging computing environment\u2014when a player initiates actions within a game environment, those actions must be logged and communicated to all other participants in a timely manner to maintain the responsiveness of the gaming experience. Game activity can scale up or down rapidly, depending on the number of online players. 
A game\u2019s ability to track and update thousands of player actions relies heavily on the back-end system\u2019s ability to map what the players are doing and retrieve changes in game play.<\/p>\n<p>The Xbox LIVE Primetime <i>1 vs. 100<\/i> back end uses database servers running on hard drives to log and update game-state changes. But, Sengupta says, this results in a lot of redundant hardware\u2014and latency issues as the drives retrieve and disseminate data. A flash-based system would be ideal, he says, because flash\u2019s speed could accelerate and improve the responsiveness of online game play. It also could hold relevant data until it is no longer needed, then send it to a hard drive for post-game analytics.<\/p>\n<p>The FlashStore team replayed Xbox LIVE Primetime <i>1 vs. 100<\/i> game traces to test whether FlashStore could boost performance. It did\u2014performing operations 60 times faster than a standard RAM\/hard-drive configuration and five times as fast as a flash-unaware key-value store running on a top-of-the-line commercial flash-based drive.<\/p>\n<p>FlashStore also achieved significantly better results than a RAM\/hard-drive configuration or a flash drive when tested with the task of performing data-chunk indexing for data deduplication, a method of reducing storage-capacity needs by eliminating redundant data. The FlashStore research team also has built a complete flash-assisted-storage deduplication system, called ChunkStash. 
The team reported its design and evaluation in a research paper entitled <i><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/chunkstash-speeding-up-inline-storage-deduplication-using-flash-memory\/\">ChunkStash: Speeding Up Inline Storage Deduplication Using Flash Memory<\/a><\/i>, presented in 2010 during the USENIX Annual Technical Conference.<\/p>\n<p>A FlashStore-like application also might find utility in areas such as ad-sponsored online searches, where relevant advertisements need to be summoned in response to certain queries with tight time constraints on advertisement selection and ranking. While RAM-based data processing is a choice for such latency-sensitive applications, a flash-based solution such as FlashStore could provide acceptable performance in many cases at a fraction of the hardware cost.<\/p>\n<p>Flash memory has come a long way from its beginnings in consumer electronic devices such as digital cameras and portable music players. It is seeing widespread deployment in desktops and servers, spanning consumer, enterprise, and cloud applications. These developments present the computing industry with opportunities to identify new applications for flash memory\u2014as well as challenges across the software and hardware stacks to get maximum performance at lowest cost.<\/p>\n<p>\u201cThat,\u201d Sengupta says, \u201cshows the relevance and timeliness of our efforts.\u201d<\/p>\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>By Doug Gantenbein Memory has its faults\u2014and not only the human variety. Hard drives, for instance, can hold terabytes cheaply. But they\u2019re slow. Random-access memory (RAM) is fast but expensive, and data in RAM disappear the instant the power goes off. 
Flash memory is faster than hard drives and cheaper than RAM, and it retains [&hellip;]<\/p>\n","protected":false},"author":39507,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[],"msr_hide_image_in_river":0,"footnotes":""},"categories":[194475,194476],"tags":[206690,187216,206678,206681,206684,206687,187150],"research-area":[13552],"msr-region":[256048],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-244409","post","type-post","status-publish","format-standard","hentry","category-database-data-analytics-platforms","category-devices-and-hardware","tag-chunkstash","tag-flash-memory","tag-flashstore","tag-hard-drive","tag-ram","tag-skimpystash","tag-xbox-live","msr-research-area-hardware-devices","msr-region-global","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[199565],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-events":[],"related-researchers":[],"msr_type":"Post","byline":"","formattedDate":"February 14, 2011","formattedExcerpt":"By Doug Gantenbein Memory has its faults\u2014and not only the human variety. Hard drives, for instance, can hold terabytes cheaply. But they\u2019re slow. Random-access memory (RAM) is fast but expensive, and data in RAM disappear the instant the power goes off. 
Flash memory is faster&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/244409","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/39507"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=244409"}],"version-history":[{"count":9,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/244409\/revisions"}],"predecessor-version":[{"id":333554,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/244409\/revisions\/333554"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=244409"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=244409"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=244409"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=244409"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=244409"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=244409"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=244409"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/
en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=244409"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=244409"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=244409"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=244409"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}