Preview only show first 10 pages with watermark. For full document please download

Enterprise Ssd Requires Ras-handout

   EMBED


Share

Transcript

STORAGE SWITZERLAND ENTERPRISE SSD REQUIRES RAS (RELIABILITY, AVAILABILITY AND SERVICEABILITY) George Crump, Senior Analyst Today Solid State Disk (SSD) is viewed by most storage managers as a niche solution to solve specific storage Some systems, to address the flash reliability issue, use single-level cell (SLC) technology which writes only a performance problems. For SSD to move out of the niche single bit of data per memory cell and typically supports category and become a more mainstream storage consideration for most of the active data in the enterprise, about 100,000 erase cycles. This provides the highest level of erase cycles but does so at a premium price point, it must address the reliability, availability and serviceability (RAS) concerns that those managers have about the the factor that has typically relegated SSD to the role of a niche performance solution. At the other end of the erase technology. Solid-State-only storage systems, also called cycle curve is multi-level cell (MLC) technology which “Sustainable Storage”, like those from Nimbus Data Systems, are addressing this requirement. writes two bits of data into each cell and is typically rated at 5,000 erase cycles. With continuing reduction in NAND Enterprise SSD Requires Reliability manufacturing from 3x to 2x NAND (such as 25 nm), maintaining erase cycles in MLC becomes even more challenging, reducing the longevity to 1,000 to 3,000 When a flash memory cell is written to, it must first be cleared of old data. This process includes reading the cell cycles. While the additional bit per cell increases capacity and reduces the cost of the SSD, the significantly lower to confirm that it actually contains old data, and then writing ‘empty data’ (0’s) to every bit in the cell. This is erase cycles means a higher probability for failure. To provide high reliability at an acceptable cost, Enterprise called an “erase cycle” and is the cornerstone in flash MLC (EMLC), which is similar in capacity to MLC, has reliability concerns as every cycle shortens the life expectancy of the memory. The solid state memory that is emerged, providing a significantly higher rating of over 30,000 erase cycles which, especially for sustainable installed in enterprise-class sustainable storage systems must accommodate a very high number of these erase storage systems, may strike a perfect balance. cycles in order to maintain reliability. [1] Sustainable Storage Has Different Write Patterns as write I/O is evenly balanced across the system. This is where the flash controller plays a vital role. Today the most common enterprise uses of solid state Intelligent Flash Controller storage are in very high I/O or high transaction environments, like a cache. Caches are small, keeping Having the right type of flash for the workload is important, costs down, but typically have a very high turnover rate, but having a controller to exploit the advantages of that meaning that they’re constantly being emptied (erased) and flash and minimize its weaknesses is equally important. As refilled (written to). Cache is dynamic and its contents will mentioned above, one of the key responsibilities of the change to reflect the data being accessed at that moment in time. Its relatively small size means that the same cells flash controller is to spread out the write pattern to the flash memory so that each cell is written to evenly. This are written to over and over again. As a result, in order to get a reasonable life expectancy out of these devices capability, known as “wear leveling”, makes sure that one cell does not get written to over and over causing it to wear requires the extreme endurance of SLC flash memory. out prematurely. The more available memory cells per flash Sustainable storage on the other hand has all the data on controller, which is the case with sustainable storage, the more distributed that writing can be and the longer the life solid state storage, a storage area which is comparatively large and consistent. Since the sustainable storage system expectancy of each individual cell will be. In addition, in the case of a cell failure, the excess cell capacity, known to is where the data lives, as opposed to being a temporary insiders as “overprovisioning”, makes it easy to find an area like cache, the data stored is not constantly erased and rewritten. As a result, an intelligent controller can alternate location to write the data. Sustainable storage systems, like the Nimbus S class, have 28% spread those writes out across more flash memory modules, decreasing the amount of erase cycles overprovisioned spare capacity to accommodate this failover built into the system. Think of it as a hot swap drive consumed per cell. Therefore while enterprise solid state without having to replace the drive. In sustainable storage storage needs higher levels of reliability than MLC it does not need the extreme reliability of SLC. EMLC modules like systems, this extra capacity is transparent and does not take away from the advertised storage capacity of the those used in Nimbus’ S-Class sustainable storage systems provide the perfect balance of affordability and system. reliability. Observed another way, assume a 10TB Intelligent flash controllers can reduce the time required for Sustainable Storage System, such as the Nimbus S-Class. With 30,000 erase cycle NAND throughout the system, one the above erase cycle to take place. During this process the controller will pre-scan the flash memory to remove old could write 10TB of data 30,000 times, or 300,000 TB, before the NAND cells would wear out. That’s 165 TB of data so that when a write occurs, half of the erase cycle process will have already been performed. If this process, data written per day, 365 days per year, for 5 years. Most IT often called “garbage collection”, can stay ahead of managers would agree that this level of reliability is ample in an enterprise applicable. However, with 25 nm MLC inbound writes, then performance will be consistent as an application writes data to the storage. If not, the application rated at 3,000 erase cycles, this write load would burn out all the NAND memory in a mere 6 months. Thus, unless can experience performance slowdowns and may actually appear to pause. Many traditional SSDs without customers are expecting writing more than 16x the overprovisioned capacity suffer noticeable performance capacity of their storage array in fresh data per year, for over 5 years, EMLC provides adequate protection, as long degradation as they fill up with data, and in the enterprise, predictable performance is essential, making overprovisioning an essential must-have feature. [2] The amount of contiguous memory available also impacts how quickly the garbage collection process can complete. provide protection from a component failure. This means multiple paths to the storage network infrastructure, Similar to how disk defragmentation takes longer as a disk redundant power, cooling and redundant controllers. It also fills up, garbage collection takes longer when there is less flash memory available to erase and reuse. The advantage should provide redundant paths to flash modules themselves so that data access can continue despite a of sustainable storage, since it is 100% solid state, is that there is typically more addressable memory per flash drive path failure. controller than in conventional hybrid systems. The Reliability of Power Consumption Flash RAID One of the well-known benefits of a solid state storage Even with the reliability of the NAND components and the system is its power efficiency. This is where the sustainable storage moniker is derived. Efficient power consumption intelligence of the flash controller, there is still a slight however is often thought of only in terms of dollars saved chance that a component may fail. If that occurs, enterprise-grade solid state storage systems should be through lower electric bills. While that is certainly a positive, increased power efficiency has another benefit as well. protected by RAID technology similar to that used in mechanical hard drive-based systems. A solid-state-only According to a 2007 study by Google, who ranks among the largest consumers of storage in the world, heat and system, as it does with other storage challenges, makes vibration are leading causes of drive failure in mechanical RAID better. RAID algorithms are struggling to keep pace with the ever increasing capacity of mechanical hard systems. Sustainable storage systems run significantly cooler and have no moving parts other than the fans, so drives. Today when a drive fails it can take double-digit hours, if not days, for a rebuild to complete. A comparable there is no drive vibration to reduce drive life expectancy. This means the two biggest culprits in mechanical drive sustainable storage system, based on a multitude of flash failure simply don’t exist in a solid-state-only storage blades, typically rebuilds in less than an hour and is less impacted by capacity like legacy HDDs are. system. Enterprise IT managers expect their storage systems to Flash RAID also provides system administrators flexibility in provide near 100% uptime. A concern about implementing RAID protection levels because performance between the solid state storage in those environments has traditionally different RAID levels is virtually identical. This allows them to choose the greater redundancy of double or even triple been its reliability. A solid-state-only storage system, like the Nimbus S-Class, not only addresses the concerns parity and see practically zero performance impact. specific to solid state storage it can actually address some of the traditional reliability issues that plague mechanical System Redundancy drives, such as RAID rebuild times and heat and vibration All of this redundancy to provide data protection is useless related failures. Solid-state-only storage should be considered not only because of its obvious performance if the system itself fails. As is the case with conventional enterprise storage systems sustainable storage needs to advantages but also for its ability to improve reliability, availability and serviceability. About Storage Switzerland Storage Switzerland is an analyst firm focused on the virtualization and storage marketplaces. For more information please visit our web site: http://www.storage-switzerland.com Copyright © 2011 Storage Switzerland, Inc. - All rights reserved [3]