Home > Data Storage Tips > > Data reduction techniques for primary storage
Storage UK Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 


Data reduction techniques for primary storage


George Crump, Contributor
08.17.2009
Rating: --- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


Over the past few weeks, we've addressed how to develop a data reduction strategy for your customers' backup tier and archive tier. The final area of focus is primary storage, and this is where it really gets interesting. Performance matters here -- more than space savings. The benefits of data reduction on primary storage have to be carefully measured against its impact on application performance. With few exceptions -- which we talk about below -- most primary storage data reduction products impact performance.

Most products that focus on reducing data on primary storage classify that data into two types: the very active data set (databases and files that are currently being edited), which constitutes a very small percentage of the overall storage environment; and data that is not in use, which constitutes the bulk of primary storage.

More on data reduction
How to develop a backup data reduction strategy for customers

The only practical way to reduce the footprint of active data on primary storage today is to use an inline compression appliance, like the type offered by Storwize. Despite perception to the contrary, with inline compression, you can reduce the size of the data set with little or no performance impact on most operations.

Unlike with archive or backup data, primary storage has almost no duplication of data, so deduplication here has limited value, except on virtual machine OS images. (NetApp's deduplication extension, formerly called A-SIS, is effective at reducing the size of the virtual machine footprint.) Beyond that there are exceptions, but finding the duplicate data requires specific knowledge of file formats and processing time for analysis. Ocarina Networks' Optimizer appliances have a post-process crawl technique that works well here, without impacting performance. They can compress and deduplicate and store the reduced file in place or subsequently move it to a secondary tier of storage.

That brings us to the most effective means of data reduction on primary storage: Get rid of inactive data by archiving it. Seriously, if 80 to 90 percent of the data on primary storage remains unchanged and unaccessed, what is it doing on your customer's most expensive tier of storage? It's there because most users fear the process of moving it to a less expensive tier; that's where you come in with an effective data reduction strategy that covers all tiers of storage.

For those not using Ocarina's Optimizer, the identification of that data can be made easy with products from companies like Tek-Tools or Aptare.

Once identified, it can be manually moved to the secondary tier of storage,¬ such as a disk-based archive. This is very cost-effective and simple. That's because most disk archives show up as a NAS mount point, and moving data to and from them is as simple as a copy command. If a more automated move and recovery feature is required, there are plenty of mature tools to do that, and global file systems and archiving software have built-in retrieval capabilities.

Archiving has no impact on performance of the active data set. In fact, with less data on a storage system, its performance may improve. There's a chance that users will notice a performance loss when accessing data that has been migrated from primary storage to the archive tier, but that should happen rarely, and certainly that data reduction technique won't impact day-to-day work.

About the author

George Crump is president and founder of Storage Switzerland, an IT analyst firm focused on the storage and virtualization segments. With 25 years of experience designing storage solutions for data centers across the United States, he has seen the birth of such technologies as RAID, NAS and SAN. Prior to founding Storage Switzerland, George was chief technology officer at one of the nation's largest storage integrators, where he was in charge of technology testing, integration and product selection. Find Storage Switzerland's disclosure statement here.

Rate this Tip
To rate tips, you must be a member of SearchStorage.co.UK.
Register now to start rating these tips. Log in if you are already a member.




Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   



RELATED CONTENT
Data Storage Management Initiatives
Improving storage utilization with thin provisioning
Managing capacity planning with thin provisioning
Thin provisioning brings utilization and capacity benefits to data storage, but with a caveat
Fail-in-place systems: Avoiding hard disk drive failures
Data storage resources needed to implement a virtual desktop infrastructure
Maximizing your enterprise data storage capacity: Improve efficiency and utilization
VMworld 2009: Storage admins grapple with growing VMware deployments
Managing enterprise data storage more efficiently, Part 2: Reclaim storage and consolidate data
How to select a storage automation product
Unified Storage FAQ

Storage management for the enterprise
Future enterprise hard drive technology: Hard drive capacity over performance
Data center migration tips for SMBs
Performance metrics: Evaluating your data storage efficiency
Fail-in-place systems: Avoiding hard disk drive failures
Storage virtualisation can boost utilisation, simplify management
Change control management
IT efficiency through data classification, consolidation and control
Open-source storage explained
Three ways to add capacity to an existing environment
How to plan for a disaster-free storage Christmas

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary

DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



Data Storage Reports - Data Backup, Data Protection, Storage Hardware
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2008 - 2010, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts