Home > Data classification methods and guidelines
Special Report:
EMAIL THIS

Data classification methods and guidelines

27 Apr 2009 | SearchStorageUK

Enterprise IT tips and expert advice
Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google

The data holdings of organisations regularly extend to many terabytes of individual files that have accumulated over years of often unplanned or semi-planned activity. Companies frequently don't have data classification methods and/or guidelines in place that allow them to know what data they hold and where it's located. This can have a number of deleterious effects, such as:

  • Finding files can become a difficult task, which can impact an organization's ability to gain the full benefits of the knowledge accumulated in its data.
  • Legal ramifications can result from not being able to quickly find or produce documents for a court hearing.
  • Security might be compromised if data isn't correctly matched to staff access profiles.
  • You could be spending more money than required if data isn't matched to storage media of the appropriate cost per GB. Data classification allows low-priority data to be moved from high-performance storage systems to secondary media, for example.

The benefits of data classification

Data classification projects and products help businesses discover what data they hold, where it's located, who has access to it and how long it must be retained, as well as what level of protection it should receive and on what type of media it should be retained to meet legal and regulatory requirements. It can also enable a business to gain intelligence from data it would otherwise be ignorant of.

Data classification isn't necessarily a complex process, but it's one that should be well planned. Potentially large amounts of data have to be discovered and matched to a wide range of business, legal and compliance needs.

Editor's Tip: Simon Johnson, data recovery practice lead at GlassHouse Technologies UK, discusses why businesses embark on data classification projects and the benefits to be gained from such an endeavor. Listen to FAQs on data classification to learn more about the available data classification processes and how to avoid the key pitfalls of data classification projects.

Know why you're classifying data

Business need and risk tolerance should be the drivers behind a data classification project. It shouldn't be an IT-only project. The first question to ask should be "What do we want to achieve?" Reasons can include legal compliance, the ability to respond quickly to legal discovery requests, making data more accessible to users to aid business activity, restricting data to users to whom access is inappropriate, moving data to storage media that's most appropriate for its frequency of use and ease of access, and freeing up high-performance storage systems by offloading secondary data.

The best way to formulate answers to these questions is to initiate a discussion that involves representatives from those parts of the business that have a stake in corporate data – namely the board, IT, finance, the various business departments and company lawyers.

Once you understand why you're undertaking the project, you can create a data classification policy and begin the process. This can be a long and involved project, in large part because it's manual in nature. There are software products that can help with data classification and lifecycle management, but at the key early stages of the process no tool can tell you what's important to your business. You'll get those answers from discussions with staff from key areas of the organisation.

Experts recommend coming up with a set of classifications of data type and then tackling one or two of them. By dealing with one type of data and knowing what you want from it – retention time, storage media, access rights, etc. – you can begin to make some project wins and get some policies in place.

Data classification products

Once an organisation has decided what it wants from the data classification exercise, tools can help locate and organise data on the basis of user-defined rules.

Editor's Tip: A data classification product should locate and organize data based on user-defined rules. It should also help move and migrate data to the appropriate storage subsystem. Read how to evaluate a data classification product for more information.

Many tools allow organisations to discover the data they hold and its location, and to then apply classification rules to this information. Metadata can be applied to files, making them more manageable and searchable, and allowing data to be migrated to the appropriate tier.

Metadata created in the classification process can enable searches based on values that go much further than standard attributes such as filename or creation date. Such information can aid in compliance or business intelligence searches.

Data classification products are often limited in their abilities. Today's tools can work well in their own niche, but no one product does everything. As mentioned earlier, no product can determine the value of specific data types to your business, so there's always a good deal of manual work involved.

Decide what you need from a tool and then look for the ones that fit those needs. You should be prepared to go through in-house testing to determine whether the data classification tool can do what you need it to do using the data in your environment.



Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google



RELATED CONTENT
Email archiving and compliance initiatives
Mimosa NearPoint email archiving software cuts PST file sprawl at BAM Nuttall
Symantec plans Data Insight software to link storage resources with data owners
Data storage compliance in the UK
Iron Mountain Digital spends $112 million on Mimosa Systems for on-premise data archiving
Training and education key to IT compliance, says Hillingdon's Bearpark
Microsoft Exchange 2010 adds email archiving and high availability features
Dexrex Gear offers cloud instant messaging and social media data archiving
EMC lays out data archiving and eDiscovery plans
Data storage management case studies from SearchStorage.co.UK
The difference between data backup and data archiving

Tools for managing data
Symantec plans Data Insight software to link storage resources with data owners
Tiered storage: A look at internal and external tiered storage models
How to use flash-based SSDs in your environment
The green data centre: Business best practices
Green storage essentials: Addressing power, cooling and space issues
Improve storage utilization rates with storage optimization, capacity reduction techniques
How to purchase and manage your data storage capacity
What is thin provisioning and how does it work?
EMC upgrades Symmetrix V-Max arrays, thin provisioning
Avoid data migration project failure: Five best practices

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
Data Protection Act 1998  (SearchStorageUK.com)
Freedom of Information Act 2000  (SearchStorageUK.com)
Information Commissioner's Office (ICO)  (SearchStorageUK.com)
MiFID  (SearchStorageUK.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary




Data Backup Solutions for UK - Data Reduction, Data Deduplication, Tape Storage
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2008 - 2010, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts