Archive

Posts Tagged ‘Virtualization’

Data Protection Management from ‘Nice to Have’ to ‘Need to Have’

December 15th, 2009 Steve Kenniston No comments

Data protection management has come a long way in the past decade.  More importantly, the features and functionality found in today’s products, and that customers have come to expect, are no longer ‘nice to have’ in the data center; they are ‘need to have’.

Additionally, the term ‘data protection’ is morphing every day and has different meanings to different people.  Questions like ‘is replication data protection?’ or ‘is archive data protection?’ or ‘is DR / BC a function of protection?’ are now common in IT circles.  Each in their own right is a methodology for protecting information or has some play in the grand scheme of data protection.  The reality is, much like every answer in IT, the answer to these questions is ‘it depends’.  Data Protection has many different definitions, which start to expand the scope of what it actually is and more importantly, how it is managed cost effectively across the whole environment.

It is with this expanding scope of data protection that data protection management tools come into play, and the more flexible and granular the tool, the more effective it is.  It is hard to have good data protection capabilities without insight into the environment.  First, understanding what type of data lives in the environment, where it is, how it is used and some characteristics such as its age or access frequency helps to determine how best to protect the information.  This is where a data protection management tool that provides some insight into the file system adds a great deal of value.

Next, if archive is a part of data protection (and I would argue that a functional archive, when used properly, is) then a data protection management tool that provides insight into the data in the archive can also help manage the overall protection process within the greater environment.  Knowing if the data in the archive is actually being accessed or if it can be deleted (unless stored for compliance purposes) can help to control archive costs.

If replication is a part of the overall data protection scheme, a data protection management tool that provides insight into this process can also add a great deal of value.  Identifying if links are up, if data is moving between sites and if the data at the remote site is available, accessible and meets my recovery point objectives can ease the concern of recoverability in the event of a disaster.
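As a rough illustration of the recovery point check described above, here is a minimal sketch.  The function name, the timestamps and the one-hour objective are assumptions made up for the example; they do not come from any particular product.

```python
from datetime import datetime, timedelta


def rpo_status(last_replicated: datetime, now: datetime, rpo: timedelta) -> dict:
    """Compare the age of the newest copy at the remote site against the RPO."""
    lag = now - last_replicated
    return {"lag": lag, "rpo_met": lag <= rpo}


# Hypothetical values, for illustration only.
now = datetime(2009, 12, 15, 12, 0)
status = rpo_status(datetime(2009, 12, 15, 11, 15), now, timedelta(hours=1))
print(status["rpo_met"])  # True: the remote copy is 45 minutes old, inside a 1-hour RPO
```

A real tool would pull the last-replicated time from the replication software itself; the comparison is the part that matters here.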

And finally, a data protection management tool should provide as much information as possible, such as deduplication rates, tape growth and disk growth (in disk-based backup targets, including deduplication targets), as well as true analytics into the backup environment to help decide when to switch from a tape-based solution to a disk-based solution.  These analytics need to be in-depth enough to show what happens if some data that is being protected with traditional backup technologies is moved to a next generation solution, such as source-based deduplication: what effect will it have on the overall backup environment, will it help to better control costs, will it help to improve SLAs?

At a higher level, customers are telling me that they no longer want to manage backup; they just want it to work and they want proof it is working.  As customers move to a more virtualized IT infrastructure, they find that they are being forced to rearchitect their data protection environment and they are now looking to solutions that elevate the process.  IT is looking for tools to make their environment “data protection aware.”  As virtual machines are added to the environment, IT wants them to be protected automatically and wants notification if they are not, so it can mitigate any risk, and let’s face it, backup is all about risk mitigation.  Backup is insurance.  Wouldn’t it be nice if your insurance company had deeper insight into all the cars and drivers in your family, told you every month when your teenager was speeding, and warned you that your premiums would go up if they didn’t start driving the speed limit, before they got the ticket and your premiums increased?
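The “notify me when a virtual machine is not protected” idea boils down to comparing two lists.  The sketch below assumes you can export the VM inventory and the list of backup clients as plain names; the function and the VM names are hypothetical.

```python
from typing import Iterable, List


def unprotected_vms(inventory: Iterable[str], backup_clients: Iterable[str]) -> List[str]:
    """Return the VMs that exist in the inventory but have no backup client."""
    return sorted(set(inventory) - set(backup_clients))


# Hypothetical names, for illustration only.
inventory = ["vm-web-01", "vm-web-02", "vm-db-01"]
backup_clients = ["vm-web-01", "vm-db-01"]
print(unprotected_vms(inventory, backup_clients))  # ['vm-web-02']
```

Anything the function returns is a machine that would be missing on the day you need to recover it, which is exactly the risk a notification is meant to surface.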

Any tool that IT invests in for a common process, data protection in this case, needs to be flexible enough to allow IT to manage as much of the overall process as possible from a single pane of glass.  Good data protection management tools need to give IT as much visibility into the overall data protection environment as possible, to help make good decisions about which technologies to invest in so that IT can meet its overall SLAs and hence its business objectives.

There is no sense spending a great deal of money on rearchitecting a backup environment if there is no insight into the success of the new architecture.  Sooner or later, management needs to have the pretty graphs that prove to someone that the right decisions are being made when it comes to protecting information, how much is spent on data protection and whether the SLAs can be met.  Not having a good data protection management tool, and spending too much on new data protection architectures while not meeting your SLAs, could lead to an RGE (resume generating event).  Data protection management tools today are a need to have, not a nice to have.  Make the investment and put your data protection environment back on the Road to Recovery.

The Side Effects of Backup on Server Virtualization

September 14th, 2009 Steve Kenniston 2 comments

Server virtualization has changed the IT landscape dramatically.  It has become a magic potion curing a number of ills in the physical server world such as low individual CPU utilization and excess use of space, power and cooling in the data center.  However, like all potions that cure what ails you, there can be side effects.  You need to be careful of what the Witch Doctor orders.

When I speak with customers who have aggressively implemented a virtual server infrastructure, 9 out of 10 tell me that they underestimated the effect that virtualization would have on their backups and backup process.  When backup is not considered during the virtual server assessment and planning process, it can make virtualization less of the magic potion they had hoped for.  So what is the issue?  Backup is a virtualization bottleneck, and without addressing it, you may not be able to obtain the server consolidation ratios you had been expecting, which can have a negative effect on your virtual server TCO and ROI.

This is a timely discussion as VMworld has just concluded.  VMware users flocked to VMworld looking for best practices when it comes to implementing virtual server technology.  Because virtualization allows IT to reduce the overall physical hardware infrastructure, users will be looking at how to maximize their server consolidation ratios (get as many virtual servers on a physical server as they can and still provide good application performance).

I often hear that companies assess their environments by looking at the production applications on their physical server environment, identifying their workloads and translating that into some consolidation ratio of physical servers to virtual servers.  I also hear, from these same customers, that backup was never taken into consideration during the assessment phase when trying to identify the best possible consolidation ratios.  These customers implement their new virtual server environments, install the backup agent they had previously been using for physical server backups and attempt to back up their virtual servers, only to find that they can protect just 50% to 60% of the new environment.  Why?

Let’s look at the physics.  Let’s say your virtualization ratio is 12 virtual servers to 1 physical server.  Twelve physical servers back up with 12 NICs, 12 CPUs, 12 memory ‘chunks’, etc.  When you moved these 12 physical servers into the virtual world and put them on one physical server, did you put 12 NICs in the new physical server?  Did you put 12 CPUs in the new server?  Do you have 12x the memory?  Chances are, probably not.  However, the amount of data to protect didn’t change, did it?  So how could one expect backup, which is I/O, memory and CPU intensive, to perform well in a virtual world?

Figure 1 below shows that when you back up 12 servers, the resource drain on each server is roughly 25% (per system during a full backup).  When you virtualize these 12 servers onto one or two physical servers, your physical system utilization shoots up to 80%+.  This utilization can be so dramatic that it actually affects the number of virtual servers you can have on these systems, which can ruin your virtual server TCO / ROI.

Figure 1

Simple math dictates that unless you have all the same resources on your new physical server as you did on all your physical servers before the consolidation, you won’t get the same backup performance.  I have spoken with customers who aimed for a 25 to 1 virtual to physical server consolidation but were only able to reach 15 to 1 in reality, because their backup application couldn’t handle 25 virtual servers on one physical server without leaving some unprotected.
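To make the simple math concrete, here is a deliberately crude sketch.  It assumes backup load adds up linearly and measures the new host in “original server equivalents”; the `host_capacity` values and the 80% ceiling are illustrative assumptions, and real hosts will not behave this neatly.

```python
def backup_utilization(n_vms: int, per_backup_load: float, host_capacity: float) -> float:
    """Host utilization from backup alone if every VM runs a full backup at once.

    per_backup_load: fraction of one original server consumed by a full backup.
    host_capacity:   size of the new host, in original-server equivalents (assumed).
    """
    return n_vms * per_backup_load / host_capacity


def max_concurrent_backups(per_backup_load: float, host_capacity: float, ceiling: float) -> int:
    """How many full backups fit at once without exceeding a utilization ceiling."""
    return int(ceiling * host_capacity / per_backup_load)


# 12 servers, each drained about 25% by a full backup (the figure used in the post).
print(backup_utilization(12, 0.25, host_capacity=1.0))  # 3.0, i.e. 300% of a like-for-like host
print(backup_utilization(12, 0.25, host_capacity=4.0))  # 0.75 on a host four times as large
print(max_concurrent_backups(0.25, host_capacity=1.0, ceiling=0.8))  # 3
```

Even on the larger hypothetical host, backup alone consumes most of the machine, which is why the achievable consolidation ratio ends up lower than the one planned from production workloads only.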

People could argue that if you properly schedule each virtual machine to back up in a window when none of the other systems are backing up, then perhaps you could get by with traditional backup.  The flip side is, IT has been telling me they don’t want to manage the backup process any more than they have to.  So how do you ‘fix’ this problem?

The issue is that backup is a very I/O-intensive application, so there is only one way to fix the problem.  You need to reduce the amount of I/O generated and sent through the physical devices that house the virtual servers during backup.  Virtual servers were designed to provide a lot of benefits, but high I/O capability is not one of them.  (This is okay; every technology implementation has its tradeoffs.  When the positives outweigh the negatives, especially in a substantial way, as they do with virtual servers, you usually have a paradigm shift, and this is what we are seeing with virtual servers.)

So how do you change the I/O pattern of backup?  You do so by decreasing the amount of data that is utilizing the shared resources during backup.  There are a couple of ways to do this.  One way is to leverage the storage array and snapshot the data.  Snapshots allow you to make copies of virtualized server data, mount the snapshot on a proxy host and off-load the backups from the physical server that houses the virtual servers.  The downsides are:

1)      This becomes a new set of processes to manage unlike traditional backup processes

2)      You need extra storage capacity with this solution

3)      You will need to manage another physical server (proxy server)

4)      You will need more backup agents from your backup software provider

The most efficient way, however, is to take advantage of a new backup software application that leverages data reduction (data deduplication) on the client.  Your processes stay the same, there is no need for additional primary storage hardware and by leveraging a ‘smarter’ backup client, you will reduce the I/O tax on your physical server devices and thereby have the ability to maximize your TCO / ROI for your new virtual server environment.
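To show why deduplicating on the client reduces I/O, here is a minimal sketch of the idea.  It uses fixed-size chunks and an in-memory set standing in for the backup server’s index, both simplifications chosen for brevity; it is not how any specific product works, and commercial products may use variable-size blocks instead.

```python
import hashlib


def dedup_backup(data: bytes, server_index: set, chunk_size: int = 4096) -> list:
    """Return only the chunks the server has not seen, and record them in the index."""
    to_send = []
    for offset in range(0, len(data), chunk_size):
        chunk = data[offset:offset + chunk_size]
        digest = hashlib.sha256(chunk).hexdigest()
        if digest not in server_index:
            server_index.add(digest)  # the server now knows this chunk
            to_send.append(chunk)     # only new chunks cross the wire
    return to_send


server_index = set()  # stand-in for the server-side chunk index
first = dedup_backup(b"A" * 8192 + b"B" * 4096, server_index)
second = dedup_backup(b"A" * 8192 + b"C" * 4096, server_index)
print(len(first), len(second))  # 2 1
```

The first backup holds three chunks but sends two, because the two “A” chunks are identical; the second sends one, because only the “C” chunk is new.  Less data leaves the client, so less I/O hits the shared physical host.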

Additionally, a number of these technologies have additional offerings that truly make them next generation.  Backup licensing is slowly moving to a capacity-based license model.  One great feature of these new products is the fact that there is no charge for clients or agents.  This allows you to create a virtual server template with the backup agent embedded within it.  You no longer have to worry about proliferating backup clients and then paying for all those clients when it is time to ‘true up’ with your backup software vendor.  Data deduplication technologies also offer the ability to replicate the backup data efficiently to disk at a remote site so you can develop a more efficient disaster recovery plan that reduces the reliance on tape and increases your overall operational efficiency.

Regardless of which path you choose, each requires IT to rethink their backup strategies when it comes to protecting virtual server environments.

I encourage you to do two things as you consider moving to a virtual server infrastructure:

1)      Make sure you are thinking about data protection when architecting your new virtual server environment

2)      Check out some of the new technologies and best practices offered by vendors for protecting virtual servers.

Hopefully this will help put your virtual server world back on the Road to Recovery!

A Data Protection Tribute to Michael Jackson

July 7th, 2009 Steve Kenniston 6 comments

I was walking through the data center the other day when I heard one of my colleagues, MJ, “Scream”, “I wish I had some ‘Morphine’”.  Well, I have to say I was “Speechless”.  I walked over to where MJ was standing, near the tape library, and when I asked him what was wrong, he replied, “There was another backup tape ‘Jam’.”  MJ told me he had been “Working Day and Night” on a major backup problem and he was now bouncing “Off The Wall”.  He told me he was sick of dealing with traditional backup tools and just wanted to get rid of tape.  I told MJ that it was “Human Nature” to feel “Bad” in a time like this but I also told him, “You Are Not Alone”.  I said, “MJ, ‘Keep The Faith’, we all ‘Remember The Time’ when backups ran like a ‘Speed Demon’ and were ‘Unbreakable’, but that is ‘HIStory’; tape isn’t that fast any more given the amount of data we now have.”  I also told him, “We are Backup Administrators, we are ‘Invincible’ and ‘Heaven Can Wait’ for us, and while we may not have our issue fixed at the ‘Break Of Dawn’, we would ‘Come Together’ to ‘Heal The World’, or at least the datacenter” (I chuckled).  I proceeded to tell him about a revolutionary new backup concept utilizing source-based deduplication technology.  It’s “PYT”, a pretty young thing, but more importantly it’s here to stay.  EMC offers it with a product called Avamar, the most efficient variable block, source-based deduplication technology on the market that:

  • Helps to eliminate tape altogether
  • Is perfect for VMware environments
  • Protects remote offices most efficiently
  • Stems the tide of data growth on NAS platforms

Well, I thought MJ was going to give me “Trouble” for my comments.  I mean it, all of a sudden I had “Butterflies”.  I felt “Threatened” because I knew this guy could be a loose cannon when it came to trying something new; he could be “Dangerous”, he might moonwalk over to me and slap me with his glove.  Change can be scary.  But just then MJ let out a “Smile” (quite frankly I thought he was going to “Cry”) and said, “‘I Can’t Help It’, my job is ‘On The Line’ and I ‘Wanna Be Startin’ Somethin’’ soon, before my boss tells me to ‘Beat It’.”  He just felt “2 Bad”.  I told him, “‘Don’t Walk Away’, and ‘Whatever Happens’, ‘Billie Jean’ and I are going to help get you out of ‘Trouble’, and together we will replace the tape infrastructure, make backups run 10x faster, provide you with tools that actually verify your backups and make your backup problems ‘Ghosts’.”

I called Billie Jean and at first she said, “‘Leave Me Alone’, ‘Why You Wanna Trip On Me’”, but I told her we needed her help, so she said she could help MJ and me.  When she asked what the trouble was, I told her that our backup environment was in shambles and that if MJ didn’t get it fixed with the right solution, they were going to put MJ on a “Carousel”, there would be “Blood On The Dance Floor” and he would end up being “Someone In The Dark” “In The Closet”.  Billie Jean hopped on the phone and called “Dirty Diana”; we are all “Just Good Friends” really.  She told her the story and when it came right down to it, it really was “Black or White”.  We needed some “Money”, “2000 Watts”, to replace the old tape libraries with the new Avamar technology and “One More Chance” to fix all of MJ’s backup issues.

I told MJ the plan; we were going to sneak past the guards (that would be simple because “They Don’t Care About Us”) and then replace the old equipment with the new equipment.  MJ asked, “’Is It Scary’ in the datacenter at night?”  I told him we would be fine, that this would not be like his “Childhood” days.  MJ just said, “I Wanna ‘Rock With You’”.  The next night we snuck into the data center like a “Smooth Criminal”.  First, we had to “Get On The Floor” the new Avamar technology.  Next we installed Avamar and it fixed our backup problem right away.  I said, “Man ‘Is It Scary’ or what?”  “Another Part of Me” was just proud of the work we had all accomplished.

The next morning we went into the office of “Little Susie” and knocked on her door (it was always closed because she liked her “Privacy”).  She was MJ’s boss and she was no “Tabloid Junkie”; she was a real “Superfly Sister”.  She said, “‘Who Is It’?”  We told her and she let us in.  We showed her some reports we had generated from another product we acquired called Data Protection Advisor.  We showed her where all the previous backups had been failing due to problems with network performance, tape libraries and not enough time to back everything up.  Then we showed her that with Avamar we were backing up data in just 1 hour with 100% success because we were seeing 99.5% duplicate data in our NAS environment, and that was why we couldn’t meet our backup windows with tape.  We also showed her that our VMware environment could go from 10 to 20 virtual servers per ESX host because backup was no longer the bottleneck keeping us from implementing more virtual guests.  Well, she was pretty happy.  She said, “You Rock My World”, and she was not upset that the tape environment was “Gone Too Soon” because it was a true “Heartbreak”.  I told her it was a team effort and we couldn’t have done it without the help of a lot of people including EMC.  It was a real “Thriller”.

Paradigm Perturbations

April 23rd, 2009 Alan Atkinson No comments

Once upon a time (about 18 months ago actually) data protection was considered one of the most boring areas of the storage market.  If ever there was an area ripe for change (in fact, ripe for an entire paradigm shift), it was backup.  Well, ask and ye shall receive.  Data protection is now the most dynamic area in storage today.

In the interest of brevity, I’ll confine this discussion to just backup (although there’s a lot happening in replication too).  First, let’s start with tape.  For most companies, tape now comes in two very distinct flavors: real (the old fashioned kind) and virtual (really a disk library).  For the most part, the virtual kind of tape is eventually written out to real tape for vaulting (as well as for cost and long-term storage reasons).  Secondly, there’s deduplication.  Deduplication comes in many flavors, but the net effect is that less data is stored (sometimes, less data is moved over the wire as well).  Deduplication is complicated because not all data de-dups well.  It’s good to know which data does and which data doesn’t (by the way, this often depends on the deduplication solution being used).  Thirdly, there’s virtualization.  Now, virtualization is not a data protection technology; however, it is the impetus behind this inflection point for backing up data.  Virtualization basically destroys the old fashioned “back everything up to tape every night” backup strategy.  Why?  For starters, take the most I/O intensive process in your whole IT operation, backup, and layer it on top of the worst technology available for I/O performance, virtualization.  Also, in a virtualized environment, there is a lot more data because there are a lot more servers.  Additionally, this data has a ton of redundancy.  Lastly, virtual servers raise a huge number of configuration issues.  It’s not as simple as the old days when a server was really a server, and it was backed up to a physical tape.  If you don’t get this right, recovery can be unbelievably fun (e.g., sorting through tapes to figure out what data was where on a given day) (NOT!).

Enter Data Protection Management (DPM)…

DPM has been one of the fastest growing sectors in storage software for all the reasons stated above.  Put another way, backup is too expensive, too risky and too hard to properly manage.  These are the problems that DPM solves.  Most DPM products are focused on backup/recovery today; however, this is changing rapidly.  Vendors in the space are hearing from their customers that they want to manage the entire stack of data protection technologies, including replication, as simply, cheaply and with as little risk as possible.  Fundamentally, customers are telling me that they want to be able to apply one service level to critical production data, another to email and a third to less important generic user data.  These SLAs cover everything from recovery time objectives (RTO) to retention periods.  Customers want to be able to leverage old and new technologies (traditional tape backup and deduplication for example) to create an efficient, cost-effective data protection environment that meets their business requirements for availability.  Using DPM, you not only guarantee efficiency and utilization levels (thus eliminating the cost of purchasing more capacity than is needed), you also reduce risk and manage the configuration changes that occur in today’s hybrid virtual/physical environments.  Customers have told me that they are seeing payback of 12 months or less in real, hard dollars after implementing a DPM solution.  That’s real money.  As an added bonus, they also sleep better at night knowing that their data protection policies (especially policies associated with recoverability) have been rigorously enforced and that they can cleanly demonstrate this via flexible reports to anyone who cares.  To me, DPM goes hand in hand with all the new, emerging data protection technologies.  Given the payback period, one might wonder why every company hasn’t implemented DPM.  The surprising fact is that an ever increasing percentage have implemented DPM and are already reaping the benefits.
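The tiered service levels described above can be pictured as a small policy table plus a check against what the environment actually delivers.  The tier names follow the post, but the RTO and retention numbers, the class and the function below are hypothetical, chosen only to illustrate the idea.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ServiceLevel:
    name: str
    rto_hours: float      # recovery time objective
    retention_days: int   # how long copies must be kept


# Hypothetical tiers and numbers, for illustration only.
SERVICE_LEVELS = {
    "production": ServiceLevel("production", rto_hours=1, retention_days=365),
    "email": ServiceLevel("email", rto_hours=8, retention_days=90),
    "user_data": ServiceLevel("user_data", rto_hours=24, retention_days=30),
}


def sla_violations(tier: str, measured_restore_hours: float, configured_retention_days: int) -> list:
    """List the ways a data set falls short of the service level for its tier."""
    level = SERVICE_LEVELS[tier]
    problems = []
    if measured_restore_hours > level.rto_hours:
        problems.append("RTO missed")
    if configured_retention_days < level.retention_days:
        problems.append("retention too short")
    return problems


print(sla_violations("email", measured_restore_hours=10, configured_retention_days=120))
# ['RTO missed']
```

A report built from checks like this is the kind of evidence that lets someone demonstrate, rather than assume, that policies are being enforced.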

Posted by Alan Atkinson
