Critical.
Authoritative.
Strategic.
Subscribe to CIO Magazine »

Blog: How to Recover From Virtualization Disasters

Disaster recovery, to many people, means not much more than a hot site, but there is much more involved. What exactly is involved depends on how much money you have to put to the problem.

Fully redundant hot sites cost quite a bit in hardware, software, and licensing. At best, they should be exact duplicates of your current environment; at worst, they should be able to run your most important virtual machines.

However, this is not the only aspect of DR that should be considered. Disasters come in all sizes, from the small-scale application failure to the catastrophic natural disaster. Both of these are fairly well understood.

But what about the middle of the road business-continuity and disaster issues, which somewhere in between the extremes in the scope of disaster, but are specific to virtualization infrastructures: single machine failures, SAN failures, VM failures, etc.

For these there are a few tools, mostly from VMware that will help. VMware High Availability tops the list. But any VM-to-VM clustering service will also work to solve these issues.

To help with storage server issues there is also the LeftHand Networks VSA and Xtravirt XVS products. These products use local machine disk to mirror between the systems using software. This way if one system failed, the data is not lost. These technologies add increased redundancy to the software stack and can replace redundant SANs in smaller shops.

Even good backups add to this concept of redundancy by adding replication features (VizionCore vReplicator and Veeam Backup). These will allow you to replicate VMs from storage device to storage device and place VMs in locations where they are ready to power on at a moments notice. Which is another good way to keep things running if your SAN or NAS device fails.

VMware SRM works with various SAN and NAS devices to allow the SAN or NAS's own mirroring software to work better with virtualization.

As we put more and more VMs on a system we need to consider adding more and more redundancy into the systems. There are already some hardware solutions, like RAID Blade and RAID memory technologies; we have the ability to have redundant switching fabrics.

These software storage technologies add into the existing RAID level redundancy and expand them to include multiple systems.

While hot sites are the end goal for natural disasters, don't forget to plan for the middling disasters by increasing your local redundancy, using these or other tools.

Virtualization expert Edward L. Haletky is the author of "VMWare ESX Server in the Enterprise: Planning and Securing Virtualization Servers," Pearson Education (2008.) He recently left Hewlett-Packard, where he worked in the Virtualization, Linux, and High-Performance Technical Computing teams. Haletky owns AstroArch Consulting, providing virtualization, security, and network consulting and development. Haletky is also a champion and moderator for the VMware discussion forums, providing answers to security and configuration questions.

Join the CIO Australia group on LinkedIn. The group is open to CIOs, IT Directors, COOs, CTOs and senior IT managers.

More about: Hewlett-Packard, LeftHand Networks, Linux, Pearson, Pearson Education, Veeam, VizionCore, VMware, VSA

Comments

Post new comment

The content of this field is kept private and will not be shown publicly.
Users posting comments agree to the CIO comments policy.
Login or register to link comments to your user profile, or you may also post a comment without being logged in.
Related Coverage
Related Whitepapers
Latest Stories
Community Comments
Tags: virtualisation
Latest Blog Posts
Whitepapers
  • Cost Effective Security and Compliance with Oracle Database 11g Release 2
    Information ranging from trade secrets to financial data to privacy related information has become the target of sophisticated attacks from both sides of the firewall. Built upon 30 years of security experience, the Oracle database provides defense-in-depth security controls that enable organizations to transparently protect data. By leveraging these controls, organizations can safeguard data, ensure regulatory compliance, and achieve business goals such as consolidation, globalization, right sourcing and cloud computing while still maintaining scalability, performance and availability. Read this whitepaper.
    Learn more »
  • Oracle x86 Rack Servers Optimized for Rapid Deployments and Operational Efficiency
    Business-critical and mission-critical workloads — demanding applications and databases — require stable and secure environments. When these types of workloads are deployed on x86 servers, the need to ensure business continuity, maximum uptime, and consistent processing means that IT managers and business unit managers are looking at enterprise x86 servers in a new way: They realize that the business depends on these servers and that x86 server platforms for the enterprise are no longer expendable, as they might have been when servers were dedicated to a single application — or when they were deployed as small Web servers that could be easily taken offline and replaced.
    Learn more »
  • 10 Essential Steps to Web Security
    This short guide outlines 10 simple steps to best practice in web security. Follow them all to step up your organisation’s information security and stay ahead of your competitors. But remember that the target never stands still. Focus on the principles behind the steps – policy, vigilance, simplification, automation and transparency – to keep your information security bang up to date.
    Learn more »
All whitepapers
rhs_login_lockGet exclusive access to Invitation only events CIO, reports & analysis.
Recent comments