vs5icm_m11_highavailability

51
© 2011 VMware Inc. All rights reserved High Availability and Fault Tolerance Module 11

Upload: luis-armando-mejia-carbajal

Post on 19-Jan-2016

17 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: VS5ICM_M11_HighAvailability

© 2011 VMware Inc. All rights reserved

High Availability and Fault Tolerance

Module 11

Page 2: VS5ICM_M11_HighAvailability

11-2

© 2011 VMware Inc. All rights reserved

You Are Here

VMware vSphere: Install, Configure, Manage – Revision A

Course Introduction

Introduction to Virtualization

Virtual Machines

VMware vCenter Server

Configure and Manage Virtual Networks

Configure and Manage Virtual Storage

Managing Virtual Machines

Data Protection

Access & Authentication Control

Resource Management and Monitoring

High Availability

Scalability

Patch Management

Installing vSphere Components

Page 3: VS5ICM_M11_HighAvailability

11-3

© 2011 VMware Inc. All rights reserved

Importance

VMware vSphere: Install, Configure, Manage – Revision A

Most organizations rely on computer-based services like email, databases, and Web-based applications. The failure of any of these services can mean lost productivity and revenue. Configuring highly available, computer-based services is extremely important for an organization to remain competitive in contemporary business environments.

With VMware vSphere® 5, a new high availability architecture has been released.

Page 4: VS5ICM_M11_HighAvailability

11-4

© 2011 VMware Inc. All rights reserved

Module Lessons

VMware vSphere: Install, Configure, Manage – Revision A

Lesson 1: Introduction to vSphere High Availability

Lesson 2: Configuring vSphere High Availability

Lesson 3: vSphere High Availability Architecture

Lesson 4: Introduction to Fault Tolerance

Page 5: VS5ICM_M11_HighAvailability

11-5

© 2011 VMware Inc. All rights reserved

Lesson 1:Introduction to

vSphere High Availability

VMware vSphere: Install, Configure, Manage – Revision A

Page 6: VS5ICM_M11_HighAvailability

11-6

© 2011 VMware Inc. All rights reserved

Learner Objectives

VMware vSphere: Install, Configure, Manage – Revision A

After this lesson, you should be able to do the following:

Describe the various options that you can configure to ensure high availability in a vSphere 5 environment.

Discuss the response of vSphere High Availability when a VMware® ESXi™ host, a virtual machine, or an application fails.

Page 7: VS5ICM_M11_HighAvailability

11-7

© 2011 VMware Inc. All rights reserved

VMware vSphere: Install, Configure, Manage – Revision A

VMware Offers Protection at Every Level

NIC Teaming, Storage

Multipathing

VMware vSphere® vMotion®, DRS

vSphere Storage VMotion

Site Recovery Manager

Component Server Storage Data Site

3rd-Party Backup Solutions,

VMware Data Recovery

VMFS

VMFS

High Availability & Fault Tolerance

vSphere 5

VM

VMVM

Protection against hardware failures Planned maintenance with zero downtime Protection against unplanned downtime

and disasters

Page 8: VS5ICM_M11_HighAvailability

11-8

© 2011 VMware Inc. All rights reserved

vCenter Server Availability - Recommendations

VMware vSphere: Install, Configure, Manage – Revision A

Make VMware vCenter Server™ and the components it relies on highly available.

vCenter Server relies on:

vCenter Server database:

• Cluster the database. Refer to the specific database documentation.

Active Directory structure:

• Set up with multiple redundant servers.

Methods for making vCenter Server available:

Use vSphere High Availability to protect the vCenter Server virtual machine.

Use VMware vCenter Server Heartbeat™.

Page 9: VS5ICM_M11_HighAvailability

11-9

© 2011 VMware Inc. All rights reserved

High Availability

VMware vSphere: Install, Configure, Manage – Revision A

A highly available system is one that is continuously operational for a desirably long length of time.

Level of availability Downtime per year

99% 87 hours (3.5 days)

99.9% 8.76 hours

99.99% 52 minutes

99.999% 5 minutes

What level of virtual machine availability is

important to you?

Page 10: VS5ICM_M11_HighAvailability

11-10

© 2011 VMware Inc. All rights reserved

vSphere High Availability

VMware vSphere: Install, Configure, Manage – Revision A

vSphere HA

Level of availability High availability

Amount of downtime Minimal

Guest operating systems supported Works with all supported guest operating systems

VMware ESXi hardware supported Works with all supported ESXi hardware

UsesUse to provide high availability for the virtual machines that require that level of protection.

Page 11: VS5ICM_M11_HighAvailability

11-11

© 2011 VMware Inc. All rights reserved

VMware vSphere: Install, Configure, Manage – Revision A

vSphere HA Failure Scenarios

ESXi host failure

Guest OS failure

Application failure

Page 12: VS5ICM_M11_HighAvailability

11-12

© 2011 VMware Inc. All rights reserved

LUN 1 LUN 2 LUN 3

High Availability Failure Scenario - Host

VMware vSphere: Install, Configure, Manage – Revision A

virtual machine A

virtual machine B

virtual machine C

virtual machine F

vCenter Server

ESXi host ESXi hostESXi host

virtual machine D

virtual machine E

virtual machine A virtual machine B

When a host fails, vSphere HA restarts the affected virtual machines on other hosts

= vSphere HA cluster

Page 13: VS5ICM_M11_HighAvailability

11-13

© 2011 VMware Inc. All rights reserved

LUN 1 LUN 2 LUN 3

High Availability Failure Scenario – Guest Operating System

VMware vSphere: Install, Configure, Manage – Revision A

vCenter Server

ESXi host ESXi host

= vSphere HA cluster

ESXi host

When a virtual machine stops sending heartbeats or the virtual machine process crashes (vmx), vSphere HA resets the virtual machine

virtual machine C

VMware tools VMware tools

virtual machine E

VMware tools

virtual machine F

VMware tools

virtual machine A

VMware tools

virtual machine BVMware tools

virtual machine D

Page 14: VS5ICM_M11_HighAvailability

11-14

© 2011 VMware Inc. All rights reserved

LUN 1 LUN 2 LUN 3

HA Failure Scenario - Application

VMware vSphere: Install, Configure, Manage – Revision A

vCenter Server

ESXi host ESXi hostESXi host

virtual machine E

application When an application fails, vSphere HA restarts the affected virtual machine on the same host.

Requires VMware Tools to be installed

virtual machine C

application

virtual machine F

applicationvirtual machine D

application

virtual machine A

application

virtual machine B

application

= vSphere HA cluster

Page 15: VS5ICM_M11_HighAvailability

11-15

© 2011 VMware Inc. All rights reserved

Review of Learner Objectives

VMware vSphere: Install, Configure, Manage – Revision A

You should be able to do the following:

Describe the various options that you can configure to ensure high availability in a vSphere 5 environment.

Discuss the response of vSphere High Availability when an ESXi host, a virtual machine, or an application fails.

Page 16: VS5ICM_M11_HighAvailability

11-16

© 2011 VMware Inc. All rights reserved

Lesson 2:Configuring vSphere High Availability

VMware vSphere: Install, Configure, Manage – Revision A

Page 17: VS5ICM_M11_HighAvailability

11-17

© 2011 VMware Inc. All rights reserved

Learner Objectives

VMware vSphere: Install, Configure, Manage – Revision A

After this lesson, you should be able to do the following:

Configure a vSphere HA cluster.

Page 18: VS5ICM_M11_HighAvailability

11-18

© 2011 VMware Inc. All rights reserved

Enabling vSphere HA

VMware vSphere: Install, Configure, Manage – Revision A

Enable vSphere HA by creating a cluster or modifying a vSphere Distributed Resource Scheduler (DRS) cluster.

Page 19: VS5ICM_M11_HighAvailability

11-19

© 2011 VMware Inc. All rights reserved

Configuring vSphere HA Settings

VMware vSphere: Install, Configure, Manage – Revision A

Disable Host Monitoring when

performing maintenance

on any cluster/host. Enabled is the default setting.

Admission Controlrefers to the amount of available resourcesthat can be used to

start virtual machineson a specific ESXi

host. The default setting

is to disallow power and other operations that will violate the

set Admission Control Policy.

Admission control helps ensure sufficient resources to provide

high availability.Default setting is Host failures the cluster tolerates.

VMware recommended

setting

Page 20: VS5ICM_M11_HighAvailability

11-20

© 2011 VMware Inc. All rights reserved

Admission Control Policy Choices

VMware vSphere: Install, Configure, Manage – Revision A

Policy Description Recommended use

Percentage of cluster resources reserved as failover spare capacity

Reserves specified percentage of total capacity

When virtual machines have highly variable CPU and memory reservations

Host failures cluster tolerates

Reserves enough resources to tolerate specified number of host failures

When virtual machines have similar CPU/memory reservations and similar memory overheads

Specify a failover host Dedicates a host exclusively for failover service

To accommodate organizational policies that dictate the use of a passive failover host

Page 21: VS5ICM_M11_HighAvailability

11-21

© 2011 VMware Inc. All rights reserved

Configuring Virtual Machine Options

VMware vSphere: Install, Configure, Manage – Revision A

Configure options at the cluster level or per virtual machine.

VM restart priority determines relative order in which virtual machines are restarted after a host

failure.

Host Isolation response determines what happens to virtual machines when a host loses

the management network but continues running.

Page 22: VS5ICM_M11_HighAvailability

11-22

© 2011 VMware Inc. All rights reserved

Configuring Virtual Machine Monitoring

VMware vSphere: Install, Configure, Manage – Revision A

Reset a virtual machine if its VMware Tools

heartbeat or VMware Tools application

heartbeats are not received.

Determine how quickly failures are

detected.

Set monitoring sensitivity for individual virtual machines.

Page 23: VS5ICM_M11_HighAvailability

11-23

© 2011 VMware Inc. All rights reserved

Importance of Redundant Heartbeat Networks

VMware vSphere: Install, Configure, Manage – Revision A

In a vSphere HA cluster, heartbeats are:

Sent between the master and the slave hosts

Used to determine if a master or slave host has failed

Sent over a heartbeat network

The heartbeat network is:

Implemented using a VMkernel port marked for management

Redundant heartbeat networks:

Allow for the reliable detection of failures

Page 24: VS5ICM_M11_HighAvailability

11-24

© 2011 VMware Inc. All rights reserved

Redundancy Using NIC Teaming

VMware vSphere: Install, Configure, Manage – Revision A

You can use NIC teaming to create a redundant heartbeat network on ESXi hosts.

Both port groups must be VMkernel ports.

NIC teaming on an ESXi host

Page 25: VS5ICM_M11_HighAvailability

11-25

© 2011 VMware Inc. All rights reserved

Redundancy Using Additional Networks

VMware vSphere: Install, Configure, Manage – Revision A

You can also create redundancy by configuring more heartbeat networks:

On ESXi hosts, add one or more VMkernel networks marked for management traffic.

Configure port group with these settings:

Set Load Balancing to originating port ID.

Do not enable Failback.

Configure port group with active/standby failover.

Page 26: VS5ICM_M11_HighAvailability

11-26

© 2011 VMware Inc. All rights reserved

Network Configuration and Maintenance

VMware vSphere: Install, Configure, Manage – Revision A

Before changing the networking configuration on the ESXi hosts (adding port groups, removing vSwitches):

Deselect Enable Host Monitoring.

Place the host inmaintenance mode.

These steps prevent unwanted attempts to fail over virtual machines.

Page 27: VS5ICM_M11_HighAvailability

11-27

© 2011 VMware Inc. All rights reserved

Cluster Resource Allocation Tab

VMware vSphere: Install, Configure, Manage – Revision A

How much CPU and memory resources is the cluster using now?

How much reserved capacity remains?

Page 28: VS5ICM_M11_HighAvailability

11-28

© 2011 VMware Inc. All rights reserved

Monitoring Cluster Status

VMware vSphere: Install, Configure, Manage – Revision A

The vSphere HA Cluster Status window displays details about host operational status, virtual machine protection, and heartbeat datastores

The Configuration Issues window displays the current vSphere HA operational status, including the specific status and errors for each master and slave host in the cluster.

cluster’s Summary tab

Page 29: VS5ICM_M11_HighAvailability

11-29

© 2011 VMware Inc. All rights reserved

Lab 18

VMware vSphere: Install, Configure, Manage – Revision A

In this lab, you will modify slot sizes and admission control.

1. Create a cluster enabled for vSphere HA.

2. Add your ESXi host to a cluster.

3. Test vSphere HA functionality.

4. Prepare for the next lab.

Page 30: VS5ICM_M11_HighAvailability

11-30

© 2011 VMware Inc. All rights reserved

Review of Learner Objectives

VMware vSphere: Install, Configure, Manage – Revision A

You should be able to do the following:

Configure a vSphere HA cluster.

Page 31: VS5ICM_M11_HighAvailability

11-31

© 2011 VMware Inc. All rights reserved

Lesson 3:vSphere High Availability Architecture

VMware vSphere: Install, Configure, Manage – Revision A

Page 32: VS5ICM_M11_HighAvailability

11-32

© 2011 VMware Inc. All rights reserved

Learner Objectives

VMware vSphere: Install, Configure, Manage – Revision A

After this lesson, you should be able to do the following:

Describe heartbeat mechanisms used by vSphere HA.

Identify and discuss additional failure scenarios.

Page 33: VS5ICM_M11_HighAvailability

11-33

© 2011 VMware Inc. All rights reserved

vSphere HA Architecture: Agent Communication

VMware vSphere: Install, Configure, Manage – Revision A

vCenter Server

ESXi host (slave)

FDM

ESXi host (master)

FDM

ESXi host (slave)

FDM

vpxd

hostdhostdhostd

datastore datastoredatastore

= Management network

vpxa vpxa vpxa

Page 34: VS5ICM_M11_HighAvailability

11-34

© 2011 VMware Inc. All rights reserved

vSphere HA Architecture: Network Heartbeats

VMware vSphere: Install, Configure, Manage – Revision A

vCenter Server

virtual machine A

virtual machine B

virtual machine C

virtual machine D

virtual machine E

virtual machine F

ESXi host(slave)

ESXi host(slave)

ESXi host(master)

NAS/NFS VMFS Local

Management network 1

Management network 2

Page 35: VS5ICM_M11_HighAvailability

11-35

© 2011 VMware Inc. All rights reserved

vSphere HA Architecture: Datastore Heartbeats

VMware vSphere: Install, Configure, Manage – Revision A

Management network 1

Management network 2

vCenter Server

virtual machine A

virtual machine B

virtual machine C

virtual machine D

virtual machine E

virtual machine F

ESXi host(slave)

ESXi host(master)

ESXi host(slave)

NAS/NFS VMFS Local

Cluster Edit Settings Window

Page 36: VS5ICM_M11_HighAvailability

11-36

© 2011 VMware Inc. All rights reserved

Additional HA Failure Scenarios

VMware vSphere: Install, Configure, Manage – Revision A

Slave host failure

Master host failure

Host isolation

Management network failures

• Network partition

• Network isolation

Page 37: VS5ICM_M11_HighAvailability

11-37

© 2011 VMware Inc. All rights reserved

Failed Slave Host

VMware vSphere: Install, Configure, Manage – Revision A

vCenter Server

virtual machine A

virtual machine B

virtual machine C

virtual machine D

virtual machine E

virtual machine F

ESXi host(slave)

ESXi host (master)

ESXi host(slave)

NAS/NFS(lock file)

?

file locks file locks

primary heartbeat network

alternate heartbeat network

VMFS(heartbeat region)

Page 38: VS5ICM_M11_HighAvailability

11-38

© 2011 VMware Inc. All rights reserved

Failed Master Host

VMware vSphere: Install, Configure, Manage – Revision A

virtual machine A

virtual machine B

virtual machine C

virtual machine D

virtual machine E

virtual machine F

ESXi hostRole: slaveMOID: 98

ESXi hostRole: masterMOID: 99

file locks

NAS/NFS(lock file)

?

file locks

vCenter Serverprimary heartbeat network

alternate heartbeat network

MOID = managed object ID

default gateway(isolation address)

ESXi hostRole: slaveMOID: 100

VMFS(heartbeat region)

Page 39: VS5ICM_M11_HighAvailability

11-39

© 2011 VMware Inc. All rights reserved

Isolated Host

VMware vSphere: Install, Configure, Manage – Revision A

virtual machine A

virtual machine B

virtual machine C

virtual machine D

virtual machine E

virtual machine F

ESXi host ESXi host

default gateway(isolation address)

ESXi host

The host is not observing any election traffic on the management and cannot ping its isolation address(es), the host is isolated.

Page 40: VS5ICM_M11_HighAvailability

11-40

© 2011 VMware Inc. All rights reserved

VMware vSphere: Install, Configure, Manage – Revision A

Design Considerations

Host isolation events can be minimized through good design

Implement redundant heartbeat networks

Implement redundant isolation addresses

If host isolation events do occur, good design enables vSphere HA to determine whether the isolated host is still alive

Implement datastores so that they are separated from the management network using one or both of the following approaches:

• Fibre Channel over fibre optic

• Physically separating your IP storage network from the management network

Page 41: VS5ICM_M11_HighAvailability

11-41

© 2011 VMware Inc. All rights reserved

Network Partition

VMware vSphere: Install, Configure, Manage – Revision A

default gateway(isolation address)

virtual machine A

virtual machine B

ESXi hostMASTER

virtual machine C

virtual machine D

ESXi hostSLAVE

virtual machine E

virtual machine F

ESXi hostSLAVE

virtual machine G

virtual machine H

ESXi hostSLAVE

vCenter Server

MASTER

Page 42: VS5ICM_M11_HighAvailability

11-42

© 2011 VMware Inc. All rights reserved

VMware vSphere: Install, Configure, Manage – Revision A

Review of Learner Objectives

You should be able to do the following:

Describe heartbeat mechanisms used by vSphere HA

Identify and discuss additional failure scenarios

Page 43: VS5ICM_M11_HighAvailability

11-43

© 2011 VMware Inc. All rights reserved

Lesson 4:Introduction to Fault Tolerance

VMware vSphere: Install, Configure, Manage – Revision A

Page 44: VS5ICM_M11_HighAvailability

11-44

© 2011 VMware Inc. All rights reserved

Learner Objectives

VMware vSphere: Install, Configure, Manage – Revision A

After this lesson, you should be able to do the following:

List Fault Tolerance requirements and limitations.

Describe Fault Tolerance operation.

Page 45: VS5ICM_M11_HighAvailability

11-45

© 2011 VMware Inc. All rights reserved

What Is Fault Tolerance (FT)?

VMware vSphere: Install, Configure, Manage – Revision A

FT:

A fault-tolerant system is designed so that, in the event of an unplanned outage, a backup virtual machine can immediately take over with no loss of service. (The backup virtual machine is called a secondary virtual machine.)

• Provides a higher level of business continuity than vSphere HA

• Provides zero downtime and zero data loss for applications

FT can be used for any application that needs to be available at all times.

FT can be used with DRS:

Fault-tolerant virtual machines benefit from better initial placement and are included in the cluster’s load-balancing calculations.

Page 46: VS5ICM_M11_HighAvailability

11-46

© 2011 VMware Inc. All rights reserved

VMware Fault Tolerance

VMware vSphere: Install, Configure, Manage – Revision A

Fault Tolerance

Level of availability Fault tolerance

Amount of downtime Zero

Guest operating systems supported Works with all supported guest operating systems

ESXi hardware supported Widely compatible

Uses Use to provide fault tolerance to your critical virtual machines.

Page 47: VS5ICM_M11_HighAvailability

11-47

© 2011 VMware Inc. All rights reserved

Fault Tolerance in Action

VMware vSphere: Install, Configure, Manage – Revision A

FT provides zero-downtime, zero-data-loss protection to virtual machines in a vSphere HA cluster.

primary VM

secondaryVM

newprimary

VM

new secondary

VM

vLockstep technology vLockstep technology

Page 48: VS5ICM_M11_HighAvailability

11-48

© 2011 VMware Inc. All rights reserved

Fault Tolerance Guidelines

VMware vSphere: Install, Configure, Manage – Revision A

Check the requirements and limitations of FT.

Ensure enough ESXi hosts for fault-tolerant virtual machines:

No more than four fault-tolerant virtual machines (primaries or secondaries) on any single host

Store ISOs on shared storage for continuous access:

Especially if used for important operations

Disable BIOS-based power management:

Prevents the secondary virtual machine from having insufficient CPU resources

Page 49: VS5ICM_M11_HighAvailability

11-49

© 2011 VMware Inc. All rights reserved

Enabling Fault Tolerance on a Virtual Machine

VMware vSphere: Install, Configure, Manage – Revision A

Page 50: VS5ICM_M11_HighAvailability

11-50

© 2011 VMware Inc. All rights reserved

Review of Learner Objectives

VMware vSphere: Install, Configure, Manage – Revision A

You should be able to do the following:

List Fault Tolerance requirements and limitations.

Describe Fault Tolerance operation.

Page 51: VS5ICM_M11_HighAvailability

11-51

© 2011 VMware Inc. All rights reserved

Key Points

VMware vSphere: Install, Configure, Manage – Revision A

vSphere HA restarts virtual machines on the remaining hosts in the cluster.

Hosts in vSphere HA clusters have a master/slave relationship.

Implement redundant heartbeat networks either with NIC teaming or by creating additional heartbeat networks.

FT provides zero downtime for applications that need to be available at all times.