high availability deep dive what’s new in vsphere 5 david lane, virtualization engineer high point...

Post on 01-Apr-2015

221 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

TRANSCRIPT

High Availability Deep DiveWhat’s New in vSphere 5

David Lane, Virtualization Engineer High Point Solutions

Agenda

• What is High Availability

• What’s New in vSphere 5

• Core Components of High Availability vSphere 5

• How High Availability Works in vSphere 5

• Scenarios for High Availability in vSphere 5

• Exploiting High Availability with vSphere 5

• Q&A

What is High Availability?

• The Answer to Hardware Density Concerns

• Resilient Architecture

• Automated Recovery

• Simple Setup / Familiar Interface

High Availability Prerequisites

• Minimum of 2 Hosts

• Minimum of 3GB of Host Memory

• VMware vCenter Server

• Shared Storage

• Pingable Constant Address (Gateway)

• HA Communication Firewall Ports (TCP/UDP 8182)

• Essentials Plus and Up

Configuring High Availability

• 10 Steps - 10 Minutes

• Create a Cluster

• Drag and Drop Hosts

What’s New for vSphere 5

• FDM (Fault Domain Manager) – New HA Agent

• Master / Slave Nodes

• Datastore Heartbeating

• Enhanced Isolation Validation

• No DNS Dependency

• Supports Management Network Partitions

• Enhanced Admission Control Policies

Core HA Components of vSphere 5

• FDM (Fault Domain Manager)

• VMware vCenter

• hostd

FDM

• Replaces Legato AAM (Automated Availability Manager)

• Single Process Agent with Watchdog Failsafe

• No DNS Dependency No DNS Limitations

• Consolidated Logging with Syslog Compatibility

• Talks Directly to hostd and vCenter Not Dependent on VPXA

VMware vCenter

• Deploys FDM Agents – Parallel (AAM Serial)

• Communicates Configuration Changes in Cluster to Master Node

• Retrieves Virtual Machine Status

• Displays Protection Status of VMs

hostd

• Required for FDM

• Runs on Host

• Relays information about VMs on host

• Responsible to Power On VMs

How Does High Availability vSphere 5 work? The Tools

• Master / Slave Nodes

• Heartbeating

• Isolated vs. Network Partitioned

• Virtual Machine Protection

Master / Slave Nodes

• One Master Node Per Cluster (exception Network Partitioned)

• Master Node Monitors VM Health Directs Slaves

• Master Node Takes Ownership of Datastores where VMs Configuration Files are Located

• Master Node Reports VM Status to vCenter Server

• Master Node Assigned by Election

• Slaves Monitor Their running VMs and send Status to Master and perform restarts on Master Node Requests

• Slaves Also Monitor Master Node Health

Master Node Election• Election held When HA is Enabled or Reconfigured and

When Master Node - Fails, Becomes Isolated or Partitioned, Disconnects from vCenter, In Maintenance Mode, In Standby

• Utilizes UDP

• Takes 15 Seconds

• Host with Most Connected Datastores Wins

• If Multiple Hosts Share Highest Number Of Datastores the Host with the highest Managed Object ID (MOID) Wins

• New Master Node will Attempt to Acquire Ownership of All Datastores by Locking “protectedlist” File (Protected VM List Inventory File, on Datastores in Cluster)

• In The Case of Master Node Isolation File Locks will be Released

Heartbeating

• Network Heatbeating

• Datastore Heartbeating

Network Heartbeating

• Heartbeats sent from Slaves to Master and From Master to Slaves

• Heartbeats Sent Every Second

• Determines the State Of the Hosts

Datastore Heartbeating

• Prevents Unnecessary Restarts

• Extra Heartbeat Added to Determine State if Management Network is Lost

• Validates Failure or Just Isolation • Uses PowerOn File to Determine Isolation

Isolated vs. Network Partitioned

• Isolated (Host Separated from Master VMs May be Restarted)– Not Receiving Heartbeat From Master– Not Receiving Election Traffic– Cannot Ping Isolation Address

• Partitioned (Multiple Host Isolated but Can Communicate to Each Other Over Management Network)– Not Receiving Heartbeats from Master– Does Receive Election Traffic

Virtual Machine Protection

• vCenter Server Performs Protection on State Change

• Protection guaranteed when the master has committed the change of state to disk

• Protectedlist File Contains VM State and Protection

Scenarios For High Availability vSphere 5 Using The Tools

• Failed Host• Isolated Host• Application Monitoring - Failed VM OS

Failed Host

• Failed Master Host– Master Election Initiated– New Master Elected– New Master Restarts all VMs on the Protectedlist with Not

Running State

• Failed Slave Host– Master Check Network heartbeat– Master Checks Datastore Heartbeat– Master Restarts VMs Affected

Isolated Host

• Isolation Responses– Power Off– Leave Powered On– Shut Down

• Isolation Detection– Slaves will Hold Single Server Election and Check Ping Address– Master will Check Ping Address– Master Restarts VMs Affected

Application Monitoring - Failed VM OS

• Restarts Individual VM When Needed

• Configurable VM Tools Heartbeat

• Monitors Network and Storage I/O Activity as Fail-Safe

Exploiting HA with vSphere 5

• Stretched Clusters– Storage DRS

• Blade Chassis Failure

• Larger Clusters Tenant Based Cloud

Q&A

THANK YOU

top related