spi white paper final

12
 Best Practices: VMware Monitoring with HP Operations Manager Written by:  Alec King Senior Product Manager Veeam Software

Upload: jitendra-singh

Post on 06-Apr-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 1/12

Best Practices: VMware Monitoringwith HP Operations Manager

Written by: Alec King

Senior Product ManagerVeeam Software

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 2/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 2

Best Practices: VMware Monitoring with HP Operations Manager

CONTENTS 

INTRODUCTION ...............................................................................................................................2 

INTENDED AUDIENCE .....................................................................................................................3 

DISCLAIMER ...................................................................................................................................3 WHAT INFORMATION IS PROVIDED? ....................................................................................3 

QUESTIONS TO ASK ........................................................................................................................3 

HOW IS THE DATA GATHERED? ...............................................................................................5 

 THE WRONG WAY...........................................................................................................................5 

 THE RIGHT WAY..............................................................................................................................7 

ADDITIONAL CONSIDERATIONS.....................................................................................................8 

CERTIFIED “VMWARE READY OPTIMIZED” ..................................................................................8 

HOW IS THE SOLUTION INTEGRATED INTO OPS MANAGER? ...................................8 

INTEGRAL VS. CONNECTED............................................................................................................9 

ADDITIONAL CONSIDERATIONS.....................................................................................................9 HOW DOES THE SOLUTION SCALE? .................................................................................... 10 

SCALABILITY FOR VIRTUAL ENVIRONMENTS.............................................................................. 10 

CONCLUSION .................................................................................................................................11 

ABOUT THE AUTHOR .................................................................................................................12 

ABOUT VEEAM SOFTWARE .....................................................................................................12 

INTRODUCTIONWhen an organization chooses HP Operations Center as its enterprise management

system (EMS), it makes substantial capital and operational investments in order to

reduce downtime and keep the IT environment running smoothly.

Increasingly, that environment includes VMware vSphere (or its predecessor,

VMware Infrastructure). The main objective for deploying vSphere typically is to

reduce costs through increased operational efficiency and capital utilization, and

many organizations have realized these benefits.

However, vSphere also adds complexity and, like anything in the environment,

must be monitored and managed. Ideally, this takes place in the existing EMS. This

kind of integrated approach allows operators to monitor physical and virtualsystems—and the applications and services running there—from the comfort of 

the HP Operations Manager (Ops Manager) console. They can quickly identify the

source of a problem and immediately escalate it to the right team, thereby

speeding problem resolution.

Monitoring vSphere with Ops Manager also minimizes additional management

infrastructure and operator training because it uses the framework that’s already in

place and that operators are already familiar with. It not only leverages existing

investments in Ops Manager, it actually protects those investments by ensuring

that Ops Manager is truly the “single pane of glass” it is intended to be.

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 3/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 3

Best Practices: VMware Monitoring with HP Operations Manager

Organizations have several choices for bringing vSphere to the Ops Manager

console. In evaluating these choices, there are four important considerations:

  What information is provided?

  How is the data gathered?

  How is the solution integrated into Ops Manager?

  How does the solution scale?

 This paper examines each of these in greater detail to provide readers with a simple

but highly relevant set of criteria for choosing the best possible solution. With more

and more mission-critical applications and services running on virtual machines, it’s

an important decision with significant implications for IT and the business.

Intended audience

  This document is intended for IT directors, data center managers, and HP OpsManager administrators in organizations evaluating or using VMware vSphere in

their production environment.

Disclaimer

Use this proven practice at your discretion. Veeam Software and the author do not

guarantee any results from the use of this proven practice. It is provided on an as-is

basis for demonstration purposes only.

WHAT INFORMATION IS PROVIDED?HP Ops Manager is a sophisticated framework for monitoring the IT infrastructure,

with advanced alerting, notification, data management, analytics, dashboard, and

other capabilities. But it natively knows very little about the distinct IT elements it

monitors. Instead, it relies on additional plug-in components to provide the know-

how to effectively monitor the various aspects of the IT infrastructure.

How well Ops Manager monitors vSphere depends on the intelligence of the

component that’s gathering, analyzing, and delivering information about the

virtual environment. Therefore, that component should encompass deep vSphere

expertise, including what metrics are available, what metrics are most important,and what thresholds are most appropriate. It should serve as the vSphere “expert in

a box,” capturing and putting into action best practices for vSphere monitoring and

management.

Questions to ask 

When evaluating options for bringing vSphere to HP Ops Manager, organizations

should evaluate the vSphere credentials of the vendor and the depth of vSphere

expertise in the solution.

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 4/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 4

Best Practices: VMware Monitoring with HP Operations Manager

Relevant questions include:

  Is the vendor a virtualization specialist? Or are they a generalist with

limited vSphere expertise?

  Why is the vendor offering a vSphere solution? Is this their core

business? Or are they just trying to “check the box” in terms of theplatforms they cover?

  How long has the vendor been managing vSphere? What is their

experience in monitoring large production environments? With vSphere

deployments growing in size and importance, organizations cannot afford

to be the learning lab or the proving ground for a product development

team with little vSphere experience.

  Is the solution specific to vSphere? Or is it a generalized solution,

addressing the “lowest common denominator” among the various

hypervisors?

  What pre-configured monitoring policies does the solution include? 

Pre-configured policies help avoid overflowing the Ops Manager console

with unimportant or informational events. While it should be possible to

customize policies (ideally from the Ops Manager console itself), the

solution should achieve at least an eighty percent fit right out-of-the-box.

  Does the solution monitor events as well as performance? Some

solutions only capture performance data, ignoring the myriad of vSphere

events that alert administrators to potential security, configuration, and

licensing issues. Failed tasks, cluster configuration issues, and virtual

machine deployment or migration failures are just a few examples of the

events that must be monitored in order to have full visibility of what’s

happening—and what needs attention—in the virtual environment. 

  What “value-add” does the solution provide? Does it filter, organize, and

present information in the most useful way possible? Does it provide

derived metrics in addition to what’s available natively from vSphere?

  What supporting information does the solution provide? Does it simply

present metrics and events, or does it explain their meaning, outline next

steps, and suggest corrective action? Can it shorten the learning curve for

operators who are new to virtualization? Does it elevate front-line support

staff by providing readily accessible VMware knowledge?

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 5/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 5

Best Practices: VMware Monitoring with HP Operations Manager

HOW IS THE DATA GATHERED? At first glance, various methods exist for collecting data about the virtual

infrastructure. However, further examination reveals the shortcomings of many of 

these methods.

The wrong way

Common technical approaches include the following:

  Ops Manager agent in the Virtual Machine Guest Operating System

(OS): For OS and application health monitoring, Ops Manager uses the Ops

Manager agent. The assumption is that through OS monitoring, the Ops

Manager agent can determine the utilization of the underlying hardware.

However, with virtualization, there’s not a one-to-one relationship between

the OS and the underlying hardware. So an Ops Manager agent running inthe OS cannot determine the performance of the underlying hardware

system. Only the hypervisor can provide a true picture of hardware

utilization and the resources provided to the VM (virtual machine) and the

guest OS. An Ops Manager agent inside the guest OS provides a skewed

picture, since the Ops Manager agent treats the VM as a physical machine.

In addition, vSphere-specific metrics from the hypervisor layer—for

example, balloon memory and CPU ready and wait times—are not visible

to the Ops Manager agent. Likewise, important components of vSphere—

ESX(i) hosts, clusters, and vCenter—are not visible to the Ops Manager

agent. And important vSphere features such as VMotion, perhapscontrolled by DRS (Distributed Resource Scheduler) and/or HA (High

Availability), are also invisible. Without visibility of vSphere as a whole, it is

impossible to monitor host performance, physical hardware status, VM

status and configuration, and other critical aspects of vSphere.

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 6/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 6

Best Practices: VMware Monitoring with HP Operations Manager

  Ops Manager agent in the ESX Host Console Operating System (COS):

Another method of collecting data from vSphere is to deploy a Linux

monitoring agent in the ESX host’s Console Operating System (COS).

However, the COS is not truly Linux, and the COS is not truly the ESX

hypervisor. Thus, an agent monitoring the COS provides a limited view of 

hypervisor performance.

An agent running in the COS also gives only partial visibility of many

vSphere aspects such as clustering, DRS, HA, DPM (Distributed Power

Management), and vCenter licensing, configuration and security.

COS-based agents also have the potential to impact the hypervisor.

Hypervisor instability has far-reaching consequences, affecting every VM on

the host—and even other hosts and their VMs in the cluster—and should

be avoided.

Finally, COS-based agents should be avoided because they have no long-

term viability. VMware’s stated direction is to deprecate the role of the COS,as they have already done in ESXi. Thus, any solution based on installing

components in the COS has no future.

  Legacy management protocols:   This includes SNMP trapping, “screen-

scraping” the ESX COS via SSH (to capture the limited metrics the COS can

request from the hypervisor), and Syslog monitoring. These approaches

suffer from many of the same shortcomings as an agent running in the

COS, including limited visibility of the complex vSphere environment and

the potential to impact hypervisor stability.

For the above reasons VMware developed a secure remote management API

(application programming interface) to communicate directly with the hypervisor

and with vCenter to gather accurate and detailed data about the virtual

environment.

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 7/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 7

Best Practices: VMware Monitoring with HP Operations Manager

The right way

 The VMware vSphere API (previously known as the VI API) allows for secure, efficient,

and effective monitoring of the virtual environment and eliminates the need to

deploy agents to hosts or virtual machines.

An agentless approach is of special importance since deploying an HP Ops Manager

agent in the ESX COS has been known to impact the stability and performance of the

hypervisor. Even COS agents from hardware vendors (the only kind approved by

VMware to run in the service console) must be re-certified for every VMware patch and

upgrade, and add additional management overhead. Using the vSphere API eliminates

the additional operational costs and the risks associated with COS-based agents.

In addition, agent-based monitoring is impossible for ESXi, which doesn’t have a COS

in which to deploy an agent. The vSphere API allows for safe, efficient, and accurate

monitoring of ESXi hosts in the same manner as ESX hosts.

 The vSphere API also allows for agentless hardware monitoring. VMware needed toprovide such a method when they eliminated the COS in ESXi. vSphere implements

the CIM-SMASH hardware model (see www.dmtf.org) to publish data about fans,

power supplies, temperature sensors, and the like. Solutions that use the vSphere API

have access to CIM-SMASH data for any version of ESXi and for ESX 3.5 Update 2

onwards.

In an enterprise environment, scalability and overhead must also be considered. The

vSphere API is the most efficient method for gathering data about the virtual

environment. It uses highly effective data summarization and data packaging

techniques, and minimizes overhead and consumption of network bandwidth.

vCenter Server itself is built on the vSphere API.The vSphere API is exposed as a web service on vCenter servers and ESX(i) hosts, and

can be accessed using the vSphere Web Services SDK (previously known as the VI SDK).

Only the vSphere API presents a comprehensive picture of the health of all vSphere

aspects and their dependencies. Thus, any viable monitoring solution for VMware

must employ the vSphere API.

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 8/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 8

Best Practices: VMware Monitoring with HP Operations Manager

Additional considerations

In addition to using the vSphere API to collect data, the monitoring solution should

allow administrators to configure the monitoring connections to vSphere. This

makes it possible to satisfy the broadest variety of requirements and scenarios—

now and in the future. It should be possible to:

  Connect to multiple vCenter servers. 

  Collect data only for certain hosts or virtual machines in a given

vCenter.

  Connect directly to ESX(i) hosts. This eliminates vCenter as a potential

“single point of failure” for the gathering of monitoring data, and also

allows monitoring of hosts that aren’t managed by vCenter (as may be the

case in remote branch offices).

Certified “VMware Ready Optimized”With the success of VMware, the number of vendors participating in the VMware

market has gone from tens to thousands in a few short years, and it is in vogue for

software companies to say they support the VMware Infrastructure. But which

vendors are actually qualified to make such an assertion? Which have played by the

rules set forth by VMware? Which are executing a strategy in line with VMware’s?

VMware created the VMware Ready Optimized program to address these

questions. It is the highest level of VMware certification for software products.

Products bearing the VMware Ready logo provide optimized levels of integration,

functionality, and performance that strictly adhere to VMware’s architectural,supportability, and future-proofing requirements, including use of the vSphere API.

  To ensure the stability of the production environment, the reliability of the

information and alerts provided, and the long-term viability of the solution itself,

any monitoring solution for VMware should be certified VMware Ready Optimized.

HOW IS THE SOLUTION INTEGRATED INTO

OPS MANAGER?As previously discussed, monitoring VMware with HP Ops Manager provides a

number of compelling benefits:

  Leverages the management infrastructure that’s already in place

  Reduces the amount of training operators need in order to provide 24x7

front-line monitoring for VMware

  Shortens problem resolution time by providing operators with full

visibility of the entire IT infrastructure

  Protects the integrity of the Ops Manager “single pane of glass”

  These benefits are most fully realized when VMware monitoring is implementednatively in the core Ops Manager architecture. And HP facilitates such an approach.

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 9/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 9

Best Practices: VMware Monitoring with HP Operations Manager

Wanting Ops Manager to be a true enterprise monitor but knowing it could not

itself address everything that might encompass, HP encouraged third parties to

develop specialized domain-specific monitoring and provided a mechanism to

deliver that in Ops Manager using Smart Plug-ins (SPIs). Thus, the use of third-party

SPIs has become an integral part of Ops Manager deployments.

Integral vs. Connected

 The alternative—a separate monitoring platform specifically for VMware, with its

own console, database, and alerting and notification system—is costly and

disruptive. Introducing another monitoring framework results in “console sprawl”

and requires further investments in staff training and redesigned management

processes. Even if the additional monitor includes a connector to Ops Manager, it

often provides only basic information or alerts. Only a truly integrated solution

allows all the features of Ops Manager to be employed and all the benefits of 

monitoring VMware with Ops Manager to be realized.

Additional considerations

In addition to the fundamental question of SPI versus connected monitor, the

degree of integration with HP Ops Manager should be evaluated. The best solution:

 Enables all standard EMS functions including performance, event, andstate monitoring; reporting and auditing; notifications; and so on. It enables

all the management functionality that operations staff expect to see and

use when something is monitored in Ops Manager.

  Integrates directly with the Ops Manager Service Map . It builds a

detailed topology of the virtual environment that shows component

connections and dependencies, allows for targeted monitoring, and assists

with root-cause analysis.

  Includes pre-defined remedial actions such as tools and tasks to allow

direct interaction with virtual machines and hosts (if operators have the

necessary permissions, of course).

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 10/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 10

Best Practices: VMware Monitoring with HP Operations Manager

  Can be configured from the Ops Manager console to lower

management costs and allow administrators to work in the environment

they are familiar with.

  Integrates with HP Operations Center reporting. This includes HP

Reporter, HP Performance Manager, and HP Performance Insight. Thesereporting components are part of the standard Operations Center ”toolkit”

for performance and configuration analysis, historical reporting, and

capacity planning. The solution should enable this reporting functionality

to realize all the benefits of the HP platform.

A solution that offers these capabilities—and includes built-in VMware expertise,

uses the vSphere API, and is configurable and scalable — will deliver solid value

now and in the future.

HOW DOES THE SOLUTION SCALE?Scalability may not be the most exciting aspect of a software evaluation, but it is

key to the success of the solution ultimately selected and deployed to production.

Even the best functionality is of little value if the solution fundamentally can’t

support the environment. And no one wants to invest time and effort to deploy a

solution they quickly outgrow—that is simply too expensive and too disruptive for

today’s cost-conscious, service-oriented IT department.

Scalability may not be the most exciting aspect of software development, either.

It’s a thankless task, garnering attention only when there’s a problem, with little

acknowledgement—from users or, in the case of a commercial software product,

from the market—of a job well done. It’s also a difficult task, requiring deepknowledge of system internals, solid architecture design skills, and patience to

develop, execute, and analyze repeated load tests. And the cost of failure is high:

there’s typically no quick fix when something goes wrong, and a significant system

redesign—which is disruptive to users, administrators, and project schedules—is

often the only solution.

Scalability is especially a concern with new software. What works well on paper

doesn’t necessarily work well in practice. It takes experience, production run-time,

and typically several product releases to get it right.

Scalability for Virtual Environments

 The virtual infrastructure is complex, with many moving parts and dependencies.

So the amount of information about the virtual infrastructure is immense.

Monitoring will suffer in the near-term if the solution can’t handle all the data. And

a solution that works today might not work tomorrow if the virtual environment

continues to grow.

For VMware monitoring with HP Ops Manager, scalability must be addressed on

two fronts:

 Collecting the data from vSphere

  Feeding the data to Ops Manager

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 11/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 11

Best Practices: VMware Monitoring with HP Operations Manager

  The vSphere API is the most efficient method to gather the data, and use of the

vSphere API helps to ensure the scalability of data collection.

In terms of delivering the data to Ops Manager, experience shows that the Ops

Manager API or other data injection methods aren’t the best approach. The Ops

Manager API is not designed to handle the volume of data coming from the virtual

environment, which may have hundreds or even thousands of objects, with many

different metrics and events. And if Ops Manager gets bogged down, all

monitoring is affected—not just monitoring of the virtual environment.

A better approach pairs the collector with an Ops Manager agent that feeds the

data to Ops Manager. Multiple collector/agent pairs can be deployed as necessary

to handle the data load. This kind of distributed architecture provides virtually

unlimited scalability and leverages Ops Manager’s Management Server ”back-end,”

which is designed to support multiple agents.

Of course, once you have multiple collectors, you need to a way to manage them.

Although the collectors ultimately feed data to a single central point (OpsManager), the collectors themselves must be configured and managed. To do this

one collector at a time would add significant administrative overhead and

opportunity for errors. In addition, it should be possible to coordinate the work of 

multiple collectors. For example, it should be possible to balance the load among

collectors, and have the collectors serve as backups for each other (high availability

monitoring). The result is a centrally managed, distributed architecture that can

easily grow as the monitored environment grows.

While scalability may not be the most exciting aspect of a software evaluation, it is

absolutely critical to success. Often, the most scalable solution is the most mature

one: it’s gone through several iterations, it includes sophisticated configurationoptions, and it’s been proven in real production environments. Product maturity

and vendor experience not only affect the depth of vSphere expertise in the

solution (as previously discussed), but they also affect scalability and are important

considerations for product selection.

CONCLUSIONFor organizations that have standardized on HP Operations Center, the benefits of 

monitoring VMware with HP Ops Manager are clear. What may not be so clear is

how to bring VMware to the Ops Manager console. This white paper suggests fourcriteria for evaluating the alternatives. In doing so, the white paper addresses three

rather distinct audiences with three rather distinct sets of needs:

  The Ops Manager team, which is responsible for the integrity of the EMS

and the delivery of monitoring data for all business-critical infrastructure

  The virtualization team, which is responsible for the performance and

availability of the virtual environment

  The business as a whole, which is responsible for costs and for service to

customers, employees, and other users

8/3/2019 Spi White Paper Final

http://slidepdf.com/reader/full/spi-white-paper-final 12/12

 

Ways VMware vSphere Improves Backup and Recovery

Page | 12

Best Practices: VMware Monitoring with HP Operations Manager

Veeam nworks Smart Plug-in for VMware is one alternative worthy of 

consideration. Developed in close collaboration with VMware and certified

“VMware Ready Optimized,” the nworks SPI is proven in more than 300 real

customer installations and more than 5 years of production use. It fully embraces

the vSphere API from VMware as well as the Ops Manager framework from HP, to

provide best-in-class monitoring for VMware. And because it’s from Veeam, aleading virtualization specialist, it encompasses deep knowledge and extensive

experience with VMware to provide the most effective monitoring of your virtual

environment and the most comprehensive support for your operations staff. For

more information about the nworks SPI, visit www.veeam.com/spi.

ABOUT THE AUTHORAlec King is a Senior Product Manager at Veeam Software. He has many years’

experience in enterprise systems management with a variety of companies,including Siemens and the British Broadcasting Corporation. He can be reached at

[email protected].

ABOUT VEEAM SOFTWAREVeeam Software, a premier-level VMware Technology Alliance Partner and member

of the VMware Ready Management program, provides innovative software for

managing VMware vSphere 4 and Virtual Infrastructure 3. Veeam offers an award-

winning suite of tools to assist the VMware administrator, including:

  Veeam Backup & Replication: #1 for Virtualization Data Protection 

  Veeam Reporter Enterprise: for VMware performance, storage, and 

capacity reporting and chargeback  

  Veeam Monitor: for vSphere performance monitoring and alerting across

multiple vCenter Servers

  Veeam Configurator: for complete host configuration management 

  Veeam Business View: for  business-aligned management of virtual 

machines, independent of the virtual infrastructure

With its acquisition of nworks in June 2008, Veeam's products also include thenworks Smart Plug-in  and the nworks Management Pack , which incorporate

VMware data into enterprise management consoles from HP and Microsoft. Learn

more about Veeam Software by visiting www.veeam.com.