getting started with splunk it service intelligence

44

Click here to load reader

Upload: splunk

Post on 19-Jan-2017

424 views

Category:

Technology


4 download

TRANSCRIPT

Page 1: Getting Started With Splunk It Service Intelligence

Copyright © 2015 Splunk Inc.

The Hands-On Version

BurchSystems Engineer, IT Operations Analytics SME

Page 2: Getting Started With Splunk It Service Intelligence

Setup Before You Can PlayDownload this presentation slide deck: https://splunk.box.com/splunkliveitsi16 Follow the instructions on your paper hand-out to log in to your VM.

Please log in as either• [email protected] OR• [email protected]• Password is “Changeme1” or

“Changeme2”

After logging in, selectIT Service Intelligence from thelist of apps at the left

2

Page 3: Getting Started With Splunk It Service Intelligence

ITSI Core Concepts

3

Page 4: Getting Started With Splunk It Service Intelligence

4

Experience with ITSI? Splunk?

Page 5: Getting Started With Splunk It Service Intelligence

5

ITSI – The Burch Version

Page 6: Getting Started With Splunk It Service Intelligence
Page 7: Getting Started With Splunk It Service Intelligence

7

ITSI – The Burch Version

Page 8: Getting Started With Splunk It Service Intelligence

8

ITSI – The Texbook Version

Page 9: Getting Started With Splunk It Service Intelligence

What is a Service?

Service RequestsResponses

In ITSI, a Service is a logical group of technology components that a user deems need to be monitored together.

It can often be generalized as a “black box” which we send requests, and expect responses

9

Page 10: Getting Started With Splunk It Service Intelligence

What is a Service?

DNS RequestsResponses

Technical Services

Auth RequestsResponses

Web RequestsResponses

Services can be lower level (technical) …

10

Page 11: Getting Started With Splunk It Service Intelligence

What is a Service?

DNS RequestsResponses

Technical Services

Customer Transactions

RequestsResponses

Business Services

Auth RequestsResponses

Web RequestsResponses

Support Desk RequestsResponses

Services can also be higher level (business) …

11

Page 12: Getting Started With Splunk It Service Intelligence

What is a Service?

Packet Network

Hypervisor and Hosts

RBMDBs

Storage Tier

API Services

Web Services

Customer Transactions

Mobile

API/Middlew

are

Partner Portal

DNS

Services can encompass multiple tiers of the IT domain. Services may also depend upon other services

12

Page 13: Getting Started With Splunk It Service Intelligence

What is a KPI?

DNS RequestsResponses

KPI: Number of requestsKPI: Error rateKPI: Average response timeKPI: Server CPU loadKPI: Server network I/F errors

Customer Transactions

RequestsResponses

KPI: Number of transactionsKPI: Error rateKPI: Average response timeKPI: Count of Incident TicketsKPI: Synthetic Transx Health

KPIs and Health scores constitute the means by which Services are monitored.

13

Page 14: Getting Started With Splunk It Service Intelligence

14

Key Performance Indicators (KPIs)

A Key Performance Indicator (KPI) is a Splunk saved search created within the ITSI UI that helps monitor a specific field like CPU, Memory, Number of Errors

and so on. KPIs are contained within Services.

Page 15: Getting Started With Splunk It Service Intelligence

Service Health Scores

15

A Health score is a score form 0-100 (0 being critical and 100 being normal) that helps determine the health of a Service. It is calculated based on all KPIs

importance and its status (e.g. green, orange, red), once every minute.

Page 16: Getting Started With Splunk It Service Intelligence

Now in ITSI

16

Page 17: Getting Started With Splunk It Service Intelligence
Page 18: Getting Started With Splunk It Service Intelligence

28

New Requirements!● Create a new KPI for the DB Service:

● Network Utilization

● Modify the Executive Glass Tablein order to show off the servicesyou slave over

“WE only have about 15min TO DO WHAT ???!!???”

Think about how long this would take you today?

Page 19: Getting Started With Splunk It Service Intelligence

29

Configuration of DB Service

Click Configure > Click Services

Page 20: Getting Started With Splunk It Service Intelligence

30

Let’s Talk Entities

● Select DB Service

● Entities are the relevant things which support this service (usually hosts)

● Select the right entries with filters, ANDs, ORs● Original Entity list can come from CMDB,

spreadsheet, Splunk search, others

Page 21: Getting Started With Splunk It Service Intelligence

31

A KPI in 5 minutes? Absolutely!

Click New – Generic KPI

Select Data Model● Host Operating System● Network● # bytes● Next

Page 22: Getting Started With Splunk It Service Intelligence

32

KPIs Continued….

Splunk Builds Searches for you – Oh Yeah, that’s happening

● Select Yes for Split by & Filter options● Select host for Entity Lookup & Alias options● Click Next

Page 23: Getting Started With Splunk It Service Intelligence

33

Almost There…Select● KPI Search Schedule: Every Minute ● Entity Calculation: Average● Service/Agg Calculation: Average● Calculation Window: Last Minute● Click Next

● Unit: Bps● Click Next

Page 24: Getting Started With Splunk It Service Intelligence

34

Final Steps …Set your thresholds:

● Aggregate (All) ● Per Entity

● Click “Add Threshold” TWICE● Make the Neapolitan ice cream colors

Yellow, Green, Yellow● Drag the sliders around in order to get

the current data graph entirely inside the Green (normal) band

● Click Finish● Other options are also available,

including adaptive thresholds and anomaly detection

Page 25: Getting Started With Splunk It Service Intelligence

35

Name that KPI!

From the list of KPIs, select your new one (at the bottom)● Click on the little pencil next to the name● Call it “Network Utilization”,

with your username up front

● Click on Save at bottom right when finished!

Page 26: Getting Started With Splunk It Service Intelligence

Adaptive Thresholds

36

What if your KPI data looks like this?

Page 27: Getting Started With Splunk It Service Intelligence

37

Adaptive ThresholdsStatic thresholds will not work…

Page 28: Getting Started With Splunk It Service Intelligence

38

Adaptive ThresholdsAdaptive Thresholding works beautifully with cyclical (and other dynamic) data

Page 29: Getting Started With Splunk It Service Intelligence

39

Anomaly Detection

● Machine Learning

● Works well for data with patterns

● Requires some “training” (trial & error) to zero in on best sensitivity

● More sophisticated capabilities coming! (multivariate, more algorithms, etc)

Page 30: Getting Started With Splunk It Service Intelligence

40

Let’s Fix that Glass Table

Page 31: Getting Started With Splunk It Service Intelligence

41

Clone the Glass TableReturn to Saved Glass Tables page (click on Glass Tables in the upper menu bar)

CLICK Edit for “Buttercup Games Business Process (IN PROGRESS)”• Select Clone• Title: Add your username

to the front• Permissions: Shared in App• Click Clone Page

• Click on your new Glass Tablefrom the list, to view it

Page 32: Getting Started With Splunk It Service Intelligence

42

Edit & Have Fun!Click on Edit in the upper right corner of your Glass Table

Use the “Services” panel on the left to select Individual KPIs, or Aggregate Service Health Scores• Choose 2 KPIs from Online Store that would be useful in

the “Order Process” section• Drag the selected widgets onto the canvas, positioning in

the gray oval

• What’s the difference between the

and tools at the top left?

Page 33: Getting Started With Splunk It Service Intelligence

43

More Fun with the Glass Table Editor…Use the Configurations panel on the right to edit a selected widget• Can change the visualization type, drilldown

behavior, and other settings

• You should hit Save frequently• I wonder what Auto Layout does?• (YIKES!) Revert All Changes might be helpful

Page 34: Getting Started With Splunk It Service Intelligence

44

Finishing up …• Add a ServiceHealthScore widget for Online

Store under Buttercup• Choose a Viz Type with a sparkline graph, then

resize to make it look pretty• Modify the Custom Drilldown action to go to

the saved glass table, Buttercup Games Online Store

• Bonus Points: Make the label bigger, more readable

• Click Save• View when done

Page 35: Getting Started With Splunk It Service Intelligence

45

A Troubleshooting ExerciseLet’s use ITSI to troubleshoot an outage ● Start at your Glass Table, “<UserName> Buttercup Business Process”● Customer Care reports that unhappy customers are complaining of failures

and long delays when trying to purchase● The calls began coming in at around ten minutes after the hour.● In the upper right corner of the Glass Table, change the time picker from Now

to XX:10:00.0, where XX is the appropriate hour. For example, if it is currently 14:05, set the time picker to 13:10:00.0, then Apply

● This is how we can “time travel” back to see conditions at a particular outage– oh yeah!

Page 36: Getting Started With Splunk It Service Intelligence

46

A Troubleshooting Exercise, cont’d● The Online Store seems to be degraded, just as Customer Care reported.

Click on the widget under Buttercup to drill down further

Page 37: Getting Started With Splunk It Service Intelligence

47

A Troubleshooting Exercise, cont’d.● The Online Store Glass Table shows a much more detailed view, including the impacted customer-facing KPIs

at the far left (Revenue, etc)

● Based on this view of all the relevant services, where do you think the root cause lies?

● Which service should we troubleshoot first?● Click on Health widget for that service, to

drill down to a Deep Dive

Page 38: Getting Started With Splunk It Service Intelligence

48

Deep Dive

● Deep Dive shows multiple KPIs and Health Scores in parallel “swim lanes”.

● The Health Score for this Service is the top swim lane. Can you see when it begins to degrade from 100%?

● Mousing over this point in time, can you spot the KPI with the leading fault indication, i.e., what failed first?

Page 39: Getting Started With Splunk It Service Intelligence

51

Review● High-value services can be decomposed and modeled in ITSI, using machine data

from the relevant systems● Services and KPIs can be created in minutes, with sophisticated thresholding

techniques to distinguish “normal” from “not normal”● Glass Tables allow service health and KPI metrics to be displayed in a way that

makes sense to specific groups, such as Executive Leadership, Business Service Owners, the NOC, DevOps & Others

● Deep Dives allow KPIs to be compared side-by-side across any time range, accelerating root cause analysis and significantly reducing MTTR

● Multi-KPI Alerts and Notable Events reduce alert noise, producing actionable events and a means to manage them

● … and it’s fun to build!

Page 40: Getting Started With Splunk It Service Intelligence

52

PLAY TIME IS OVER!Everyone out of the sandbox!

NOT! You can have your very own 7-day free eval sandbox, to continue playing:

● http://splunk.com/ITSI Then select:

And a Guidebook to help you explore ITSI’s capabilities:● https://splunk.box.com/ITSI-Sandbox-Guidebook

Page 41: Getting Started With Splunk It Service Intelligence

53

SEPT 26-29, 2016WALT DISNEY WORLD, ORLANDOSWAN AND DOLPHIN RESORTS

• 5000+ IT & Business Professionals• 3 days of technical content• 165+ sessions • 80+ Customer Speakers• 35+ Apps in Splunk Apps Showcase• 75+ Technology Partners• 1:1 networking: Ask The Experts and Security

Experts, Birds of a Feather and Chalk Talks• NEW hands-on labs! • Expanded show floor, Dashboards Control

Room & Clinic, and MORE!

The 7th Annual Splunk Worldwide Users’ Conference

PLUS Splunk University• Three days: Sept 24-26, 2016• Get Splunk Certified for FREE!• Get CPE credits for CISSP, CAP, SSCP• Save thousands on Splunk education!

Page 42: Getting Started With Splunk It Service Intelligence

A flying start to Service Intelligence

Start With A problem worth solving

Collaborate with Subject Matter Experts

Design Before Configuring

Page 43: Getting Started With Splunk It Service Intelligence

Sign Up Here - We’re Here To Help!Harness the creativity and domain knowledge of your organization to unlock

the value of data and solve an important service problem through a joint service intelligence workshop with key stakeholders

Define methods for:• Proactive service monitoring• Reduced risk and failures• Faster issue resolution• Increased business

performance

What is it? • 1 Day Onsite Workshop• Tightly linked with value• Collaborative approach• Build your own Splunk

ITSI Glass Table……

Page 44: Getting Started With Splunk It Service Intelligence

Copyright © 2015 Splunk Inc.

Thank You

BurchSystems Engineer, IT Operations Analytics SME