Five causes of alert fatigue -- and how to prevent them
DESCRIPTION
“Alert Spam” is a major recurring pain brought up by Ops teams: the constant flood of noisy alerts from your monitoring stack. This presentation discusses five types of spammy alerts that we hear about most often (and how we’d like to see them resolved). Most of them will sound familiar to you.

TRANSCRIPT
Alert Fatigue - and what to do about it
Elik Eizenberg, VP R&D
http://www.bigpanda.io
alert fatigue (noun)
A constant flood of noisy, non-actionable alerts, generated by your monitoring stack.
Synonyms: alert overload, alert spam
Poor Signal-to-Noise Ratio
Delayed Response
Wrong Prioritization
Constant Context Switching
Common Pitfalls
Alert Per Host

What you see: 20 critical Nagios / Zabbix alerts, all at once.

What happened:
- Unexpected traffic to your app
- You get an alert from practically every host in the cluster

In an ideal world:
- 1 alert, indicating 80% of the cluster has problems
- Don’t wake me up unless at least some % of the cluster is down
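The "don't page me until some % of the cluster is down" idea can be sketched in a few lines. This is a minimal illustration, not any real monitoring API; the function name and threshold are made up for the example:

```python
# Hypothetical sketch: collapse per-host alerts into one cluster-level alert.

def cluster_alert(alerting_hosts, cluster_size, threshold=0.5):
    """Return one summary alert only when the fraction of alerting
    hosts crosses the threshold; otherwise stay quiet."""
    fraction = len(alerting_hosts) / cluster_size
    if fraction >= threshold:
        return (f"{fraction:.0%} of cluster has problems "
                f"({len(alerting_hosts)}/{cluster_size} hosts)")
    return None  # below threshold: suppress, don't wake anyone up

# 16 of 20 hosts firing -> one aggregated alert; 2 of 20 -> silence
print(cluster_alert([f"web{i}" for i in range(16)], 20))
print(cluster_alert(["web1", "web2"], 20))
```

The key design choice is that the page-worthy unit is the cluster, not the host: 20 identical host alerts carry no more information than one alert saying "80% of the cluster has problems".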
Important != Urgent

What you see: Low disk space alert on a MongoDB host.

What happened:
- DB disk is slowly filling up, as expected
- Will become urgent in a few weeks

In an ideal world:
- No need for an alert at all!
- Automatically issue a Jira ticket and assign it to me
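One way to separate important from urgent is to project how long until the problem actually bites, and only page when that horizon is short. The sketch below routes everything else to a ticket queue (the actual Jira call is left out; the function and its parameters are illustrative assumptions, not a real client):

```python
# Sketch (assumed names): page only when the disk will fill soon;
# otherwise file a ticket instead of waking someone up.

def route_disk_alert(free_gb, fill_rate_gb_per_day, urgent_within_days=3):
    """Project days until the disk is full and pick a routing action."""
    days_left = free_gb / fill_rate_gb_per_day
    if days_left <= urgent_within_days:
        return ("page", days_left)    # truly urgent: wake someone up
    return ("ticket", days_left)      # important, not urgent: ticket it

action, days = route_disk_alert(free_gb=120, fill_rate_gb_per_day=4)
print(action, round(days))  # weeks away -> a ticket will do
```

The point is that "important" describes the problem, while "urgent" describes the deadline; only the latter should control whether a human is interrupted right now.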
Non-Adaptive Thresholds

What you see: The same high-load alerts, every Monday after lunch.

What happened:
- Monday is busy by definition
- You can’t use the same thresholds every day

In an ideal world:
- Dynamically update your thresholds
- Or focus only on anomalies (e.g. etsy/skyline)
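A simple form of a dynamic threshold (far cruder than skyline's anomaly detection, but it shows the idea) is to judge each weekday against its own history rather than a single static limit. All names here are illustrative:

```python
# Sketch: derive the threshold from the same weekday/hour slot's own
# history (mean + k * stddev), so a busy Monday is compared to past
# Mondays, not to a global static limit.
from statistics import mean, stdev

def adaptive_threshold(history, k=3.0):
    """history: load samples from the same weekday/hour slot."""
    return mean(history) + k * stdev(history)

monday_loads = [70, 75, 72, 78, 74]        # past Mondays after lunch
threshold = adaptive_threshold(monday_loads)
print(76 > threshold)   # a load of 76 is normal for a Monday: no alert
```

A static threshold of, say, 60 would have fired every Monday; the per-slot baseline stays quiet until the load is unusual *for that slot*.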
Same Issue, Different System

What you see: Incoming alerts from Nagios, Pingdom, NewRelic, Keynote & Splunk…

What happened:
- Data corruption in a couple of Mongo nodes
- Resulting in heavy disk IO and some transaction errors
- This kind of error manifests itself at the server, application & user levels

In an ideal world:
- Auto-correlate highly related alerts from different systems
- Show me one high-level incident, instead of low-level alerts
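The simplest correlation signal is time: alerts from different monitoring systems that fire within minutes of each other usually describe one underlying incident. The sketch below groups purely by time window; real correlation engines also use topology, host names, and text similarity. All names and data here are made up for illustration:

```python
# Sketch: naive time-window correlation -- alerts arriving within
# `window` seconds of the previous alert join the same incident.

def correlate(alerts, window=300):
    """alerts: list of (timestamp_sec, source, message), any order."""
    incidents, current = [], []
    for ts, source, msg in sorted(alerts):
        if current and ts - current[-1][0] > window:
            incidents.append(current)   # gap too large: close the incident
            current = []
        current.append((ts, source, msg))
    if current:
        incidents.append(current)
    return incidents

alerts = [
    (1000, "Nagios",   "heavy disk IO on mongo-2"),
    (1060, "NewRelic", "transaction errors in checkout"),
    (1100, "Pingdom",  "www slow response"),
    (9000, "Splunk",   "log volume spike"),   # unrelated, hours later
]
print(len(correlate(alerts)))   # 2 incidents instead of 4 raw alerts
```

The on-call engineer then triages two incidents, one of which already bundles the server-, application- and user-level symptoms of the same root cause.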
Transient Alerts

What you see: An issue pops up for a couple of minutes, then disappears.

What happened:
- Maybe a cronjob over-utilizes the network
- Or a random race condition in the app
- Or a rarely-used product feature that causes the backend to crash

In an ideal world:
- No need for an alert every time it happens
- Give me a monthly report of common short-lived alerts
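One common way to get both behaviors (no page per blip, but a periodic report) is to require a condition to persist for N consecutive checks before firing, and to tally anything shorter for the report. This is a minimal sketch with made-up names, not a real alerting framework:

```python
# Sketch: suppress transient alerts by requiring the condition to
# persist for `persist` consecutive checks; shorter blips are only
# counted toward a periodic report.
from collections import Counter

class TransientFilter:
    def __init__(self, persist=3):
        self.persist = persist
        self.streaks = Counter()   # consecutive failing checks per alert
        self.report = Counter()    # short-lived blips, for the report

    def check(self, name, failing):
        """Return True only when the alert should actually fire."""
        if failing:
            self.streaks[name] += 1
            return self.streaks[name] >= self.persist
        if 0 < self.streaks[name] < self.persist:
            self.report[name] += 1          # it was just a blip
        self.streaks[name] = 0
        return False

f = TransientFilter(persist=3)
print([f.check("cron-net", s) for s in (True, True, False)])
print(f.report["cron-net"])   # the blip is recorded, nobody was paged
```

A chronic blip (say, that cronjob saturating the network every night) then shows up as a high count in the monthly report, which is the right place to notice it.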
Give us a try - http://www.bigpanda.io
http://twitter.com/bigpanda
Thanks for listening!