modern network monitoring for the rest of us - ausnog · modern network monitoring for the rest of...

18
CC-BY AusNOG 2017 Modern Network Monitoring for the Rest of Us AusNOG 2017 Tim Raphael 1

Upload: dangtram

Post on 19-Apr-2018

215 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Modern Network Monitoring for

the Rest of Us

AusNOG 2017Tim Raphael

1

Page 2: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Intro•Why?

• The current state, emerging trends and technologies

• Barriers for smaller ISPs and enterprises

• An effective monitoring strategy

• How?2

Page 3: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Why?

• React to failure situations

• Capacity and service utilisation analysis

• Discover new and emerging trends

• Drive continual improvement

3

Page 4: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Current State

• Off-the-shelf software and its limitations

• Larger and more innovative organisations tending to build their own due to scale and complexity

• Differences between Silicon Valley-like organisations and “the rest of us”

4

Page 5: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Technologies

• Push vs Pull

• Time-Series Data

• Event Stream Processing

5

Page 6: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

DeviceCollector

DeviceCollector

Request (50ms)

Response (50ms)

Pushed (50ms)

Push vs Pull

6

Page 7: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Time Series

7

Page 8: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Event Stream Processing

Source Aggregation Threshold Alert

8

Page 9: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Limitations on the rest of us

• Time / $

• On-device computing power

• Vendor feature support

• Previous generation monitoring interfaces

9

Page 10: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Monitoring for the rest of us

• Top-down approach

• Business goals measured by metrics

• Focus on alert conditions to reduce noise and improve impact

• Continual improvement

10

Page 11: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

if out.bits.rate > 90% AND out.discards.rate > 0:

alert: high priority:

else if out.bits.rate > 85%: alert: low priority

Page 12: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Services Affected

Identify Cause

Implement Change& Confirm

Re-evaluateMonitoring Strategy

Continual Improvement

12

Page 13: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Monitoring for the rest of us

• Use technology available to you

• Design for resilience

• Put collectors close to the data source

• Use buffers

• Store raw values

• Meta Monitoring

13

Page 14: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Monitoring for the rest of us

Use technology available to you.

14

Page 15: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Monitoring for the rest of usDesign for Resilience

Device Collector

Device Collector

Central Datastore

15

Page 16: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Monitoring for the rest of usMeta Monitoring is important

DeviceCollector

Device Collector

16

Page 17: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Software

• Storage: • InfluxDB, Graphite, Prometheus

• Collection: • Telegraf, Collectd, Sensu, Statd, SnmpCollector

• Alert Evaluation: • Riemann, Kapacitor, Sensu

• Display: • Grafana, Chronograf

17

Page 18: Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of Us ... Push vs Pull 6. CC-BY AusNOG 2017 Time Series 7. ... Monitoring Strategy

CC-BY AusNOG 2017

Questions

@timraphael215 linkedin.com/in/timraphael215/ www.timraphael.com

18