keeping network monitoring current using automated nagios configurations (wip) greg wickham apan...

34
Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Upload: brittany-casey

Post on 01-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Keeping Network Monitoring Current using Automated Nagios Configurations (WIP)

Greg Wickham

APANJuly 2005

Page 2: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Is the network being monitored correctly?

Greg Wickham

APANJuly 2005

Page 3: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Contents

• Background• Monitoring Overview / Requirements• Solution Architecture• Monitoring Verification• Conclusion

Page 4: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Contents

• Background• Monitoring Overview / Requirements• Solution Architecture• Monitoring Verification• Conclusion

Page 5: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

GrangeNet Architecture

Page 6: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

GrangeNet Monitoring

Device Types Quantity

Routers 6

Servers 6

Switches 4

16

Page 7: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

GrangeNet Monitoring

Device Types Quantity Probes

Routers 6 310

Servers 6 6

Switches 4 7

16 323

Page 8: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

GrangeNet Monitoring

Device Types Quantity Probes

Routers 6 310

Servers 6 6

Switches 4 7

16 323

Nagios Lines (services.cfg) 3172

Page 9: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

GrangeNet Monitoring (ACT Edge)

Probe Types Quantity Notes

Fan 3Hardware 17Ping 1Power 2Temperature 1 (39)Interfaces 16MSDP Peerings 8BGP Peerings 15OSPF 2

Total Probes: 65

Page 10: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

GrangeNet Monitoring (ACT Edge)

• Is that everything that can be monitored?

Page 11: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

GrangeNet Monitoring (ACT Edge)

• Is that everything that can be monitored?

No!

Page 12: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

GrangeNet Monitoring (ACT Edge)

• Is that everything that can be monitored?

No!• What else?

– BGP address family peerings• Multicast / Unicast / IPv6

– Software versions– Hardware versions– Latency (of links)– Usage (of links)– …

Page 13: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Contents

• Background• Monitoring Overview / Requirements• Solution Architecture• Monitoring Verification• Conclusion

Page 14: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Monitoring Solution

• Solution Goals:–Verifying network is correctly monitored–Minimise replication of data–Simplistic integration with existing systems–Easy to maintain–Extensible–Flexible–Efficient

Page 15: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Monitoring Overview

• Facts:–Networks change–Updating is tedious–Monitoring Difficult to Auditing

• Answers Required:–Is the network performing optimally?–Has a change occurred?–What is the status of the network?–Is the monitoring accurate?

Page 16: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Contents

• Background• Monitoring Overview / Requirements• Solution Architecture• Monitoring Verification• Conclusion

Page 17: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Solution Architecture

MonitoringConfiguration

•Configuration data stored as XML•Describes:

•Devices to monitor•How to monitor•Nagios templates•Device Templates

Page 18: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Solution Architecture

MonitoringConfiguration

MonitoringDaemon

•Daemon reads configuration data•Verifies devices are monitored correctly•Generates Nagios Configurations•Performs device probes•Runs periodically

Page 19: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Solution Architecture

MonitoringConfiguration

MonitoringDaemon

NagiosConfiguration

Nagios configurationautomatically generatedby Monitoring Daemon

Page 20: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Solution Architecture

MonitoringConfiguration

MonitoringDaemon

NagiosConfiguration

NagiosDaemon

Nagios uses configuration supplied bymonitoring daemon;Nagios configured to use ‘passive’ checks

Page 21: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Solution Architecture

MonitoringConfiguration

MonitoringDaemon

NagiosConfiguration

NagiosDaemon

Network Devices

Monitoring daemon queries all devicesusing SNMP;Check device telemetry againstknown configurations

Page 22: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Solution Architecture

MonitoringConfiguration

MonitoringDaemon

NagiosConfiguration

NagiosDaemon

Network Devices

Monitoring daemon sendsProbe status direct to Nagios(Nagios running passive checks)

Page 23: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Solution Architecture

MonitoringConfiguration

MonitoringDaemon

NagiosConfiguration

NagiosDaemon

Network Devices

eMail

SM

S

Web

Nagios reports on networkhealth as usual but does noactive checking of its own

Page 24: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Solution Architecture

MonitoringConfiguration

MonitoringDaemon

NagiosConfiguration

NagiosDaemon

Network Devices

eMail

SM

S

Web

Rep

ort

Report generated of devicemonitoring comparison

Page 25: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Solution Architecture

MonitoringConfiguration

MonitoringDaemon

NagiosConfiguration

NagiosDaemon

Network Devices

RRDtool

eMail

SM

S

Web

Rep

ort

Collected data fed tooptional sub-systems

Page 26: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Solution Architecture

• Result–Only one process communicates to all devices

Very EfficientQuery time for 34 devices is < 10 seconds

–As only one daemon communicates to the devices the load on each network device is minimised (collected data is distributed as necessary)

–As Nagios does less work the monitoring server is less loaded (Nagios is heavy)

Page 27: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Contents

• Background• Monitoring Overview / Requirements• Solution Architecture• Monitoring Verification• Conclusion

Page 28: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Monitoring Verification

• Templates are used to define pre-requisite monitoring probes

• Devices are attached to templates

Page 29: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Monitoring Verification

Device Description<device> <alias>edge1.vic</alias> <address>202.0.98.68</address> … <module type="nagios"> <template>ibgp-mesh</template> <template>ebgp-peerings</template> <template>ospf</template> <template>system</template> … <probe type=“ibgp-mesh" description="AS18062 - edge1.nsw“ arg=“202.0.98.13” /> <probe type=“ebgp-peering" description="AS64670“ arg=“202.0.98.190” />

… </module>

</device>

Page 30: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Monitoring Verification

Template Description

<template name=“ibgp-mesh"> <template>system-health</template>

<probe name=“ibgp-mesh" inheirit="bgp-standard"> <attribute type="field">bgpPeerState</attribute> <attribute type="notify">gn-noc</attribute> <attribute type="level">level1-service</attribute> <match> <field name="bgpPeerRemoteAs" value="18062" /> <field name="bgpPeerState" value="up" /> </match> </probe>

</template>

Page 31: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Monitoring Verification

• From the:–Device template; and–Monitoring Template

an accurate report can be generated of the status of monitoring.

• All probe details are stored in XML so can be easily verified

Page 32: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Contents

• Background• Monitoring Overview / Requirements• Solution Architecture• Monitoring Verification• Conclusion

Page 33: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Conclusion

• Due to efficiencies in the monitoring daemon:–Nagios doesn’t load the server–Other applications can share the SNMP data–Doesn’t load the network devices–Device probing is very quick

• Reduces complexity of Nagios configuration• Generate reports identifying inaccuracies in existing monitoring

• Unified configuration data

This is a Work in Progress

Page 34: Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005

Status (Work in Progress)

• Current functionality:– Separate applications:

• Collecting data from devices; feed into Nagios• Generating Nagios configurations

• To Do– Integrate applications– Complete Implementation Nagios templates– Documentation!

• Software– Perl– net-snmp– Nagios