anomaly detection

8
Anomaly Detection - SALIL NAVGIRE

Upload: salil-navgire

Post on 21-Nov-2014

458 views

Category:

Technology


7 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Anomaly Detection

Anomaly Detection

- SALIL NAVGIRE

Page 2: Anomaly Detection

Introduction• problem of finding patterns in data that

do not conform to expected behavior

• covers diverse disciplines from statistics, machine learning, data mining, information theory, spectral theory

Page 3: Anomaly Detection

Applications• Intrusion detection- detection of malicious activity• Host based – OS call traces

• Network based – packet level traces

• Fraud detection - detection of criminal activities in commercial organizations• Credit card fraud detection

• Insurance Claim Fraud Detection

• Insider trading detection

• Industrial damage detection

• Anomaly detection in data

• Anomaly detection in sensor networks

Page 4: Anomaly Detection

Challenges• Defining normal region

• Sometimes malicious agent adapt themselves to appear as normal observation

• Different techniques for different application domain

• Availability of labeled data for training

• Sometimes noise is similar to anomaly and difficult to distinguish

Page 5: Anomaly Detection

Different aspects of detection techniques• Nature of input data

• Types of Anomaly• Point Anomalies

• Contextual Anomalies

• Collective Anomalies

• Data Labels• Supervised anomaly detection

• Semi-Supervised anomaly detection

• Unsupervised anomaly detection

• Output • Scores

• Labels

Page 6: Anomaly Detection

Anomaly detection

techniques

Classification

Nearest Neighbor Clustering Spectral Information

theoretic Statistical Time Series

Anomaly Detection Techniques

Page 7: Anomaly Detection

• Classification• Neural network based

• Bayesian Network based

• Support Vector Machine based

• Rule based

• Nearest Neighbor• KNN

• Relative density

• Clustering• K means

• SOM

Page 8: Anomaly Detection

• Statistical• Parametric• Gaussian model based• Regression model based• Mixture of parametric distributions based

• Non-parametric• Histogram based• Kernel function based

• Spectral• Dimensionality reduction