gotcha! network analytics to augment fraud detection · gotcha! network analytics to augment fraud...

26
Copyright © SAS Institute Inc. All rights reserved. Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique Van Vlasselaer Véronique Van Vlasselaer Véronique Van Vlasselaer Véronique Van Vlasselaer SAS Pre-Sales AnalyticalConsultant December 4th, 2018

Upload: others

Post on 05-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Gotcha! Network Analytics to augment

Fraud DetectionBig Data in the Food Chain: the un(der)explored goldmine?

Author: Véronique Van VlasselaerVéronique Van VlasselaerVéronique Van VlasselaerVéronique Van Vlasselaer

SAS Pre-Sales Analytical Consultant

December 4th, 2018

Page 2: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

IntroductionFraud Analytics Using Descriptive, Predictive and Social Network Techniques:

A Guide to Data Science for Fraud Detection

Page 3: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network Analytics! Say what?!

• Main analytical question in fraud:

Given the current network, who shall be the next one that commits fraud?

Page 4: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network Analytics! Say what?!

• Traditional approach in an fraud context:

• Finding descriptive patterns (e.g. multivariate outliers) or predictive patterns(e.g. predictive analytics) in massive amounts of structured data

Page 5: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network Analytics! Say what?!

• Traditional approach in an fraud context:

• Finding descriptive patterns (e.g. multivariate outliers) or predictive patterns(e.g. predictive analytics) in massive amounts of structured data

Multivariate Outlier DetectionMultivariate Outlier DetectionMultivariate Outlier DetectionMultivariate Outlier Detection Predictive AnalyticsPredictive AnalyticsPredictive AnalyticsPredictive Analytics

Page 6: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network Analytics! Say what?!

• State-of-the-art insights grounded in social sciences:

• Fraud is “socially” contagious.

- If Bart and Peter are both fraudsters, and Véronique is friends of Bart and Peter, what would you expect of Véronique’s behavior?

• Extension of traditional detection approaches by including social interactions among fraudsters (and other people).

• Data issue: networked data is unstructured.

Page 7: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network Analytics! Say what?!Credit Card Transaction Fraud

Page 8: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network Analytics! Say what?!Social Security Fraud

Page 9: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network Analytics! Say what?!

• Networked data? Where to find?

• Much more than data on social media channels.

• Call behavior data

• Review data

• Transactional data

• Employee data

• Financial data

• Sales data (e.g. Ebay)

• ...

Page 10: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network Analytics! Say what?!

• Networked data? Where to find?

• International agro-food trade network

- Network of food suppliers and nations

- Detection of faulty food production

- Impact of food contamination

- How to quickly shortcut a potential safety breach?

• Food supply network

- Network of raw material suppliers, food processors andretail

• Chemical networks

- Network of OTU’s

Page 11: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network Analytics! Say what?!

• Main analytical question in fraud:

Given the current network, who shall be the next one that commits fraud?

• Main analytical solutions

• Featurization of the network

• Collective inference algorithms (incl. behavioral propagation)

Page 12: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Featurization

Page 13: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network AnalysisFeaturization

• Featurization is the process in which the unstructured network is transformed to a structured form

unstructured dataunstructured dataunstructured dataunstructured data structured datastructured datastructured datastructured data predictive modelpredictive modelpredictive modelpredictive model

Page 14: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network AnalysisFeaturization

• Featurization is the process in which the unstructured network is transformed to a structured form

unstructured dataunstructured dataunstructured dataunstructured data structured datastructured datastructured datastructured data predictive modelpredictive modelpredictive modelpredictive model

FEATURIZATIONFEATURIZATIONFEATURIZATIONFEATURIZATION

Page 15: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network AnalyticsNetwork Representation

• Sociogram:

• Matrix representation:

Page 16: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

• Network featurization processNetwork featurization processNetwork featurization processNetwork featurization process based on

• the first-order neighborhood or egonet of each entity

- How many churners/fraudsters/adopters are connected to node (i.e., degree)?

- Density of the egonet?

- Number of suppliers/addresses/customers from a black list in the egonet?

- Velocity of the network (time-based network analysis)?

- …

• the n-order neighborhood

- Betweenness, closeness, community detection…

Network AnalyticsFeature Engineering

Page 17: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

• Network featurization processNetwork featurization processNetwork featurization processNetwork featurization process examples:

• the n-order neighborhood

Network AnalyticsFeature Engineering

BetweennessBetweennessBetweennessBetweenness ClosenessClosenessClosenessCloseness

Page 18: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Network AnalysisFeaturization

Page 19: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Collective Inference Algorithms

Page 20: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Collective Inference Algorithms

• Collective inference algorithms

• The label of a node is said to dependent on the labels of the neighboring nodes.

• Chicken-egg problem:

- The label of node A depends on the label of node B, and

- The label of node B depends on the label of node A.

• In general: iterative procedure with random ordering

Page 21: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Collective Inference Algorithms

RULE: RULE: RULE: RULE: IF MORE THAN HALF OF IF MORE THAN HALF OF IF MORE THAN HALF OF IF MORE THAN HALF OF NEIGHBORS IS FRAUDULENT, NEIGHBORS IS FRAUDULENT, NEIGHBORS IS FRAUDULENT, NEIGHBORS IS FRAUDULENT, NODE IS FRAUDULENTNODE IS FRAUDULENTNODE IS FRAUDULENTNODE IS FRAUDULENT

Page 22: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Collective Inference Algorithms

• Influence propagation through the network

• E.g. Gotcha!, based on Google’s famous PageRank algorithm

Page 23: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Collective Inference Algorithms

• Influence propagation through the network

• E.g. Gotcha!, based on Google’s famous PageRank algorithm

Page 24: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Conclusion: Fraud DetectionA Hybrid Approach

Page 25: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Hybrid Approach for Detection

Capability

Manual Detection

Rules

Predictive Models

Fraud Network

Analysis

Anomaly Detection

Va

lue

HYBRID ANALYTICAL METHODSHYBRID ANALYTICAL METHODS

Page 26: Gotcha! Network Analytics to augment Fraud Detection · Gotcha! Network Analytics to augment Fraud Detection Big Data in the Food Chain: the un(der)explored goldmine? Author: Véronique

Copyright © SAS Inst itute Inc. A l l r ights reserved.

Questions? Feedback? [email protected]