Data Warehousing, Lecture-30: What can Data Mining do? Virtual University of Pakistan. Ahsan Abdullah, Assoc. Prof. & Head, Center for Agro-Informatics Research (www.nu.edu.pk/cairindex.asp), National University of Computers & Emerging Sciences, Islamabad. Email: [email protected]


Page 1:

Data Warehousing

Lecture-30: What can Data Mining do?

Virtual University of Pakistan

Ahsan Abdullah
Assoc. Prof. & Head

Center for Agro-Informatics Research
www.nu.edu.pk/cairindex.asp

National University of Computers & Emerging Sciences, Islamabad
Email: [email protected]

Page 2:

CLASSIFICATION

Page 3:

ESTIMATION

Page 4:

PREDICTION

Page 5:

MARKET BASKET ANALYSIS

Page 6:

MARKET BASKET ANALYSIS

98% of people who purchased items A and B also purchased item C.

[Figure: pictures of the purchased items]

Page 7:

Discovering Association Rules

TID   Items
1     Bread, Cola, Milk
2     Juice, Bread
3     Juice, Cola, Diaper, Milk
4     Juice, Bread, Diaper, Milk
5     Cola, Diaper, Milk

Rules:

{Milk} → {Cola}

{Diaper, Milk} → {Juice}
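The strength of the two rules above can be checked directly against the five transactions. The slide does not name any measures, so the sketch below assumes the two commonly used ones: support (the fraction of all transactions containing every item in the rule) and confidence (among transactions containing the left-hand side, the fraction that also contain the right-hand side). Each rule is read as "left-hand side implies right-hand side", and the function names are illustrative only.

```python
# Transactions copied from the slide's table (TID -> set of items).
transactions = {
    1: {"Bread", "Cola", "Milk"},
    2: {"Juice", "Bread"},
    3: {"Juice", "Cola", "Diaper", "Milk"},
    4: {"Juice", "Bread", "Diaper", "Milk"},
    5: {"Cola", "Diaper", "Milk"},
}


def count_containing(itemset):
    """Number of transactions that contain every item in itemset."""
    return sum(1 for items in transactions.values() if itemset <= items)


def support(antecedent, consequent):
    """Fraction of all transactions containing both sides of the rule."""
    return count_containing(antecedent | consequent) / len(transactions)


def confidence(antecedent, consequent):
    """Among transactions with the antecedent, the fraction that also have the consequent."""
    with_antecedent = count_containing(antecedent)
    if with_antecedent == 0:
        return 0.0  # the rule never applies, so report zero confidence
    return count_containing(antecedent | consequent) / with_antecedent


# The two rules listed on the slide, as (antecedent, consequent) pairs.
rules = [({"Milk"}, {"Cola"}), ({"Diaper", "Milk"}, {"Juice"})]

for antecedent, consequent in rules:
    print(
        sorted(antecedent), "->", sorted(consequent),
        f"support={support(antecedent, consequent):.2f}",
        f"confidence={confidence(antecedent, consequent):.2f}",
    )
```

On this table, {Milk} → {Cola} has support 0.60 and confidence 0.75, and {Diaper, Milk} → {Juice} has support 0.40 and confidence 0.67.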

Page 8:

CLUSTERING

The task of segmenting a heterogeneous population into a number of more homogeneous sub-groups, or clusters.

Page 9:

Examples of Clustering Applications

• Marketing:

• Insurance:

• Land use:

• Seismic studies:

Page 10:

Ambiguity in Clustering

How many clusters?

[Figure: the same set of points grouped as two clusters, four clusters, and six clusters]
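The ambiguity can be reproduced with a small sketch. The points and distance thresholds below are made up for illustration and are not taken from the lecture, and the grouping rule (link neighbouring points on a line whenever their gap is at most a chosen threshold, a one-dimensional form of single-linkage clustering) is only one of many possible methods. The same data yields six, four, or two clusters depending on how close "close" is taken to be.

```python
# Hypothetical 1-D data: six tight pairs, arranged so that nearby pairs
# merge into larger groups as the linking threshold grows.
points = [
    0.0, 0.2,      # pair A
    1.0, 1.2,      # pair B (0.8 away from A)
    3.2, 3.4,      # pair C (2.0 away from B)
    10.0, 10.2,    # pair D (6.6 away from C)
    12.2, 12.4,    # pair E (2.0 away from D)
    13.2, 13.4,    # pair F (0.8 away from E)
]


def count_clusters(values, max_gap):
    """Count groups formed when sorted neighbours at most max_gap apart are linked."""
    ordered = sorted(values)
    if not ordered:
        return 0
    clusters = 1
    for left, right in zip(ordered, ordered[1:]):
        if right - left > max_gap:  # a gap wider than the threshold starts a new cluster
            clusters += 1
    return clusters


# Illustrative thresholds: small, medium, and large notions of "close".
for max_gap in (0.5, 1.0, 3.0):
    print(f"max_gap={max_gap}: {count_clusters(points, max_gap)} clusters")
```

With these values the thresholds 0.5, 1.0, and 3.0 give six, four, and two clusters respectively, so none of the three answers is wrong without an additional criterion for choosing among them.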

Page 11:

DESCRIPTION

Page 12:

Comparing Methods

• Accuracy:

• Speed:

• Robustness:

• Scalability:

• Interpretability:

• Simplicity:

Page 13:

Where does Data Mining fit in?

Data

Preprocessing

• Selection
• Cleaning
• Transformation
• Feature Extraction

Data Mining

• Identify Patterns
• Generate Models

Interpretation/Evaluation

• Validation Tests
• Visualization

Knowledge

Data Mining is one step of Knowledge Discovery in Databases (KDD)
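As a rough illustration of how the stages hand results to one another, the sketch below runs toy data through preprocessing, data mining, and evaluation. Everything in it is an assumption made for illustration: the raw records, the function names, the choice of co-occurring item pairs as the "pattern", and the support threshold used as the validation test.

```python
from collections import Counter
from itertools import combinations

# Hypothetical raw data: one comma-separated string of items per record.
raw_records = [" Milk, cola ", "bread,MILK,cola", None, "juice, bread", "cola, milk, diaper"]


def preprocess(records):
    """Preprocessing: drop missing records and normalise item names."""
    baskets = []
    for record in records:
        if record is None:  # cleaning: skip missing records
            continue
        # transformation: trim whitespace and lower-case every item name
        baskets.append(frozenset(item.strip().lower() for item in record.split(",")))
    return baskets


def mine(baskets):
    """Data mining: count how often each pair of items occurs together."""
    pair_counts = Counter()
    for basket in baskets:
        for pair in combinations(sorted(basket), 2):
            pair_counts[pair] += 1
    return pair_counts


def evaluate(pair_counts, n_baskets, min_support):
    """Interpretation/evaluation: keep pairs whose support meets the threshold."""
    return {
        pair: count / n_baskets
        for pair, count in pair_counts.items()
        if count / n_baskets >= min_support
    }


baskets = preprocess(raw_records)
patterns = mine(baskets)
knowledge = evaluate(patterns, len(baskets), min_support=0.75)
print(knowledge)
```

Only the middle function does the mining; on this toy input the single pattern that survives evaluation is the pair cola and milk, which appears in three of the four cleaned records.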