data warehousing lecture-30 what can data mining do? virtual university of pakistan ahsan abdullah...
TRANSCRIPT
Data Warehousing
Lecture-30What can Data Mining do?
Virtual University of PakistanVirtual University of Pakistan
Ahsan AbdullahAssoc. Prof. & Head
Center for Agro-Informatics Researchwww.nu.edu.pk/cairindex.asp
National University of Computers & Emerging Sciences, IslamabadEmail: [email protected]
CLASSIFICATION
ESTIMATION
PREDICTION
MARKET BASKET ANALYSIS
98% of people who purchased items A and B also purchased item C
AY
B
Ask graphics to replace pictures of items with similar pictures
MARKET BASKET ANALYSIS
Discovering Association Rules
Cola, Diaper, Milk5
Juice, Bread, Diaper, Milk4
Juice, Cola, Diaper, Milk3
Juice, Bread2
Bread, Cola, Milk1
ItemsTID
Rules:
{Milk} {Cola}
{Diaper, Milk} {Juice}
Task of segmenting a heterogeneous population into a number of more homogenous sub-groups or clusters.
CLUSTERING
Examples of Clustering Applications• Marketing:
• Insurance:
• Land use:
• Seismic studies:
Ambiguity in Clustering
How many clusters?Two clustersTwo clustersFour clustersFour clustersSix clustersSix clusters
DESCRIPTION
Comparing Methods
• Accuracy:
• Speed:
• Robustness:
• Scalability:
• Interpretability:
• Simplicity:
Where does Data Mining fits in?
Data
Preprocessing
• Selection• Cleaning• Transformation• Feature Extraction
KnowledgeKnowledge
Data Mining
• Identify Patterns• Generate Models
Interpretation/Evaluation
• Validation Tests• Visualization
Data Mining is one step of Knowledge Discovery in Databases (KDD)