topic 10 (bayesian classifiers)

28
CSE 473: Digital Image CSE 473: Digital Image Processing and Pattern Processing and Pattern Recognition Recognition Spring 2015 Spring 2015 Course Teacher: Course Teacher: Md. Tarek Habib Md. Tarek Habib Assistant Professor Assistant Professor Department of Computer Science and Department of Computer Science and Engineering Engineering Green University of Bangladesh Green University of Bangladesh Topic – 10: Topic – 10: Bayesian Bayesian Classifiers Classifiers

Upload: arch-sidz

Post on 07-Aug-2015

33 views

Category:

Education


0 download

TRANSCRIPT

CSE 473: Digital Image Processing CSE 473: Digital Image Processing and Pattern Recognitionand Pattern Recognition

Spring 2015Spring 2015

Course Teacher:Course Teacher:Md. Tarek HabibMd. Tarek HabibAssistant ProfessorAssistant Professor

Department of Computer Science and Department of Computer Science and EngineeringEngineering

Green University of BangladeshGreen University of Bangladesh

Topic – 10: Topic – 10: Bayesian ClassifiersBayesian Classifiers

Classification TechniquesClassification Techniques Bayesian ClassifiersBayesian Classifiers

Bayes TheoremBayes Theorem Using the Bayes Theorem for Using the Bayes Theorem for ClassificationClassification Naïve Bayes ClassifierNaïve Bayes Classifier

2

Lecture OutlineLecture Outline

Md. Tarek HabibMd. Tarek Habib

Classification Techniques

A classification technique (or classifier) is

a systematic approach to building classification

models from an input data set. Each technique employs a learning algorithm

to identify a model that best fits the relationship

between the attribute set and class label of the

input data. The model generated by a learning algorithm

should both fit the input data well and correctly

predict the class labels of records it has never

seen before.

3

Md. Tarek HabibMd. Tarek Habib

Classification Techniques

Figure 4.3 shows a general approach for

solving classification problems.

4

Md. Tarek HabibMd. Tarek Habib

Classification Techniques

5

Md. Tarek HabibMd. Tarek Habib

Classification Techniques

First, a training set consisting of records

whose class labels are known must be provided. The training set is used to build a classification

model, which is subsequently applied to the test

set, which consists of records with unknown

class labels. Evaluation of the performance of a

classification model is based on the counts of

test records correctly and incorrectly predicted

by the model. A single number would make it more

convenient to compare the performance of

different models.

6

Md. Tarek HabibMd. Tarek Habib

Classification Techniques

This can be done using a performance

metric such as accuracy, which is defined as

follows:

Equivalently, the performance of a model can

be expressed in terms o f its error rate, which is

given by the following equation:

7

Md. Tarek HabibMd. Tarek Habib

Classification Techniques

Most classification algorithms seek models

that attain the highest accuracy, or equivalently,

the lowest error rate when applied to the test

set.

8

Md. Tarek HabibMd. Tarek Habib

Classification Techniques

Typical examples of classification

techniques or classifiers are: Bayesian classifiers Nearest-neighbor classifiers Decision tree classifiers Rule-based classifiers Neural networks Support vector machines

9

Md. Tarek HabibMd. Tarek Habib

Bayesian Classifiers

Consider the task of predicting whether a

person is at risk for heart disease based on the

person’s diet and workout frequency. Although most people who eat healthily and

exercise regularly have less chance of

developing heart disease, they may still do so be

cause of other factors such as heredity,

excessive smoking, and alcohol abuse. This section presents an approach for

modeling probabilistic relationships between the

attribute set and the class variable.10

Md. Tarek HabibMd. Tarek Habib

Bayes Theorem

Consider a football game between two rival

teams: Team 0 and Team 1. Suppose Team 0 wins

65% o f the time and Team 1 wins the remaining

matches. Among the games won by Team 0, only

30% of them come from playing on Team 1 ’s

football field. On the other hand, 75% of the

victories for Team 1 are obtained while playing at

home. If Team 1 is to host the next match between

the two teams, which team will most likely emerge

as the winner? This question can be answered by using the

well-known Bayes theorem.

11

Md. Tarek HabibMd. Tarek Habib

Bayes Theorem

12Md. Tarek HabibMd. Tarek Habib

Bayes Theorem

13

Md. Tarek HabibMd. Tarek Habib

Bayes Theorem

14

Md. Tarek HabibMd. Tarek Habib

Using the Bayes Theorem for Classification

15

Md. Tarek HabibMd. Tarek Habib

Let X denote the attribute set and Y denote

the class variable. We can treat X and Y as random variables. P(Y|X) is also known as the posterior

probability for Y, as opposed to its prior

probability, P(Y).

Using the Bayes Theorem for Classification

16

Md. Tarek HabibMd. Tarek Habib

Using the Bayes Theorem for Classification

17

Md. Tarek HabibMd. Tarek Habib

Using the Bayes Theorem for Classification

18

Md. Tarek HabibMd. Tarek Habib

Using the Bayes Theorem for Classification

19

Md. Tarek HabibMd. Tarek Habib

Using the Bayes Theorem for Classification

20Md. Tarek HabibMd. Tarek Habib

Naïve Bayes ClassifierNaïve Bayes Classifier

21

Md. Tarek HabibMd. Tarek Habib

Naïve Bayes ClassifierNaïve Bayes Classifier

22

Md. Tarek HabibMd. Tarek Habib

Naïve Bayes ClassifierNaïve Bayes Classifier

23

Md. Tarek HabibMd. Tarek Habib

Naïve Bayes ClassifierNaïve Bayes Classifier

24

Md. Tarek HabibMd. Tarek Habib

Naïve Bayes ClassifierNaïve Bayes Classifier

25

Md. Tarek HabibMd. Tarek Habib

Naïve Bayes ClassifierNaïve Bayes Classifier

26

Md. Tarek HabibMd. Tarek Habib

Naïve Bayes ClassifierNaïve Bayes Classifier

27

Md. Tarek HabibMd. Tarek Habib

Naïve Bayes ClassifierNaïve Bayes Classifier

28

Md. Tarek HabibMd. Tarek Habib