pr_unit-i

28
12/31/2010 SAM_PR_UNIT -I 1  What is Pattern Recognition? Pattern: a description of an object.  A pattern class (or category) is a set of patterns sharing common attributes and usually originating from the same source Recognition : classifying an object to a pattern class. PR is the science that concerns the description or classific ation (r ecognition) of measurements. PR techniques are an important component of intelligent systems and are used for  Decision making Object & pattern classific ation Data preprocessing UNIT-I INTRODUCTION

Upload: joyce-george

Post on 04-Apr-2018

224 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 1/28

12/31/2010 SAM_PR_UNIT-I 1

  What is Pattern Recognition? Pattern: a description of an object.

 A pattern class (or category) is a set of patternssharing common attributes and usually originatingfrom the same source

Recognition: classifying an object to a patternclass.

PR is the science that concerns the description or classification (recognition) of measurements.

PR techniques are an important component of 

intelligent systems and are used for  Decision making

Object & pattern classification

Data preprocessing

UNIT-I INTRODUCTION

Page 2: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 2/28

12/31/2010 SAM_PR_UNIT-I 2

 PR applications:    Image Preprocessing, Segmentation, and

 Analysis

Computer Vision

Radar signal classification/analysis

Face recognition

Speech recognition/understanding

Fingerprint identification

Character recognition

Handwriting analysis

Electrocardiography signal

analysis/understanding

Medical diagnosis 

INTRODUCTION

Page 3: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 3/28

12/31/2010 SAM_PR_UNIT-I 3

  Data mining

Image databases

Seismic analysis

 Commercial machines that canrecognize patterns:    Automated speech recognition

Fingerprint identification

Optical character recognition DNA sequence identification

Blood cells

Printed text 

INTRODUCTION

Page 4: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 4/28

12/31/2010 SAM_PR_UNIT-I 4

Machine Perception During recognition (or classification) given objects are

assigned to prescribed classes.

 A classifier is a machine which performs classification

Build a machine that can recognize patterns:

Speech recognition

Fingerprint identification

OCR (Optical Character Recognition)

DNA sequence identification

Page 5: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 5/28

12/31/2010 SAM_PR_UNIT-I 5

PR Example “Sorting incoming Fish on a conveyor according to

species using optical sensing” 

Salmon or Sea Bass?

Sea bass

SpeciesSalmon

Page 6: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 6/28

12/31/2010 SAM_PR_UNIT-I 6

Problem Analysis 

•  Automate the process of sorting incoming fishon a conveyor belt according to species(Salmon or Sea bass).    Set up a camera

Take some sample images

Note the physical differences between the twotypes of fish

Length

Lightness

Width

No. & shape of fins

Position of the mouth

Page 7: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 7/2812/31/2010 SAM_PR_UNIT-I 7

Solution by Stages Preprocess raw data from camera

Segment isolated fish

Extract features from each fish (length, width,brightness, etc.)

Classify each fish

Page 8: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 8/2812/31/2010 SAM_PR_UNIT-I 8

Problem Analysis  cont’d  Preprocessing: Use a segmentation operation to

isolate fishes from one another and from the

background

Information from a single fish is sent to a feature

extractor whose purpose is to reduce the data by

measuring certain features

The features are passed to a classifier  

Classification: Select the length of the fish as a

possible feature for discrimination

Page 9: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 9/2812/31/2010 SAM_PR_UNIT-I 9

Page 10: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 10/2812/31/2010 SAM_PR_UNIT-I 10

Classification

Select the length of the fish as a possible feature for 

discrimination

Page 11: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 11/2812/31/2010 SAM_PR_UNIT-I 11

The length is a poor feature alone!

Select the lightness as a possible feature 

Page 12: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 12/2812/31/2010 SAM_PR_UNIT-I 12

Threshold decision boundary and cost relationship

Move our decision boundary toward smaller values of 

lightness in order to minimize the cost (reduce the number of sea bass that are classified salmon!)

Task of decision theory

“Customers do not want sea bass in their 

cans of salmon” 

Page 13: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 13/2812/31/2010 SAM_PR_UNIT-I 13

 Adopt the lightness and add the width of the fish

Fish x = [x1, x2]

Lightness Width

Page 14: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 14/28

12/31/2010 SAM_PR_UNIT-I 14

Page 15: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 15/28

12/31/2010 SAM_PR_UNIT-I 15

We might add other features that are not correlated

with the ones we already have. A precaution should

be taken not to reduce the performance by addingsuch “noisy features” 

Ideally, the best decision boundary should be the one

which provides an optimal performance such as inthe following figure:

Page 16: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 16/28

12/31/2010 SAM_PR_UNIT-I 16

Page 17: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 17/28

12/31/2010 SAM_PR_UNIT-I 17

However, our satisfaction is premature because the

central aim of designing a classifier is to correctly

classify novel input

Page 18: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 18/28

12/31/2010 SAM_PR_UNIT-I 18

Decision Boundaries

Observe: Can do much better with two features

Page 19: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 19/28

12/31/2010 SAM_PR_UNIT-I 19

Pattern Recognition Systems /

A Complete PR System Sensing

Use of a transducer (camera or microphone)

PR system depends of the bandwidth, theresolution, sensitivity, distortion of the transducer 

Segmentation and grouping

Patterns should be well separated and should notoverlap

Page 20: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 20/28

12/31/2010 SAM_PR_UNIT-I 20

Pattern Recognition Systems /

A Complete PR System cont’d 

Page 21: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 21/28

12/31/2010 SAM_PR_UNIT-I 21

Pattern Recognition Systems cont’d  Feature extraction

Discriminative features

Invariant features with respect to translation,rotation and scale.

Classification

Use a feature vector provided by a featureextractor to assign the object to a category

Post Processing Exploit context dependent information other than

from the target pattern itself to improveperformance

Page 22: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 22/28

12/31/2010 SAM_PR_UNIT-I 22

The Design Cycle

Data collection

Feature Choice Model Choice

Training

Evaluation

Computational Complexity

Page 23: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 23/28

12/31/2010 SAM_PR_UNIT-I 23

The Design Cycle cont’d 

Page 24: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 24/28

12/31/2010 SAM_PR_UNIT-I 24

The Design Cycle cont’d  Data Collection

How do we know when we have collected an

adequately large and representative set of examples

for training and testing the system?

Feature Choice

Depends on the characteristics of the problem

domain. Simple to extract, invariant to irrelevant

transformation insensitive to noise.

Model Choice

Unsatisfied with the performance of our fish classifier 

and want to jump to another class of model

Page 25: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 25/28

12/31/2010 SAM_PR_UNIT-I 25

The Design Cycle cont’d  Training

Use data to determine the classifier. Many differentprocedures for training classifiers and choosing

models Computational Complexity

What is the trade-off between computational easeand performance?(How an algorithm scales as a function of the number 

of features, patterns or categories?) Evaluation

Measure the error rate (or performance and switchfrom one set of features to another one

Page 26: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 26/28

12/31/2010 SAM_PR_UNIT-I 26

LEARNING & ADAPTATION

 Any method that incorporates information from trainingsamples in the design of a classifier employs learning.

We use learning because all practical or interesting PRproblems are so hard that we cannot guess

classification decision ahead of time. Approach:   Assume some general form of model

Use training patterns to learn or estimate theunknown parameters.

Page 27: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 27/28

12/31/2010 SAM_PR_UNIT-I 27

LEARNING & ADAPTATION

Supervised Learning 

Teacher provides a label or cost for each pattern ina training set.

Objective: Reduce the sum of the costs for thesepatterns

Issues: How to make sure that the learningalgorithm

Can learn the solution.

Will be stable to parameter variation.

Will converge in finite time.Scale with # of training patterns & # of inputfeatures.

Favors "simple" solutions.

Page 28: PR_UNIT-I

7/30/2019 PR_UNIT-I

http://slidepdf.com/reader/full/prunit-i 28/28

12/31/2010 SAM PR UNIT I 28

LEARNING & ADAPTATION

Unsupervised Learning (Clustering) 

There is no explicit teacher.

System forms clusters or "natural grouping" of theinput patterns.

 Reinforcement Learning (Learning with a critic) 

No desired category is given. Instead, the onlyteaching feedback is that the tentative categoryis right or wrong.

Typical way to train a classifier:

Present an inputCompute its tentative label

Use the known target category label toimprove the classifier.