chapter 8 discriminative classifiers hidden markov models


Upload: eleanor-atkins

Post on 13-Dec-2015


CHAPTER 8

DISCRIMINATIVE CLASSIFIERS
HIDDEN MARKOV MODELS

Generative vs. Discriminative


The Perceptron Model


Example: Spam

Binary Decision Rule

Online Perceptron Training

Perceptron Training Illustration


Properties of Perceptrons

Issues with Perceptrons


Reasoning over Time

• Often, we want to reason about a sequence of observations
  - Speech recognition
  - Robot localization
  - User attention

• Need to introduce time into our models

• Basic approach: hidden Markov models (HMMs)

• More general: dynamic Bayes’ nets

Markov Models

Conditional Independence


Weather Example

Mini-Forward Algorithm

Example


Stationary Distributions

• If we simulate the chain long enough:
  - What happens?
  - Uncertainty accumulates
  - Eventually, we have no idea what the state is!

• Stationary distributions:
  - For most chains, the distribution we end up in is independent of the initial distribution
  - Called the stationary distribution of the chain
  - Usually, can only predict a short time out
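
The convergence described above can be illustrated with a minimal sketch that repeatedly applies a transition matrix to two different starting distributions. The two-state sun/rain chain and its probabilities are illustrative assumptions, not values taken from the slides.

```python
def step(dist, T):
    """One step of the chain: new[j] = sum_i dist[i] * T[i][j]."""
    n = len(dist)
    return [sum(dist[i] * T[i][j] for i in range(n)) for j in range(n)]


def simulate(dist, T, steps):
    """Push a distribution over states forward by the given number of steps."""
    for _ in range(steps):
        dist = step(dist, T)
    return dist


# Hypothetical two-state weather chain: state 0 = sun, state 1 = rain.
# T[i][j] is the probability of moving from state i to state j.
T = [[0.9, 0.1],
     [0.3, 0.7]]

from_sun = simulate([1.0, 0.0], T, 100)   # start certain of sun
from_rain = simulate([0.0, 1.0], T, 100)  # start certain of rain

# Both starting points end up at (approximately) the same distribution.
print(from_sun, from_rain)
```

For this assumed chain, both runs settle on the same distribution regardless of the starting state, which is the sense in which the initial distribution is "forgotten".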

Example: Web Link Analysis

Mini-Viterbi Algorithm

Hidden Markov Models

Example

Conditional Independence

HMM Applications

Forward Algorithm

Viterbi Algorithm

Viterbi Example


Viterbi Properties

• Designed for computing the most likely hidden state sequence given a sequence of observations in Hidden Markov Models

• Two passes: forward to compute the best-path (max) probabilities and back-pointers, then backward to reconstruct the most likely sequence

• What’s the time complexity?

• O(d^2 n), for d states and n observations. Why is this exciting? Brute-force search would have to score all d^n possible state sequences.

• Many extensions of the basic Viterbi algorithm have been developed for other models with similar local structure, such as syntactic parsing.
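
The two passes and the quadratic-in-states cost can be seen in a minimal sketch of the algorithm. The function name, the dictionary-based model layout, and the sun/rain/umbrella numbers below are illustrative assumptions, not the slides' own example.

```python
def viterbi(obs, states, start_p, trans_p, emit_p):
    """Return (probability, path) of the most likely hidden state sequence."""
    # best[t][s]: probability of the best path that ends in state s at time t.
    best = [{s: start_p[s] * emit_p[s][obs[0]] for s in states}]
    back = [{}]  # back[t][s]: predecessor of s on that best path

    # Forward pass: n time steps, d states, d predecessors each.
    for t in range(1, len(obs)):
        best.append({})
        back.append({})
        for s in states:
            prev, p = max(
                ((r, best[t - 1][r] * trans_p[r][s]) for r in states),
                key=lambda pair: pair[1],
            )
            best[t][s] = p * emit_p[s][obs[t]]
            back[t][s] = prev

    # Backward pass: follow the back-pointers from the best final state.
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for t in range(len(obs) - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return best[-1][last], path


# Hypothetical model: hidden weather, observed umbrella use.
states = ["sun", "rain"]
start_p = {"sun": 0.5, "rain": 0.5}
trans_p = {"sun": {"sun": 0.8, "rain": 0.2},
           "rain": {"sun": 0.3, "rain": 0.7}}
emit_p = {"sun": {"umbrella": 0.1, "none": 0.9},
          "rain": {"umbrella": 0.8, "none": 0.2}}

prob, path = viterbi(["none", "umbrella", "umbrella"],
                     states, start_p, trans_p, emit_p)
print(path, prob)
```

The nested loops over time steps, current states, and predecessor states are where the O(d^2 n) bound comes from. A practical implementation would usually work with log probabilities to avoid underflow on long sequences.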

Speech in an Hour


HMMs for Speech


HMMs for Continuous Obs.?

• Before: discrete, finite set of observations

• Now: spectral feature vectors are real-valued!

• Solution 1: discretization

• Solution 2: continuous emissions models
  - Gaussians
  - Multivariate Gaussians
  - Mixtures of Multivariate Gaussians

• A state is progressively:
  - Context independent subphone (~3 per phone)
  - Context dependent phone (=triphones)
  - State-tying of CD phone
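
As a rough sketch of "Solution 2", a state's emission probability table can be replaced by a density over real-valued feature vectors. The version below assumes diagonal covariances to keep the arithmetic simple (the slides do not specify a covariance structure), and all function names and parameter values are hypothetical.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a one-dimensional Gaussian at x."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def diag_gaussian_pdf(x, means, variances):
    """Multivariate Gaussian with diagonal covariance (an assumption here):
    the density factorizes into a product of one-dimensional densities."""
    p = 1.0
    for xi, m, v in zip(x, means, variances):
        p *= gaussian_pdf(xi, m, v)
    return p


def mixture_pdf(x, weights, means, variances):
    """Mixture of diagonal Gaussians: weighted sum of component densities."""
    return sum(w * diag_gaussian_pdf(x, m, v)
               for w, m, v in zip(weights, means, variances))


# Hypothetical emission model for one HMM state over 2-D feature vectors.
weights = [0.6, 0.4]
means = [[0.0, 0.0], [3.0, 3.0]]
variances = [[1.0, 1.0], [0.5, 0.5]]

print(mixture_pdf([0.1, -0.2], weights, means, variances))
```

In an HMM, a function like `mixture_pdf` would stand in wherever the discrete case looks up the probability of an observation given a state.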

ASR Lexicon: Markov Models


Viterbi with 2 Words + Unif. LM


Conclusion

• Perceptron
  - A discriminative model, an alternative to generative models like Naïve Bayes
  - Simple classification rule, based on a weight vector
  - Simple online learning algorithm, guaranteed to converge if training set is separable

• Hidden Markov Models
  - A special kind of Bayesian Network designed for reasoning about sequences of hidden states
  - Polynomial time inference for most likely state sequence (Viterbi) and marginalization (Forward-Backward)
  - Many applications
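
The perceptron's weight-vector decision rule and online update summarized above can be sketched in a few lines. This is the standard mistake-driven update, shown under assumed details: labels are +1/-1, the first feature is a constant bias term, and the tiny "spam" data set with its feature meanings is made up for illustration.

```python
def predict(w, x):
    """Binary decision rule: +1 if the weighted sum is positive, else -1."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) > 0 else -1


def train_perceptron(data, n_features, max_epochs=100):
    """Online training: update the weights only when an example is misclassified."""
    w = [0.0] * n_features
    for _ in range(max_epochs):
        mistakes = 0
        for x, y in data:
            if predict(w, x) != y:
                # Move the weight vector toward (y = +1) or away from (y = -1) x.
                w = [wi + y * xi for wi, xi in zip(w, x)]
                mistakes += 1
        if mistakes == 0:  # a full pass with no errors: the data is separated
            break
    return w


# Hypothetical features: [bias, count of "free", count of "meeting"].
# Label +1 = spam, -1 = not spam.
data = [([1, 2, 0], 1),
        ([1, 0, 2], -1),
        ([1, 3, 1], 1),
        ([1, 0, 1], -1)]

w = train_perceptron(data, n_features=3)
print(w, [predict(w, x) for x, _ in data])
```

The `max_epochs` cap is a practical safeguard: the loop terminates on its own for separable data, but it would cycle indefinitely on a training set that is not separable.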