sausage

Sausage

Lidia ManguEric Brill

Andreas StolckePresenter : Jen-Wei Kuo

2004/ 9 /24

Referred Reference

• CSL’00 Finding Consensus in Speech Recognition : Word Error Minimization and other Applications of Confusion Networks

• Eurospeech’99 Finding Consensus among Words : Lattice-Based Word Error Minimization

• Eurospeech’97 Explicit Word Error Minimization in N-Best List Rescoring

Motivation

• The mismatch between the standard scoring paradigm (MAP) and the evaluation metric (WER).

)|()()|(

WAPWPAWP

maximize sentence posterior probability minimize sentence level error

An ExampleCorrect answer : I’M DOING FINE

)|()|()|(

]|)(correct[]|)(correct[]|)(correct[

]|)(correct words[

AwPAwPAwP

AwEAwEAwE

Word Error Minimization

• Minimizing the expected word error under the posterior distribution

WRWWEARPRWWEEW ),()|(min)],([min

potential hypothesis

N-best ApproximationhypothesiscenterW

RWWEARPW

thecalled is

),()|(minarg )()()(

hypothesiscenterW

RWWEARPW

thecalled is

),()|(minarg )()()(

Lattice-Based Word Error Minimization

• Computational Problem– Several orders of magnitude larger than in N-best lists of

practical size.– No efficient algorithm of this kind.

• Fundamental Difficulty– Objective function is based on pairwise string distance, a

nonlocal measure.

• Solution– Replace pairwise string alignment with a modified

multiple string alignment.– WE (word error) MWE (modified word error)

Lattice to Confusion Network

Multiple Alignment

• Finding the optimal alignment is a problem for which no efficient solution is known (Gusfield, 1992)

• We resort to a heuristic approach based on lattice topology.

Algorithms

• Step1. Arc Pruning

• Step2. Same-Arc Clustering

• Step3. Intra-Word Clustering

• Step4*. Same-Phones Clustering

• Step5. Inter-Word Clustering

• Step6. Adding null hypothesis

• Step7. Consensus-based Lattice Pruning

Arc Pruning

Intra-Word Clustering

• Same-Arc Clustering– Arcs with with same word_id, start frame and

end frame would be merged first.

• Intra-Word Clustering– Arcs with same word_id would be merged.

yprobabilitposterior theis)(

length. their of sum by the normalized

which and between length overlap theis ),(

)(WID)(WID

)()(),(max)Intra_SIM(

212121

eeoverlap

epepeeoverlap,EEEeEe

Same-Phones Clustering

• Same-Phones Clustering– Arcs with same phone sequences would be

clustered in this stage.

)ence(phone_sequ)ence(phone_sequbut )WID()WID(

)()(),(max)Phone_SIM(

212121

epepeeoverlap,EEEeEe

Inter-Word Clustering

• Inter-Word Clustering– Remaining arcs be clustered at this stage

finally.

lengths their of sum by the normalized

series phone of distanceedit theminus 1 is

))(:()(

)()(),(avg)Inter_SIM(

weWordsFepwp

wpwpwwsim,FFFWordsw

FWirdsw

Adding null hypothesis

• For each equivalent class, if the sum of the posterior probabilities is less than threshold (0.6) than add the null hypothesis to the class.

Consensus-based Lattice Pruning

• Standard Method Likelihood-based– Paths whose overall score differs by more than

a threshold from the best-scoring path are removed from the word graph.

• Proposed Method Consensus-based– Firstly we construct a pruned confusion

network.– Then intersect the original lattice with the

pruned confusion network.

Algorithm

An Example

• How to merge ?

是誰

我是是

我是我

Computational Issues

• Partial Order Stupid Method:– History-based Look-ahead

• Apply first-pass search to find the history arcs for each arc. Generate the initial partial ordering.

• While clusters are merged, lots of computation for (recursive) updates are needed.

• Thousands of arcs need lots of memory storage.

Computational Issues – An example

If we merge B and C, what happened?

Experimental Set-up

• Lattices was built using HTK• Training Corpus

– Trained with about 60 hours of Switchboard speech.

– LM is a backoff trigram model trained on 2.2 million words of Switchboard transcripts.

• Testing Corpus– Test set in the 1997 JHU

Experimental Results

Set IIWER SER WER

MAP 38.5 65.3 42.9N-best (center) 37.9 65.6 42.3N-best (consensus) 37.6Lattice (consensus) 37.3 65.8 41.6Lattice (consensus withoutphonetic similarity)

Lattice (consensus withoutposteriors)

Set IHypothesis

Experimental Results

Hypothesis

F0 F1 F2 F3 F4 F5 FXOverall

MAP13.0

33.1 33.3

N-best (center)

33.0 　　

Lattice (consens

32.5 33.0

Confusion Network Analyses

Other Approaches

• ROVER (Recognizer Output Voting Error Reduction)

sausage

word errorlattice

word graph

expected word error

intraword clusteringarcs

interword clusteringstep6

intraword clusteringstep4

lattice consensus37

error mwe

Documents

sulami sausage recipes

louie’s finer meats cumberland, wi potential keywords...

andouille sausage

pasta sausage manual

home sausage making

sausage making

cheddar sausage bowl

mkt sausage

america’s finest sausage

sausage poster

cooked sausage

synthetic sausage casings

saxonville sausage

sausage dog

homemade sausage

sausage - bembrasilrestaurants.com

toast orbarmcake homemadeburgers extras … orbarmcake bacon...

sausage party.pdf

tuscan sausage soup

sausages and sausage products. market of sausages and...