applications (2 of 2) - github pageslintool.github.io/.../session14-slides.pdf · applications (2...

68
Applications (2 of 2): Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation Alignment etc Segmentation, Alignment, etc. Kenneth Church [email protected] [email protected] Dec 9, 2009 1

Upload: others

Post on 26-Jun-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Applications (2 of 2):Applications (2 of 2):Recognition, Transduction, Discrimination, 

Segmentation Alignment etcSegmentation, Alignment, etc.

Kenneth Church

[email protected]@jhu.edu

Dec 9, 2009 1

Page 2: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Solitaire  Multiplayer Games: Auctions (Ads)http://www.scienceoftheweb.org/15‐396/lectures/lecture09.pdf

Right Rail

Right Rail: Avoid 

Dec 9, 2009 2

Mainline Addistortions from commercial interests

Page 3: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

A Single Auction  A Stream of Continuous Auctionsg

• Standard Example of Second Price AuctionSta da d a p e o Seco d ce uct o– Single Auction for a Single Apple

• Theoretical Result– Second Price Auction  Truth Telling– http://en.wikipedia.org/wiki/Vickrey_auction– Optimal Strategy: 

• Bid what the apple is worth to you• Don’t worry about what it is worth to others• Don t worry about what it is worth to others

– First Price Auction  Truth Telling

• Does theory generalize to a continuous stream?Does theory generalize to a continuous stream?

Dec 9, 2009 3

Page 4: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Pricing: Cost Per Click (CPC)Pricing: Cost Per Click (CPC)

• Bi = your bid • Equilibriumi  y

• Bi+1 = next bid

• CTRi = your click through rate

– Advertisers• Awareness• Sales

• CTRi+1 = next click through rate

• CPCi = your price(if h d d li k )

Sa es• New Customers• ROI

– Users– (if we show your ad and user clicks)

• Improvement: CTR  Q (Prior)

• Single Auction:

Users• Minimize pain• Obtain Value

Market Maker• Single Auction: – CPCi = Bi+1 

• Continuous Stream:

– Market Maker• Maximize Revenue

• Continuous Stream: – CPCi = Bi+1 CTRi+1 / CTRi

• Truth Telling?Dec 9, 2009 4

Page 5: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Multi‐Player Games Many Technical OpportunitiesMany Technical Opportunities

• Economics– http://www.wired.com/culture/culturereviews/magazine/17‐06/nep_googlenomics?currentPage=all

• Machine Learning– Learning to Rank

– Estimate CTR (Q/Priors)Estimate CTR (Q/Priors)

– Sparse Data: • What is the CTR for a new ad?

– Errors can be expensive– Errors can be expensive• If CTR is too low for new ad  Penalize Growth

• If too high  Reward Bad Guys to do Bad Things

• Truth Telling for Continuous Auctions?• Truth Telling for Continuous Auctions?– Probably not, especially if participants can estimate Q better than 

market maker

• Machine Learning: Solitaire  Multi‐Player Games– Can I estimate Q better than you can?   Man‐eating tiger

Dec 9, 2009 5

Page 6: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

ApplicationsApplications• Recognition: Shannon’s Noisy Channel Model

Speech Optical Character Recognition (OCR) Spelling– Speech, Optical Character Recognition (OCR), Spelling• Transduction

– Part of Speech (POS) Tagging– Machine Translation (MT)

• Parsing ???• Parsing: ???• Ranking

– Information Retrieval (IR)– LexicographyDi i i i• Discrimination:

– Sentiment, Text Classification, Author Identification, Word Sense Disambiguation (WSD)• Segmentation

– Asian Morphology (Word Breaking), Text Tiling• Alignment: Bilingual Corpora, Dotplots• Compression• Language Modeling: good for everything

Dec 9, 2009 6

Page 7: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Speech  Language

Shannon’s: Noisy Channel ModelShannon s: Noisy Channel Model

• I  Noisy Channel  OChannelModel

LanguageModely

• I΄ ≈ ARGMAXI Pr(I|O) = ARGMAXI Pr(I) Pr(O|I)Trigram Language Model Application

IndependentWord Rank More likely alternatives

We 9 The This One Two A Three Please In

need 7 are will the would also do

Channel Model

Application Input Output

Independent

need 7 are will the would also do

to 1

resolve 85 have know do…

all 9 The This One Two A Three

Application Input Output

Speech Recognition writer rider

OCR (Optical Call 9 Please In

of 2 The This One Two A Three Please In

the 1

Character Recognition)

all a1l

Spelling Correction government goverment

7

the 1

important 657 document question first…

issues 14 thing point toDec 9, 2009

Page 8: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Speech  LanguageUsing (Abusing) Shannon’s Noisy Channel Model: Part of g ( g) y

Speech Tagging and Machine Translation

• Speechp– Words  Noisy Channel  Acoustics

• OCR– Words  Noisy Channel  Optics

• Spelling CorrectionW d N i Ch l T– Words  Noisy Channel  Typos

• Part of Speech Tagging (POS): – POS Noisy Channel Words– POS  Noisy Channel  Words

• Machine Translation: “Made in America”– English  Noisy Channel  French

8

g y

Didn’t have the guts to use this slide at Eurospeech (Geneva)Dec 9, 2009

Page 9: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 9

Page 10: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Spelling Correction

Dec 9, 2009 10

Page 11: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 11

Page 12: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 12

Page 13: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 13

Page 14: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 14

Page 15: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

EvaluationEvaluation

Dec 9, 2009 15

Page 16: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

PerformancePerformance

Dec 9, 2009 16

Page 17: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

The Task is Hard without ContextThe Task is Hard without Context

Dec 9, 2009 17

Page 18: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Easier with ContextEasier with Context

• actuall actual actuallyactuall, actual, actually– … in determining whether the defendant actually will diewill die.

• constuming, consuming, costuming

i d i t d i d• conviced, convicted, convinced

• confusin, confusing, confusion

• workern, worker, workers

Dec 9, 2009 18

Page 19: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Easier with Context

Dec 9, 2009 19

Page 20: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Context ModelContext Model

Dec 9, 2009 20

Page 21: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 21

Page 22: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 22

Page 23: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 23

Page 24: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 24

Page 25: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Future ImprovementsFuture Improvements

• Add More FactorsAdd More Factors– Trigrams

Thesaurus Relations– Thesaurus Relations

– Morphology

S t ti A t– Syntactic Agreement

– Parts of Speech

b l• Improve Combination Rules– Shrink (Meaty Methodology)

Dec 9, 2009 25

Page 26: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 26

Page 27: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Conclusion (Spelling Correction)Conclusion (Spelling Correction)

• There has been a lot of interest in smoothingThere has been a lot of interest in smoothing– Good‐Turing estimation

Knesser Ney– Knesser‐Ney

• Is it worth the trouble?

• Ans: Yes (at least for recognition applications)

Dec 9, 2009 27

Page 28: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 28

Page 29: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 29

Page 30: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 30

Page 31: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 31

Page 32: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 32

Page 33: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 33

Page 34: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 34

Page 35: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 35

Page 36: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 36

Page 37: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 37

Page 38: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 38

Page 39: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 39

Page 40: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 40

Page 41: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 41

Page 42: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 42

Page 43: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 43

Page 44: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 44

Page 45: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 45

Page 46: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 46

Page 47: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 47

Page 48: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 48

Page 49: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 49

Page 50: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 50

Page 51: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Aligning WordsAligning Words

Dec 9, 2009 51

Page 52: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 52

Page 53: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 53

Page 54: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 54

Page 55: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 55

Page 56: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 56

Page 57: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 57

Page 58: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 58

Page 59: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 59

Page 60: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 60

Page 61: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 61

Page 62: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 62

Page 63: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 63

Page 64: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 64

Page 65: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 65

Page 66: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 66

Page 67: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 67

Page 68: Applications (2 of 2) - GitHub Pageslintool.github.io/.../session14-slides.pdf · Applications (2 of 2): Recognition, Transduction, Discrimination, Segmentation, Alignment, etc. Kenneth

Dec 9, 2009 68