approximate factoring for a* search aria haghighi, john denero, and dan klein computer science...

35
Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California

Post on 22-Dec-2015

214 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Approximate Factoring for A* Search

Aria Haghighi, John DeNero, and Dan Klein

Computer Science Division

University of California Berkeley

Page 2: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Inference for NLP Tasks

A* Search

Page 3: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Inference as Search

ya1

a2

a3

PartialHypothesis

a2

Page 4: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

VP

S

NP

Bitext Parsing as Search

translation is hard , la traducción es dificil

Weighted Synchronous Grammar

Parsing O(n6)

Modified CKY over bi-spans (X[i,j],X’[i’,j’])

Source Target

VP

S

NP

S S’

Page 5: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

A* Search

Completion ScoreScore So Far

y

Page 6: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

A* Search

Heuristic Design Tight

small Admissible

Efficient to compute

This way hypothesis!

A* Heuristic ManOptimal Result

Page 7: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

A* Example: Bitext Search

Viterbi Inside Score

Cost So Far

Bi-Span

Page 8: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

A* Bitext Search

Viterbi Outside Score

Completion Score

O(n6)Ideal Heuristic

Page 9: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Of Stately Projections ¼

S S’

S SVP

S

NP

S S’

S S’

VP

S

NP VP’

S’

NP’

VP’

S’

NP’

Page 10: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

A* Bitext Search

Suppose,Then,

VP

S

NP

S S’

VPVP

S

NP

S

NP

VP’

S’

NP’

Page 11: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Projection Heuristic

O(n3) O(n3) O(n6)

Klein and Manning [2003]

Page 12: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

When models don’t factorize

Page 13: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

When models don’t factorize

Pointwise Admissibility

y

c(a)

x

¼s(y)

Ás(a)

¼s(x) ¼t(y)

Át(a)

¼t(x)

Page 14: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

When models don’t factorize

Admissibility

¼s(y) ¼t(y)

y

Page 15: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Finding Factored Costs

Pointwise Gap

How to find Ás and Át?

Page 16: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Finding Factored Costs

Small gaps

Page 17: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Finding Factored Costs

PointwiseAdmissibility

Page 18: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Finding Factored Costs

Page 19: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Bitext Experiments

Synchronous Tree-to-Tree Transducer Trained on 40k sentences of English-Spanish Europarl [Galley et. al, 2004] Rare words replaced with POS tags Tested on 1,200 sent. max length 5-15

Optimization Problem Solved only once per grammar 206K Variables 160K Constraints 29 minutes

Page 20: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Bitext Experiments

Page 21: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Bitext Experiments

Page 22: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Bitext Experiments

Zhang and Gildea (2006)

Page 23: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Bitext Experiments

Zhang and Gildea (2006)

Page 24: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Lexicalized Parsing

NP-(translation,NN)

S-(is,VBZ)

VP-(is,VBZ)

(is,VBZ)

(translation, NN)NP

S

VP

Klein and Manning [2003]

Page 25: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Lexicalized Parsing

Page 26: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Lexicalized Parsing

Too many constraints to efficiently solve!

Over 64e13

possiblelexicalized

rules

Page 27: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Lexicalized Parsing

Page 28: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Lexicalized Parsing

Page 29: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Lexicalized Parsing

Page 30: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Lexicalized Parsing

Page 31: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Lexicalized Model Experiments

Standard Setup Train on section 2-21 of the treebank Test on section 23 (length · 40)

Models Tested Factored model [Klein and Manning, 2003]

Non-Factored Model

Page 32: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Lexicalized Parsing

Factored Model [Klein and Manning, 2003]

Page 33: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Lexicalized Parsing

Non-Factored Model

Page 34: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Conclusions

General technique for generating A* estimates

Can explicitly control admissibility tightness trade-off

Future Work: Explore different objectives and applications

Page 35: Approximate Factoring for A* Search Aria Haghighi, John DeNero, and Dan Klein Computer Science Division University of California Berkeley

Thanks

http://nlp.cs.berkeley.edu