statistical nlp spring 2010 lecture 22: summarization dan klein – uc berkeley includes slides from...

37
Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

Upload: prosper-watkins

Post on 29-Jan-2016

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

Statistical NLPSpring 2010

Lecture 22: SummarizationDan Klein – UC Berkeley

Includes slides from Aria Haghighi, Dan Gillick

Page 2: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 3: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 4: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

Selection

• Maximum Marginal Relevance

mid-‘90s

present

Maximize similarity to the query

Minimize redundancy

[Carbonell and Goldstein, 1998]ss11

ss33

ss22

ss44QQ

Greedy search over sentences

Page 5: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

• Maximum Marginal Relevance• Graph algorithms [Mihalcea 05++]

mid-‘90s

present

Selection

Page 6: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

• Maximum Marginal Relevance• Graph algorithms

mid-‘90s

presentss11

ss33

ss22

ss44Nodes are sentences

Selection

Page 7: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

• Maximum Marginal Relevance• Graph algorithms

mid-‘90s

presentss11

ss33

ss22

ss44Nodes are sentences

Edges are similarities

Selection

Page 8: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

• Maximum Marginal Relevance• Graph algorithms

mid-‘90s

present ss11

ss33

ss22

ss44Nodes are sentences

Edges are similarities

Stationary distribution represents node centrality

Selection

Page 9: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

• Maximum Marginal Relevance• Graph algorithms• Word distribution models

mid-‘90s

present

Input document distribution Summary distribution

~

ww PPAA(w)(w)

Obama ?

speech ?

health ?

Montana ?

ww PPDD(w)(w)

Obama 0.017

speech 0.024

health 0.009

Montana 0.002

Selection

Page 10: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

• Maximum Marginal Relevance• Graph algorithms• Word distribution models

mid-‘90s

present

SumBasic [Nenkova and Vanderwende, 2005]

Value(wi) = PD(wi)

Value(si) = sum of its word values

Choose si with largest value

Adjust PD(w)

Repeat until length constraint

Selection

Page 11: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

• Maximum Marginal Relevance• Graph algorithms• Word distribution models• Regression models

mid-‘90s

present

ss11

ss22

ss33

word word valuesvalues positionposition lengthlength

12 1 24

4 2 14

6 3 18

ss22

ss33

ss11

F(x)

frequency is just one of many features

Selection

Page 12: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

• Maximum Marginal Relevance• Graph algorithms• Word distribution models• Regression models

• Topic model-based[Haghighi and Vanderwende, 2009]

mid-‘90s

present

Selection

Page 13: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 14: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 15: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 16: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 17: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 18: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 19: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 20: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

H & V 09PYTHY

Page 21: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 22: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick
Page 23: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

• Maximum Marginal Relevance• Graph algorithms• Word distribution models• Regression models• Topic models

• Globally optimal search

mid-‘90s

present [McDonald, 2007]

ss11

ss33

ss22

ss44QQ

Optimal search using MMR

Integer Linear Program

Selection

Page 24: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

[Gillick and Favre, 2008]

Universal health care is a divisive issue.

Obama addressed the House on Tuesday.

President Obama remained calm.

The health care bill is a major test for the Obama administration.ss11

ss22

ss33

ss44

conceptconcept valuevalue

Selection

Page 25: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

[Gillick and Favre, 2008]

Universal health care is a divisive issue.

Obama addressed the House on Tuesday.

President Obama remained calm.

The health care bill is a major test for the Obama administration. conceptconcept valuevalue

obama 3ss11

ss22

ss33

ss44

Selection

Page 26: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

[Gillick and Favre, 2008]

Universal health care is a divisive issue.

Obama addressed the House on Tuesday.

President Obama remained calm.

The health care bill is a major test for the Obama administration. conceptconcept valuevalue

obama 3

health 2

ss11

ss22

ss33

ss44

Selection

Page 27: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

[Gillick and Favre, 2008]

Universal health care is a divisive issue.

Obama addressed the House on Tuesday.

President Obama remained calm.

The health care bill is a major test for the Obama administration. conceptconcept valuevalue

obama 3

health 2

house 1

ss11

ss22

ss33

ss44

Selection

Page 28: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

[Gillick and Favre, 2008]

Universal health care is a divisive issue.

Obama addressed the House on Tuesday.

President Obama remained calm.

conceptconcept valuevalue

obama 3

health 2

house 1

ss11

ss22

ss33

ss44

The health care bill is a major test for the Obama administration.

summarysummary lengthlength valuevalue

{s1, s3} 17 5

{s2, s3, s4}

17 6

Length limit: 18 words

greedy

optimal

Selection

Page 29: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

[Gillick, Riedhammer, Favre, Hakkani-Tur, 2008]

total concept value

summary length limit

maintain consistency between selected sentences

and concepts

Integer Linear Program for the maximum coverage model

Selection

Page 30: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

[Gillick and Favre, 2009]

This ILP is tractable for reasonable problems

Selection

Page 31: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

How to include sentence position?

First sentences are

unique

Selection

Page 32: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

How to include sentence position?

Only allow first sentences in the summaryUp-weight concepts appearing in first sentences

Identify more sentences that look like first sentences

surprisingly strong baseline

included in TAC 2009 system

first sentence classifier is not reliable enough

yet

Selection

Page 33: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

Some interesting work on sentence ordering[Barzilay et. al., 1997; 2002]

But choosing independent sentences is easier• First sentences usually stand alone well

• Sentences without unresolved pronouns• Classifier trained on OntoNotes: <10% error rate

Baseline ordering module (chronological) is not obviously worse than anything fancier

Selection

Page 34: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

•52 submissions•27 teams•44 topics

•10 input docs•100 word summaries

Gillick & Favre• Rating scale: 1-10• Humans in [8.3, 9.3]

• Rating scale: 1-10• Humans in [8.5, 9.3]

• Rating scale: 0-1• Humans in [0.62, 0.77]

• Rating scale: 0-1• Humans in [0.11, 0.15]

Results [G & F, 2009]

Page 35: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

[Gillick and Favre, 2008]

Error Breakdown?

Page 36: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

Sentence extraction is limiting

... and boring!

But abstractive summaries are much harder to generate…

in 25 words?

Beyond Extraction?

Page 37: Statistical NLP Spring 2010 Lecture 22: Summarization Dan Klein – UC Berkeley Includes slides from Aria Haghighi, Dan Gillick

http://www.rinkworks.com/bookaminute/