
Statistical NLP, Spring 2010

Lecture 22: Summarization

Dan Klein – UC Berkeley

Includes slides from Aria Haghighi, Dan Gillick

Selection

• Maximum Marginal Relevance [Carbonell and Goldstein, 1998]

(timeline: mid-'90s to present)

Maximize similarity to the query

Minimize redundancy

[Figure: candidate sentences s1–s4 and query Q]

Greedy search over sentences
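To make the greedy loop concrete, here is a minimal sketch of MMR-style selection, assuming bag-of-words vectors with cosine similarity for both the sentences and the query; the trade-off weight `lam`, the similarity choice, and the helper names are illustrative, not the exact setup from the lecture.

```python
import math
from collections import Counter

def cosine(a, b):
    # cosine similarity between two bag-of-words Counters
    num = sum(a[w] * b[w] for w in a)
    den = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return num / den if den else 0.0

def mmr_select(sentences, query, k, lam=0.7):
    """Greedy MMR: trade off query relevance against redundancy with already-picked sentences."""
    vecs = [Counter(s.lower().split()) for s in sentences]
    qvec = Counter(query.lower().split())
    selected, candidates = [], list(range(len(sentences)))
    while candidates and len(selected) < k:
        def mmr_score(i):
            relevance = cosine(vecs[i], qvec)                                  # similarity to the query
            redundancy = max((cosine(vecs[i], vecs[j]) for j in selected), default=0.0)  # similarity to picks
            return lam * relevance - (1 - lam) * redundancy
        best = max(candidates, key=mmr_score)
        selected.append(best)
        candidates.remove(best)
    return [sentences[i] for i in selected]
```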

Selection

• Maximum Marginal Relevance
• Graph algorithms [Mihalcea, 2005++]

[Figure: similarity graph over sentences s1–s4]

Nodes are sentences

Edges are similarities

Stationary distribution represents node centrality
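A minimal sketch of that idea, assuming a bag-of-words cosine similarity graph and a damped random walk in the spirit of TextRank/LexRank-style methods; the power-iteration details and the damping factor are illustrative assumptions.

```python
import math
from collections import Counter

def bow_cosine(a, b):
    # cosine similarity between bag-of-words Counters
    num = sum(a[w] * b[w] for w in a)
    den = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return num / den if den else 0.0

def centrality(sentences, damping=0.85, iters=50):
    """Power iteration over the sentence similarity graph; returns one centrality score per sentence."""
    vecs = [Counter(s.lower().split()) for s in sentences]
    n = len(vecs)
    # edge weights = pairwise similarities, row-normalized into a transition matrix
    sim = [[bow_cosine(vecs[i], vecs[j]) if i != j else 0.0 for j in range(n)] for i in range(n)]
    rowsum = [sum(row) or 1.0 for row in sim]
    trans = [[sim[i][j] / rowsum[i] for j in range(n)] for i in range(n)]
    scores = [1.0 / n] * n
    for _ in range(iters):
        scores = [(1 - damping) / n + damping * sum(scores[i] * trans[i][j] for i in range(n))
                  for j in range(n)]
    return scores  # approximate stationary distribution; higher = more central
```

Sentences with the highest scores would then be extracted, subject to the length budget.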

Selection

• Maximum Marginal Relevance
• Graph algorithms
• Word distribution models

Input document distribution ~ Summary distribution (the summary's word distribution should match that of the input documents)

Summary distribution:

w         P_A(w)
Obama     ?
speech    ?
health    ?
Montana   ?

Input document distribution:

w         P_D(w)
Obama     0.017
speech    0.024
health    0.009
Montana   0.002
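As a small illustration, here is a sketch of estimating the input-document distribution P_D(w) shown in the table; a candidate summary's distribution P_A(w) could be estimated the same way from its sentences. Whitespace tokenization is a simplifying assumption.

```python
from collections import Counter

def unigram_distribution(texts):
    """Relative frequency of each word across the input documents: P_D(w)."""
    counts = Counter(w for text in texts for w in text.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

# e.g. p_d = unigram_distribution(input_documents); p_d.get("obama", 0.0)
```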

Selection


SumBasic [Nenkova and Vanderwende, 2005]

Value(w_i) = P_D(w_i)

Value(s_i) = sum of its word values

Choose the s_i with the largest value

Adjust P_D(w)

Repeat until the length constraint is reached
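A minimal sketch of that loop, following the steps above; the whitespace tokenization and the squaring update for P_D(w) (the adjustment used in the original SumBasic description) should be read as assumptions of this sketch.

```python
from collections import Counter

def sumbasic(sentences, max_words):
    """SumBasic-style selection: score sentences by word probabilities, pick greedily,
    then down-weight used words so later picks favor new content."""
    tokenized = [s.lower().split() for s in sentences]
    counts = Counter(w for toks in tokenized for w in toks)
    total = sum(counts.values())
    p = {w: c / total for w, c in counts.items()}   # Value(w_i) = P_D(w_i)

    summary, length, remaining = [], 0, set(range(len(sentences)))
    while remaining:
        # Value(s_i) = sum of its word values; choose the largest
        best = max(remaining, key=lambda i: sum(p[w] for w in tokenized[i]))
        if length + len(tokenized[best]) > max_words:
            remaining.remove(best)   # does not fit the length constraint; try the next best
            continue
        summary.append(sentences[best])
        length += len(tokenized[best])
        remaining.remove(best)
        for w in tokenized[best]:
            p[w] = p[w] ** 2         # adjust P_D(w) so repeated content is penalized
    return summary
```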

Selection

• Maximum Marginal Relevance
• Graph algorithms
• Word distribution models
• Regression models

sentence   word values   position   length
s1         12            1          24
s2         4             2          14
s3         6             3          18

[Figure: a learned scoring function F(x) ranks the sentences, here as s2, s3, s1]

Frequency is just one of many features
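A minimal sketch of a regression-style scorer over features like those in the table; the linear form and the weights are hypothetical stand-ins for a learned F(x).

```python
def f_x(features, weights):
    """A linear stand-in for a learned scoring function F(x)."""
    return sum(weights[name] * value for name, value in features.items())

# Feature values from the table; the weights are made up for illustration,
# a real system would learn them from data.
sentences = {
    "s1": {"word_values": 12, "position": 1, "length": 24},
    "s2": {"word_values": 4,  "position": 2, "length": 14},
    "s3": {"word_values": 6,  "position": 3, "length": 18},
}
weights = {"word_values": 1.0, "position": -0.5, "length": -0.1}

ranked = sorted(sentences, key=lambda s: f_x(sentences[s], weights), reverse=True)
```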

Selection

• Maximum Marginal Relevance
• Graph algorithms
• Word distribution models
• Regression models
• Topic model-based [Haghighi and Vanderwende, 2009]

Selection

[Figure: H & V 09 vs. PYTHY]

• Maximum Marginal Relevance
• Graph algorithms
• Word distribution models
• Regression models
• Topic models
• Globally optimal search [McDonald, 2007]

[Figure: candidate sentences s1–s4 and query Q]

Optimal search using MMR

Integer Linear Program

Selection

[Gillick and Favre, 2008]

Universal health care is a divisive issue.

Obama addressed the House on Tuesday.

President Obama remained calm.

The health care bill is a major test for the Obama administration.

[Figure: sentences s1–s4 with their concepts]

concept   value
obama     3
health    2
house     1

Selection

[Gillick and Favre, 2008]

Length limit: 18 words

summary          length   value
{s1, s3}         17       5       (greedy)
{s2, s3, s4}     17       6       (optimal)

Selection

[Gillick, Riedhammer, Favre, Hakkani-Tur, 2008]

Integer Linear Program for the maximum coverage model:

Objective: total concept value

Constraint: summary length limit

Constraint: maintain consistency between selected sentences and concepts
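A minimal sketch of this maximum-coverage ILP using the open-source PuLP modeler (assuming it is installed): binary variables for sentences and concepts, an objective that sums covered concept values, a length constraint, and linking constraints for consistency. The sentence lengths and concept assignments below are approximate readings of the running example, so the solver's choice may differ slightly from the slide's.

```python
from pulp import LpProblem, LpMaximize, LpVariable, lpSum

# Example data in the spirit of the running example (lengths are word counts).
concepts = {"obama": 3, "health": 2, "house": 1}
sentences = {
    "s1": {"words": 7,  "concepts": {"health"}},
    "s2": {"words": 6,  "concepts": {"obama", "house"}},
    "s3": {"words": 4,  "concepts": {"obama"}},
    "s4": {"words": 12, "concepts": {"obama", "health"}},
}
max_words = 18

prob = LpProblem("max_coverage_summary", LpMaximize)
x = {s: LpVariable(f"x_{s}", cat="Binary") for s in sentences}   # sentence selected?
y = {c: LpVariable(f"y_{c}", cat="Binary") for c in concepts}    # concept covered?

# Objective: total concept value
prob += lpSum(concepts[c] * y[c] for c in concepts)

# Constraint: summary length limit
prob += lpSum(sentences[s]["words"] * x[s] for s in sentences) <= max_words

# Consistency: a concept counts only if some selected sentence contains it,
# and selecting a sentence forces its concepts to be marked covered.
for c in concepts:
    prob += y[c] <= lpSum(x[s] for s in sentences if c in sentences[s]["concepts"])
for s in sentences:
    for c in sentences[s]["concepts"]:
        prob += y[c] >= x[s]

prob.solve()
chosen = [s for s in sentences if x[s].value() >= 0.5]
```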

Selection

[Gillick and Favre, 2009]

This ILP is tractable for reasonable problems

Selection

How to include sentence position?

First sentences are unique

Selection

How to include sentence position?

Only allow first sentences in the summary (a surprisingly strong baseline)

Up-weight concepts appearing in first sentences (included in the TAC 2009 system)

Identify more sentences that look like first sentences (the first-sentence classifier is not reliable enough yet)

Selection

Some interesting work on sentence ordering [Barzilay et al., 1997; 2002]

But choosing independent sentences is easier:

• First sentences usually stand alone well

• Sentences without unresolved pronouns

• Classifier trained on OntoNotes: <10% error rate

Baseline ordering module (chronological) is not obviously worse than anything fancier

Results [G & F, 2009]

• 52 submissions
• 27 teams
• 44 topics
• 10 input docs
• 100-word summaries

[Figure: evaluation of the Gillick & Favre system against other submissions on four metrics]

• Rating scale: 1-10; humans in [8.3, 9.3]
• Rating scale: 1-10; humans in [8.5, 9.3]
• Rating scale: 0-1; humans in [0.62, 0.77]
• Rating scale: 0-1; humans in [0.11, 0.15]

Error Breakdown? [Gillick and Favre, 2008]

Sentence extraction is limiting

... and boring!

But abstractive summaries are much harder to generate…

in 25 words?

Beyond Extraction?

http://www.rinkworks.com/bookaminute/
