modeling coreference in contexts with three referents€¦ · modeling coreference in contexts with...

38
Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 1 / 16

Upload: others

Post on 25-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Modeling Coreference in Contexts with Three Referents

Jet Hoek, Andrew Kehler & Hannah Rohde

RAILS, 25 October 2019

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 1 / 16

Page 2: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

The puzzle

Donald called Rudy. . . .

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 2 / 16

Page 3: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Models of coreference

Mirror Model (Ariel 1990; Gundel et al. 1993)

p(referent|pronoun) ∼ p(pronoun|referent)

Expectancy Model (Arnold 2001)

p(referent|pronoun) ∼ p(referent)

Bayesian Model (Kehler et al. 2008; Kehler & Rohde 2013; Rohde & Kehler 2014)

p(referent|pronoun)interpretation ∼ p(referent)prior ∗ p(pronoun|referent)likelihood

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 3 / 16

Page 4: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Models of coreference

Mirror Model (Ariel 1990; Gundel et al. 1993)

p(referent|pronoun) ∼ p(pronoun|referent)

Expectancy Model (Arnold 2001)

p(referent|pronoun) ∼ p(referent)

Bayesian Model (Kehler et al. 2008; Kehler & Rohde 2013; Rohde & Kehler 2014)

p(referent|pronoun)interpretation ∼ p(referent)prior ∗ p(pronoun|referent)likelihood

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 3 / 16

Page 5: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Models of coreference

Mirror Model (Ariel 1990; Gundel et al. 1993)

p(referent|pronoun) ∼ p(pronoun|referent)

Expectancy Model (Arnold 2001)

p(referent|pronoun) ∼ p(referent)

Bayesian Model (Kehler et al. 2008; Kehler & Rohde 2013; Rohde & Kehler 2014)

p(referent|pronoun)interpretation ∼ p(referent)prior ∗ p(pronoun|referent)likelihood

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 3 / 16

Page 6: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Models of coreference

Mirror Model (Ariel 1990; Gundel et al. 1993)

p(referent|pronoun) ∼ p(pronoun|referent)

Expectancy Model (Arnold 2001)

p(referent|pronoun) ∼ p(referent)

Bayesian Model (Kehler et al. 2008; Kehler & Rohde 2013; Rohde & Kehler 2014)

p(referent|pronoun)interpretation ∼ p(referent)prior ∗ p(pronoun|referent)likelihood

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 3 / 16

Page 7: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Interpretation does not equal production

Story continuation

John scolded Bob. He [pronoun prompt]John scolded Bob. [free prompt]

The Bayesian model captures this asymmetry

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 4 / 16

Page 8: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Interpretation does not equal production

Story continuation

John scolded Bob. He [pronoun prompt]John scolded Bob. [free prompt]

The Bayesian model captures this asymmetry

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 4 / 16

Page 9: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Weak versus strong Bayes

Bayesian Model

p(referent|pronoun)interpretation ∼ p(referent)prior ∗ p(pronoun|referent)likelihood

In its strong form, the Bayesian model separates the discourse featuresthat influence the prior and the likelihood:

meaning drives the prior

topicality drives the likelihood

→ Recent work that shows that the likelihood of pronominalizationincreases for referents with a higher prior (e.g., Rosa & Arnold 2017)

In its weak form, the Bayesian model states that pronoun productionand interpretation are related by Bayesian principles.

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 5 / 16

Page 10: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Weak versus strong Bayes

Bayesian Model

p(referent|pronoun)interpretation ∼ p(referent)prior ∗ p(pronoun|referent)likelihood

In its strong form, the Bayesian model separates the discourse featuresthat influence the prior and the likelihood:

meaning drives the prior

topicality drives the likelihood

→ Recent work that shows that the likelihood of pronominalizationincreases for referents with a higher prior (e.g., Rosa & Arnold 2017)

In its weak form, the Bayesian model states that pronoun productionand interpretation are related by Bayesian principles.

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 5 / 16

Page 11: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Weak versus strong Bayes

Bayesian Model

p(referent|pronoun)interpretation ∼ p(referent)prior ∗ p(pronoun|referent)likelihood

In its strong form, the Bayesian model separates the discourse featuresthat influence the prior and the likelihood:

meaning drives the prior

topicality drives the likelihood

→ Recent work that shows that the likelihood of pronominalizationincreases for referents with a higher prior (e.g., Rosa & Arnold 2017)

In its weak form, the Bayesian model states that pronoun productionand interpretation are related by Bayesian principles.

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 5 / 16

Page 12: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Current study

Most of the research on pronoun production / interpretation hasfocused on sentence frames with two referents.

Results appear to differ between implicit causality verbs and studieswith transfer-of-possession verbs(e.g., Rohde 2008; Fukumura & van Gompel 2010 versus Rosa & Arnold 2017)

In a new context type with three referents, we test:

1 whether predictability influences pronominalization

2 whether Bayes’ Rule captures the relationship between pronouninterpretation and production

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 6 / 16

Page 13: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Current study

Most of the research on pronoun production / interpretation hasfocused on sentence frames with two referents.

Results appear to differ between implicit causality verbs and studieswith transfer-of-possession verbs(e.g., Rohde 2008; Fukumura & van Gompel 2010 versus Rosa & Arnold 2017)

In a new context type with three referents, we test:

1 whether predictability influences pronominalization

2 whether Bayes’ Rule captures the relationship between pronouninterpretation and production

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 6 / 16

Page 14: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Story continuation experiment

Items

Adam called Diana for Russel. He [pronoun prompt]Adam called Diana for Russel. [free prompt]

Counterbalanced which referents were gender-matched(NP1&NP2, NP1&NP3, NP2&NP3)

83 native speakers of English

30 items

Continuations were coded for:

who the continuation is aboutwhat form of referring expression is used (free prompt condition only)

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 7 / 16

Page 15: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Story continuation experiment

Items

Adam called Diana for Russel. He [pronoun prompt]Adam called Diana for Russel. [free prompt]

Counterbalanced which referents were gender-matched(NP1&NP2, NP1&NP3, NP2&NP3)

83 native speakers of English

30 items

Continuations were coded for:

who the continuation is aboutwhat form of referring expression is used (free prompt condition only)

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 7 / 16

Page 16: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Story continuation experiment

Items

Adam called Diana for Russel. He [pronoun prompt]Adam called Diana for Russel. [free prompt]

Counterbalanced which referents were gender-matched(NP1&NP2, NP1&NP3, NP2&NP3)

83 native speakers of English

30 items

Continuations were coded for:

who the continuation is aboutwhat form of referring expression is used (free prompt condition only)

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 7 / 16

Page 17: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Story continuation experiment

Items

Adam called Diana for Russel. He [pronoun prompt]Adam called Diana for Russel. [free prompt]

Counterbalanced which referents were gender-matched(NP1&NP2, NP1&NP3, NP2&NP3)

83 native speakers of English

30 items

Continuations were coded for:

who the continuation is aboutwhat form of referring expression is used (free prompt condition only)

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 7 / 16

Page 18: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Results: More subject continuations in pronoun prompt

Free prompt Pronoun prompt

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 8 / 16

Page 19: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Results: Subjects are preferentially pronominalized

Free prompt

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 9 / 16

Page 20: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Results 1: Does predictability influence pronominalization?

Free prompt

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 10 / 16

Page 21: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Results 1: Does predictability influence pronominalization?

Free prompt

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 10 / 16

Page 22: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Results 2: Does Bayes’ Rule rule?

Following Rohde & Kehler (2014), we used the free prompt continuations to calculateBayes-derived estimates of p(referent|pronoun) via the prior p(referent) and likelihoodp(pronoun|referent), as well as estimates for the Expectancy Model (prior) and the MirrorModel (normalized likelihood). We then compared the model estimates with the pronouninterpretations measured in the pronoun prompt condition

Items: Bayes: R2 = .122, Expectancy: R2 = .003, Mirror: R2 = .377Participants: Bayes: R2 = .084, Expectancy: R2 = .021, Mirror: R2 = .075

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 11 / 16

Page 23: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Results 2: Does Bayes’ Rule rule?

Following Rohde & Kehler (2014), we used the free prompt continuations to calculateBayes-derived estimates of p(referent|pronoun) via the prior p(referent) and likelihoodp(pronoun|referent), as well as estimates for the Expectancy Model (prior) and the MirrorModel (normalized likelihood). We then compared the model estimates with the pronouninterpretations measured in the pronoun prompt condition

Items: Bayes: R2 = .122, Expectancy: R2 = .003, Mirror: R2 = .377Participants: Bayes: R2 = .084, Expectancy: R2 = .021, Mirror: R2 = .075

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 11 / 16

Page 24: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Interim discussion

We do not find any evidence that pronominalization is affected bypredictability

→ In line with strong Bayes

The Bayesian model outperforms the Expectancy model

The Bayesian model is outperformed by the Mirror model

→ Is this due to the construction or does it have something to dowith the number of referents?

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 12 / 16

Page 25: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Interim discussion

We do not find any evidence that pronominalization is affected bypredictability

→ In line with strong Bayes

The Bayesian model outperforms the Expectancy model

The Bayesian model is outperformed by the Mirror model

→ Is this due to the construction or does it have something to dowith the number of referents?

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 12 / 16

Page 26: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Interim discussion

We do not find any evidence that pronominalization is affected bypredictability

→ In line with strong Bayes

The Bayesian model outperforms the Expectancy model

The Bayesian model is outperformed by the Mirror model

→ Is this due to the construction or does it have something to dowith the number of referents?

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 12 / 16

Page 27: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Follow-up: 2-human Benefactive prompts

Items

Adam called the hospital for Russel. He [pronoun prompt]Adam called the hospital for Russel. [free prompt]

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 13 / 16

Page 28: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Follow-up: 2-human Benefactive prompts

Items

Adam called the hospital for Russel. He [pronoun prompt]Adam called the hospital for Russel. [free prompt]

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 13 / 16

Page 29: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Follow-up: 2-human Benefactive prompts

Items: Bayes: R2 = .719, Expectancy: R2 = .311, Mirror: R2 = .714Participants: Bayes: R2 = .348, Expectancy: R2 = .008, Mirror: R2 = .282

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 14 / 16

Page 30: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Follow-up: 2-human Benefactive prompts

Items: Bayes: R2 = .719, Expectancy: R2 = .311, Mirror: R2 = .714Participants: Bayes: R2 = .348, Expectancy: R2 = .008, Mirror: R2 = .282

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 14 / 16

Page 31: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Discussion

The models’ poor fit for the observed pronoun interpretation data inour first experiment appears to be due to the number of referents

In the experiment with 2-human Benefactive prompts, Bayes is back

But why?Power issue?

But no fewer observations per ambiguous pair than earlier work with 2referents

3 referents make the task harder?

But is it really? In which way? And why would this matter?

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 15 / 16

Page 32: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Discussion

The models’ poor fit for the observed pronoun interpretation data inour first experiment appears to be due to the number of referents

In the experiment with 2-human Benefactive prompts, Bayes is back

But why?Power issue?

But no fewer observations per ambiguous pair than earlier work with 2referents

3 referents make the task harder?

But is it really? In which way? And why would this matter?

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 15 / 16

Page 33: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Discussion

The models’ poor fit for the observed pronoun interpretation data inour first experiment appears to be due to the number of referents

In the experiment with 2-human Benefactive prompts, Bayes is back

But why?

Power issue?

But no fewer observations per ambiguous pair than earlier work with 2referents

3 referents make the task harder?

But is it really? In which way? And why would this matter?

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 15 / 16

Page 34: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Discussion

The models’ poor fit for the observed pronoun interpretation data inour first experiment appears to be due to the number of referents

In the experiment with 2-human Benefactive prompts, Bayes is back

But why?Power issue?

But no fewer observations per ambiguous pair than earlier work with 2referents

3 referents make the task harder?

But is it really? In which way? And why would this matter?

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 15 / 16

Page 35: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Discussion

The models’ poor fit for the observed pronoun interpretation data inour first experiment appears to be due to the number of referents

In the experiment with 2-human Benefactive prompts, Bayes is back

But why?Power issue?

But no fewer observations per ambiguous pair than earlier work with 2referents

3 referents make the task harder?

But is it really? In which way? And why would this matter?

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 15 / 16

Page 36: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Discussion

The models’ poor fit for the observed pronoun interpretation data inour first experiment appears to be due to the number of referents

In the experiment with 2-human Benefactive prompts, Bayes is back

But why?Power issue?

But no fewer observations per ambiguous pair than earlier work with 2referents

3 referents make the task harder?

But is it really? In which way? And why would this matter?

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 15 / 16

Page 37: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Discussion

The models’ poor fit for the observed pronoun interpretation data inour first experiment appears to be due to the number of referents

In the experiment with 2-human Benefactive prompts, Bayes is back

But why?Power issue?

But no fewer observations per ambiguous pair than earlier work with 2referents

3 referents make the task harder?

But is it really? In which way? And why would this matter?

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 15 / 16

Page 38: Modeling Coreference in Contexts with Three Referents€¦ · Modeling Coreference in Contexts with Three Referents Jet Hoek, Andrew Kehler & Hannah Rohde RAILS, 25 October 2019 Hoek,

Thank [email protected]

Hoek, Kehler & Rohde Modeling Coreference 25 October 2019 16 / 16