stress and gradient weight in portuguese

40
i ii iii Stress and gradient weight in Portuguese Guilherme D. Garcia guilhermegarcia.github.io McGill University WCCFL 33 | March 2015 Garcia 2015 Stress and gradient weight in Portuguese 1 / 40

Upload: others

Post on 16-Oct-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Stress and gradient weight in Portuguese

iiiiii

Stress and gradient weight in Portuguese

Guilherme D. Garciaguilhermegarcia.github.io

McGill University

WCCFL 33 | March 2015

Garcia 2015 Stress and gradient weight in Portuguese 1 / 40

Page 2: Stress and gradient weight in Portuguese

iiiiii

Portuguese stress: previous analyses

I Weight effects on stress: only word-finally 5

I I Falling diphthongs or coda segments → heavy

I Weight-sensitivity is categorical 5I Word-final syllables: either H or L

I Syllable-based generalizations (rules and constraints) 5See Araujo (2007) for a comprehensive review

Garcia 2015 Stress and gradient weight in Portuguese 2 / 40

Page 3: Stress and gradient weight in Portuguese

iiiiii

Portuguese stress: previous analyses

I Weight effects on stress: only word-finally 5

Falling diphthongs or coda segments → heavy

I Weight-sensitivity is categorical 5

Word-final syllables: either H or L

I Syllable-based generalizations (rules and constraints) 5See Araujo (2007) for a comprehensive review

Garcia 2015 Stress and gradient weight in Portuguese 3 / 40

Page 4: Stress and gradient weight in Portuguese

iiiiii

Today: A novel analysis

I Weight effects → across the whole stress domain 3

I Weight-sensitivity is gradual 3

I Probabilistic interval1-based account 3I Economy, accuracy, empirical motivation

1Steriade 2012

Garcia 2015 Stress and gradient weight in Portuguese 4 / 40

Page 5: Stress and gradient weight in Portuguese

iiiiii

iPortuguese stressIntervals and syllables

iiQuestionsMethodsAnalysis

iiiSummaryFinal remarks

Garcia 2015 Stress and gradient weight in Portuguese 5 / 40

Page 6: Stress and gradient weight in Portuguese

iiiiii

Portuguese stressIntervals and syllables

Portuguese main stress

I Non-verbs ⇒ phonological factors + exceptional casesI Trisyllabic window: σσσ]PWd

ma.rı.ti.mo, ca.va.lo, fa.rol ‘maritime, horse, lighthouse’

I final and penult stress: (mostly) regularI antepenult stress: irregular/unpredictableI Weight effects only word-finally (See Araujo (2007) for a review)

I Verbs ⇒ morphological factors (Wetzels 2007)

Garcia 2015 Stress and gradient weight in Portuguese 6 / 40

Page 7: Stress and gradient weight in Portuguese

iiiiii

Portuguese stressIntervals and syllables

Non-verbsRegular patterns: 72% of the Portuguese Stress Corpus (Garcia 2014; Houaiss et al. 2001, n=154,083)

I Final stress (≈15%): papel ‘paper’

I Penult stress (≈57%): cavalo ‘horse’

I Standard view: Heavy σs only attract stress word-finally

Garcia 2015 Stress and gradient weight in Portuguese 7 / 40

Page 8: Stress and gradient weight in Portuguese

iiiiii

Portuguese stressIntervals and syllables

Non-verbsIrregular cases: 28% of the lexicon

σσσ ≈13% perola ‘pearl’σσL ≈11% cafe ‘coffee’σσH ≈4% nıvel ‘level’

I Previous work: Marked stress, extrametricality, catalexis andmorphological factors (Roca 1999; Araujo 2007)

Garcia 2015 Stress and gradient weight in Portuguese 8 / 40

Page 9: Stress and gradient weight in Portuguese

iiiiii

Portuguese stressIntervals and syllables

Portuguese stress

I Previous approaches:

i weight domain = σrhyme#ii weight is categorical, word-final and binary: H or Liii onsets do not contribute to weight

I Possible issues:I Antepenult stress restricted to XLL]PWd words. Why?

i.e., *XHL, *XLH, *XHHI Recent studies on onset effects (Gordon 2005, Topintzi 2010, Ryan 2014)

Do onsets affect stress in Portuguese? If so, how?

Garcia 2015 Stress and gradient weight in Portuguese 9 / 40

Page 10: Stress and gradient weight in Portuguese

iiiiii

Portuguese stressIntervals and syllables

Intervals and syllablesOnsets and weight

I Prediction: no effect or a positive correlation CCV ≥ CV

I Unexpected: CV > CCV, i.e., negative correlation

I Preview: This is found in the Portuguese lexicon (Houaiss et al. 2001)

Garcia 2015 Stress and gradient weight in Portuguese 10 / 40

Page 11: Stress and gradient weight in Portuguese

iiiiii

Portuguese stressIntervals and syllables

Intervals and syllables(LLL words)

Penult onset

0%

25%

50%

75%

100%

0 1 2Penult onset size

Stress: Antepenult Penult Final

Garcia 2015 Stress and gradient weight in Portuguese 11 / 40

Page 12: Stress and gradient weight in Portuguese

iiiiii

Portuguese stressIntervals and syllables

Intervals and syllables(LLL words)

Penult onset

0%

25%

50%

75%

100%

0 1 2Penult onset size

Stress: Antepenult Penult Final

Final onset

0%

25%

50%

75%

100%

0 1 2Final onset size

Stress: Antepenult Penult Final

Garcia 2015 Stress and gradient weight in Portuguese 12 / 40

Page 13: Stress and gradient weight in Portuguese

iiiiii

Portuguese stressIntervals and syllables

Intervals and syllablesDefining intervals (Steriade 2012)

Facts are consistent with interval theory

I Interval: rhythmic unit that spans from one V up to the nextV: no constituency assumed

CVCσ2 CCVCσ1 ⇒ 〈C〉 VCCCι2 VCι1

I More segments in one ι ⇒ longer duration ⇒ more weight

I Intervals argued to be the weight domain in unrelatedlanguages: Finnish, Norwegian, Greek, Latin, Bhojpuri andEstonian

How about Portuguese?

Garcia 2015 Stress and gradient weight in Portuguese 13 / 40

Page 14: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

Questions

i How does weight-sensitivity affect stress in Portuguese?

categorical or gradual?word-final or broader in the stress domain?

ii How do syllables and intervals differ in answering question i?

Garcia 2015 Stress and gradient weight in Portuguese 14 / 40

Page 15: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

MethodsThe Portuguese lexicon (Houaiss et al. 2001)

I Houaiss Dictionary (n = 442,000)

I Non-verbs: n = 154,083

I Representativeness: spoken vs. written Portuguesetwo frequency corpora: similar proportions of stress patterns(penult > final > antepenult)

I What factors best predict stress position?

Garcia 2015 Stress and gradient weight in Portuguese 15 / 40

Page 16: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

MethodsFactors: stress domain (σσσ | ιιι )

Syllable

σ

Onset

0-2

Rhyme

Nucleus

1-2

Coda

0-2

Interval

VCCC 4 VCC 3 ]PWd

Garcia 2015 Stress and gradient weight in Portuguese 16 / 40

Page 17: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

MethodsData: final syllable by stress position

Onset

0%

25%

50%

75%

100%

0 1 2Onset size

Stress: Final Penult Antepenult

Nucleus

0%

25%

50%

75%

100%

1 2Nucleus size

Stress: Final Penult Antepenult

Coda

0%

25%

50%

75%

100%

0 1Coda size

Stress: Final Penult Antepenult

Garcia 2015 Stress and gradient weight in Portuguese 17 / 40

Page 18: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

MethodsData: penult syllable by stress position

Onset

0%

25%

50%

75%

100%

0 1 2Onset size

Stress: Penult Final Antepenult

Nucleus

0%

25%

50%

75%

100%

1 2Nucleus size

Stress: Penult Final Antepenult

Coda

0%

25%

50%

75%

100%

0 1Coda size

Stress: Penult Final Antepenult

Garcia 2015 Stress and gradient weight in Portuguese 18 / 40

Page 19: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

MethodsData: antepenult syllable by stress position

Onset

0%

25%

50%

75%

100%

0 1 2Onset size

Stress: Antepenult Penult Final

Nucleus

0%

25%

50%

75%

100%

1 2Nucleus size

Stress: Antepenult Penult Final

Coda

0%

25%

50%

75%

100%

0 1Coda size

Stress: Antepenult Penult Final

Garcia 2015 Stress and gradient weight in Portuguese 19 / 40

Page 20: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

MethodsData: stress pattern by interval

3 (ant)

21% 1%74%4%

0%

25%

50%

75%

100%

0 1 2 3 4 5Interval size

Stress: Antepenult Penult Final

2 (pen)

6%

59%

32% 2%1%

0%

25%

50%

75%

100%

1 2 3 4 5Interval size

Stress: Penult Final Antepenult

1 (fin)

21%

54%

24% 1%

0%

25%

50%

75%

100%

1 2 3 4Interval size

Stress: Final Penult Antepenult

Garcia 2015 Stress and gradient weight in Portuguese 20 / 40

Page 21: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

MethodsVariables predicting stress: Ordinal Regression* (clm() in R)

Syllable model9 predictors (3/σ)

onset (y/n)

nucleus (monoph-/diphthong)

coda (y/n)

(binary)

Interval model3 predictors (1/ι)

int.0

int.1

int.2

Response:3 > 2 > 1 ]Pwd

Garcia 2015 Stress and gradient weight in Portuguese 21 / 40

Page 22: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

Methods: Model evaluation

I Accuracy: What % of the lexicon is accurately predicted?

I Consistency: Which model is more theoretically consistent?

I Fit: Which model has a better fit (aic, fit vs. df)?

Garcia 2015 Stress and gradient weight in Portuguese 22 / 40

Page 23: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

Analysis: Syllable model

In what positions does weight affect stress?

What has been argued: CVC.CVC.C VC #

Actual patterns: CV C. CVC . CVC #

Garcia 2015 Stress and gradient weight in Portuguese 23 / 40

Page 24: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

Analysis: Syllable model

I Onset behaviour: Inconsistent with syllable theory

CVC—C2VC—C1VC#

1. Final onsets → penult stress β = 1.52 p < 0.00001

2. Penult onsets → antepenult stress β = 0.50 p < 0.00001

Garcia 2015 Stress and gradient weight in Portuguese 24 / 40

Page 25: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

Analysis: Interval model

I Longer intervals → higher stress likelihood

I Weight gradience: between and within interval

CVCC—VCC—VC#

int1 → final stress β = −2.01 p < 0.00001

int2 → penult stress β = −0.25 p < 0.00001

int3 → antepenult stress β = 0.32 p < 0.00001

Garcia 2015 Stress and gradient weight in Portuguese 25 / 40

Page 26: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

Question i: Weight gradience (regression weights by model)

Syllables

0.0

2.5

5.0

7.5

10.0

Antepenult Penult Final

Pred

icted

effe

ct

Constituent: Coda Nucleus Onset

Intervals

-2

-1

0

1

2

Antepenult Penult Finalβ

Garcia 2015 Stress and gradient weight in Portuguese 26 / 40

Page 27: Stress and gradient weight in Portuguese

iiiiii

QuestionsMethodsAnalysis

Question ii: Syllables vs. intervals

Domain Predictors aic κ Accuracy

Syllable 9 186433.21 40.79 74.75%

Interval 3 180035.58 15.38 78.29%

Intervals:

I More accurate

I More empirically motivated

I More parsimonious

Garcia 2015 Stress and gradient weight in Portuguese 27 / 40

Page 28: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

Summary of proposal

I Weight effects on stress are gradual, not categorical

I Intervals offer a better analysis of stress in Portuguese

I Onset effects are accurately captured

I Crucially: only one predictor, namely, weight

I Probabilistic: compatible with constraint-based grammars

Garcia 2015 Stress and gradient weight in Portuguese 28 / 40

Page 29: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

Final remarks

I Weight effects in Portuguese are gradient

I Onset effects suggest that weight is computed in intervalsI Advantages of a probabilistic interval-based approach:

I Economy, empirical motivation and accuracyI Nearly 80% accuracy vs. 72% regular casesI Only one positionally-defined predictor/constraint

I Probabilistic constraint-based implementation (E.g., MaxEnt)

Garcia 2015 Stress and gradient weight in Portuguese 29 / 40

Page 30: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

Main references

Araujo, G. A. (Ed.) (2007). O acento em Portugues: abordagens fonologicas. SaoPaulo: Parabola.

Gordon, M. (2005). A perceptually-driven account of onset-sensitive stress. NaturalLanguage & Linguistic Theory, 23(3), 595–653.

Hayes, B., & Wilson, C. (2008). A maximum entropy model of phonotactics andphonotactic learning. Linguistic Inquiry, 39(3), 379–440.

Houaiss, A., Villar, M., & de Mello Franco, F. M. (2001). Dicionario eletronicoHouaiss da lıngua portuguesa. Rio de Janeiro: Objetiva.

Prince, A. (1990). Quantitative consequences of rhythmic organization. Cls, 26(2),355–398.

Garcia 2015 Stress and gradient weight in Portuguese 30 / 40

Page 31: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

Main references

Roca, I. M. (1999). Stress in the Romance languages. In H. van der Hulst (Ed.) WordProsodic Systems in the Languages of Europe, (pp. 672–811). Berlin: Mouton deGruyter.

Ryan, K. M. (2014). Onsets contribute to syllable weight: statistical evidence fromstress and meter. Language, 90(2), 309–341.

Steriade, D. (2012). Intervals vs. syllables as units of linguistic rhythm. Handouts,EALING, Paris.

Topintzi, N. (2010). Onsets: suprasegmental and prosodic behaviour. New York:Cambridge University Press.

Wetzels, W. L. (2007). Primary word stress in Brazilian Portuguese and the weightparameter. Journal of Portuguese Linguistics, 5, 9–58.

Garcia 2015 Stress and gradient weight in Portuguese 31 / 40

Page 32: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

Many thanks to Heather Goad, Morgan Sonderegger, Kie Zuraw,Michael Wagner and Colin Wilson.

Thank you | Obrigado

guilherme.ca

Garcia 2015 Stress and gradient weight in Portuguese 32 / 40

Page 33: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

Appendix I: Syllable model

σ predictor β e|β| se(β) z value p value

onset.fin 1.52 4.59 0.02 53.97 < 0.00001nucleus.fin −2.86 17.55 0.02 −137.99 < 0.00001coda.fin −4.68 108.10 0.02 −194.62 < 0.00001

onset.pen 0.50 1.65 0.02 23.75 < 0.00001nucleus.pen −1.32 3.75 0.02 −46.18 < 0.00001coda.pen −1.09 3.00 0.01 −63.85 < 0.00001

onset.ant 0.24 1.27 0.02 11.64 < 0.00001nucleus.ant −1.06 2.90 0.02 −40.50 < 0.00001coda.ant −0.02 1.02 0.01 −1.646 0.0999

θ = {−1.64, 3.34} aic: 186433.21 Accuracy: 74.75% κ = 40.79

Garcia 2015 Stress and gradient weight in Portuguese 33 / 40

Page 34: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

Appendix II: Interval model

ι predictor β e|β| se(β) z value p value

int1 −2.01 7.46 0.01 −196.60 < 0.00001int2 −0.25 1.28 0.01 −35.33 < 0.00001int3 0.32 1.38 0.01 43.82 < 0.00001

θ = {−2.21, 2.63} aic: 180035.58 Accuracy: 78.29% κ = 15.38

Garcia 2015 Stress and gradient weight in Portuguese 34 / 40

Page 35: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

Appendix III: Edge effects

[〈C〉VC vs. [CVC → No onset extrametricality in the lexicon

I Onsets at the left edge have a positive effect on stress. Why?

I Hypothesis: Parsing is exhaustive

I Test: words with 2, 3 and 4 syllables

Garcia 2015 Stress and gradient weight in Portuguese 35 / 40

Page 36: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

If onsets have to be parsed...

Overall, penult onsets have a negative effect on penult stress.

How about disyllabic words? Do we find a null effect?

2σ ([CV...) β = 0.18 p < 0.002

> 2σ β = -1.39 p < 0.00001

Onsets have a positive effect at the left edge of the domain

Garcia 2015 Stress and gradient weight in Portuguese 36 / 40

Page 37: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

Appendix IV: Constraint-based approach

An adaptation of the Weight-to-Stress Principle (Prince 1990)

Let ιu be an unstressed interval:

wspιu Assign one ∗ to every segment in ιu

Example: MaxEnt Grammar (Hayes & Wilson 2008)

Garcia 2015 Stress and gradient weight in Portuguese 37 / 40

Page 38: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

MaxEnt: Lexical modelling

I Lexicon = {different combinations of interval sequences}I So, amostragem ‘sampling’ ⊆ {4-2-2} (VCCC-VC-VC)

I Assume inputs = unique sequences

I Each input includes a non-empty set of words

I Sets of words (inputs) have different stress patterns

I Problem: sets → very skewed lexical distribution

Garcia 2015 Stress and gradient weight in Portuguese 38 / 40

Page 39: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

MaxEntLearned weights for all words in Portuguese

Constraint wsp3 wsp2 wsp1

Weight 0.00 0.27 0.83

I Weight gradience across the stress domainI Antepenult interval: No weight effect learned

I Lexical skewness: 37 out of 86 inputs → 99% of the lexiconI Model treats each input equally

Garcia 2015 Stress and gradient weight in Portuguese 39 / 40

Page 40: Stress and gradient weight in Portuguese

iiiiii

SummaryFinal remarks

MaxEntExample

/VCCCVCVC/ wsp1 wsp2 wsp3 H(x) MaxEnt P(x) Freq(x)

[VCCC•VC•VC] 0 2 4 0.54 0.58 0.66 0.63

[VCCC•VC•VC] 2 0 4 1.66 0.19 0.22 0.36

[VCCC•VC•VC] 2 2 0 2.2 0.11 0.13 0.01

Garcia 2015 Stress and gradient weight in Portuguese 40 / 40