1.0 introduction traditional view of phonetic laryngeal contrasts (/t/~/d/, voicing): f0 drop, f1...

23
1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et al. 2008, Lisker 1986, Raphael 1972, Port & Dalby 1982, Stevens 1998) 1. Often have samples that don’t conform to norms (Fig. 1) 2. No single measure found to universally correlate with voicing Alternative View: termination of the preceding vowel, nature of RMS drop (Parker 1974) Goal 1. Better characterize VOICING with measures available 2. Evaluate measures (RMS) associated with vowel transition

Upload: cortez-chunn

Post on 22-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

1.0 Introduction

• Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et al. 2008, Lisker 1986, Raphael 1972, Port & Dalby 1982, Stevens 1998)

1. Often have samples that don’t conform to norms (Fig. 1)2. No single measure found to universally correlate with voicing

• Alternative View: termination of the preceding vowel, nature of RMS drop (Parker 1974)

Goal1. Better characterize VOICING with measures available2. Evaluate measures (RMS) associated with vowel transition

Page 2: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Figure 1. Formant plots (a) normal /bat/ (b) normal /bad/ (c) abnormal /bat/ (d) abnormal /bad/. Blue plane is RMS knee, green plane is lingual closure.

Page 3: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

2.0 Experiment 1

Conventional wisdom: Frequency valuesFormant transitions will show trend for /t/ vs. /d/f0 transition will show well-defined trend for /t/ vs. /d/

Test energy vs frequency information

Page 4: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

2.1 Methodology N=27 female speakers from Upper Midwest (MN, WI) Context: bat and bad, isolated and sentence (“Use this new

bat”/”Make this one bad”) X-ray microbeam tongue-tip data Acoustic Measures

1. Frequency Characteristics Change in F1 frequency prior to formant end Change in f0 frequency prior to formant end

2. Temporal Energy Characteristics Formant end: 15 dB drop in F2 from local maximum

(Fig. 2) RMS knee: RMS derivative exceeds negative threshold

of -0.33 dB/ms (Fig. 3)

Page 5: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et
Page 6: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Figure 3. RMS and rate of change of RMS (a) bat (b) bad. (a) Syllable final /t/ shows RMS knee prior to occlusion (vertical green) while (b) final /d/ shows RMS knee close to occlusion.

Increase in rateof RMS dropbrought about byocclusion (greenline).

RMS kneeRMS drop broughtabout by somethingother than occlusion

a)

RMS kneeRMS drop broughtabout by occlusion(green line).

b)

Page 7: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

2.2 Results

ANOVA, means for acoustics (Table 1, p<0.05, df = [1, 104])

Energy factors: significant interactions (Fig. 4) for F2 end (F=30.3) and RMS knee (30.3)

Frequency factors: only significant main effect for f0 change for VOICING (F=5.3)

Page 8: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Gesture F2 db RMS F1 f0

end end knee Δ Δ

(ms) (ms) (ms) (Hz) (Hz)

/t/

Isolation 266 231 201 197 47

(31) (27) (29) (174) (95)

Sentence 269 219 188 151 19

(30) (28) (19) (180) (71)

/d/

Isolation 328 313 317 226 -6

(36) (37) (39) (89) (10)

Sentence 262 236 231 192 14

(31) (31) (35) (70) (61)

Table 1. Means and standard deviations.

Page 9: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Figure 4. Comparison of Gesture end with F2 end.

Page 10: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Covariation with lingual closure (r2, Table 2, p<0.05) Energy factors: significant for both factors in all

conditions except /t/ in sentences (Fig. 4) Frequency factors: no significant covariation

r2 F2 dB end RMS knee F1 Δ f0 Δ/t/Isolation0.76* 0.72* n.s. n.s.Sentence 0.40* n.s. n.s. n.s./d/Isolation0.94* 0.92* n.s. n.s.Sentence 0.93* 0.59* n.s. n.s.

Table 2. Covariation with lingual closure. (*p<0.05)

Page 11: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

2.3 Discussion

Final /t/ characteristics RMS knee prior to occlusion (Fig. 3a) phonation terminates prior to occlusion

Final /d/ characteristics RMS knee at time of occlusion if present (Fig. 3b) phonation continues past occlusion

Formant transitions not significant for voicing f0 transition not significant for voicing

Page 12: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Interpretation Energy for final /d/ determined by supra-laryngeal gestures (lingual closure and enhancing gestures) Energy for final /t/ determined by laryngeal gesture (Fig. 5) Traditional measures of frequency for f0 and F1 weak correlation with voicing at best

Page 13: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Figure 5. Timing diagram. Shows supra-laryngeal gesture for syllable final /d/, laryngeal termination gesture for final /t/.

approach hold release

approach hold release

phonation

phonation

/d/

/t/

termination gesture

enhancing gesture

Page 14: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

3.0 Experiment 2

Test role of phonation and relation between energy and frequency

Exploit difference in lower harmonic behavior between final /t/ and /d/ (Fig. 6)Examine energy behavior in lower harmonics relative to energy in formants

Page 15: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Figure 6. Harmonic plots (a) bat (b) bad. Final /t/ shows simultaneous end of lower 3 harmonics, final /d/ shows continuation of h1 relative to higher harmonics.

h3

h1h2h2

h1

a) b)

h3

Page 16: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

3.1 Methodology

Same data set as Experiment 1 Acoustic Measures

1. Frequency Characteristics Change in F1, F2, F3 frequency prior to formant end Change in f0 frequency prior to formant end

2. Temporal Energy Characteristics H1 end: time to 15 dB drop in first harmonic H2 end: time to 15 db drop in second harmonic Formant end: 15 dB drop in F2 from local maximum H1 energy: average energy over 30 msec prior to formant

end for 150-350 Hz (Fig. 7)

Page 17: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Figure 7. Formant energy and first harmonic energy (a) bat (b) bad. Shading shows energy in first harmonic at formant end (vertical aqua line).

F1

F2

F3

F1F2

F3h1

h1

a) b)

Page 18: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

3.2 Results

ANOVA, means for acoustics (Table 3, p<0.05, df = [1, 104])

Energy factors: significant main effect for VOICING: h1-h2 (F=37.6), energy h1-h2 (F=16.5) and mean energy h1(F=361.1); for Condition: h1-h2 (F=40.1)

Frequency factors: significant main effect for VOICING: F2 change (F=4.1) and f0 change (F=5.3)

Page 19: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Table 3. Means and standard deviations for select measures.

F2 f0 h1-h2 mean

Δ Δ (ms) energy h1

(Hz) (Hz) (dB)

/t/

Isolation 98 47 181 -67

(427) (95) (102) (3)

Sentence 60 19 -73 -67

(383) (71) (139) (3)

/d/

Isolation -16 -6 337 -52

(132) (10) (183) (3)

Sentence -59 14 174 -54

(91) (61) (231) (5)

Page 20: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

Figure 8. Comparison of F2 end with mean H1 energy. Box identifies region of /t/ distinct from /d/, regardless of context.

Page 21: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

3.3 Discussion

• /t/ vs /d/ difference• Vowel end determined by lingual closure for /d/• Vowel end determined by laryngeal termination gesture for /t/• Relation between H1 energy prior to vowel end and VOICING is significant (Fig. 8)

•Implications•Confirm importance of vowel duration•Energy levels as opposed to frequency values (Parker 1974)

•Energy measures greater correlation with VOICING than traditional measures(f0 delta, F1 delta)

• Energy measures during transition informative for consonant voicing in addition to vowels (Jenkins et al. 1983)

Page 22: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

4.0 Conclusion

•Work to date on speech perception in post-vocalic stops has focused on frequency transitions, percent closure voicing, vowel duration.• Energy characteristics can provide more robust acoustic measures for voicing than traditional measures.• /t/ is characterized by a phonation termination gesture (glottal stop) which has implications for phonological theories.•Going forward: investigate importance of energy in perception.

Page 23: 1.0 Introduction Traditional View of phonetic laryngeal contrasts (/t/~/d/, VOICING): F0 drop, F1 drop, pulsing in the gap, CV Ratio, etc. (Kingston et

ReferencesJenkins, J. et al., 1983, Perception & Psychophysics, 34, 441-450.Kingston, J. et al., 2008, J Phonetics, 36, 28-54.Lisker, L., 1986. Language & Speech, 29, 3-11.Parker, F., 1974, J Phonetics 2, 211-221.Peterson, G. & Lehiste, I., 1960, JASA, 32, 693-703.Port, R. & Dalby, J., 1982, Perception & Psychophysics, 32, 141-152.Raphael, L., 1972, JASA, 51, 1296-1303.Stevens, K. 1998. Acoustic Phonetics, Cambridge, MA: MIT Press.AcknowledgementsWe would like to thank Eric Raimy for discussion. All mistakes are our own.

Contact [email protected] [email protected] [email protected]