surveying the nasal peak: a1 and p0 in nasal and nasalized vowels · 2020. 12. 8. · shosted, c....

1
Corpus of nasality in coarticulatory and contrastive contexts 160 English words in oral (CVC) vs. nasal (CVN/NVC/NVN) sets, 10 vowels 12 college-aged speakers from the US = 3823 tokens 120 French words with phonemically oral vs. nasal vowels, /ɛ, ɑ, ɔ/ & /ɛ ̃ , ɑ ̃ , ɔ ̃ / 8 speakers from Northern France = 995 tokens All tokens were machine-aligned with vowel boundaries hand-confirmed Features extracted automatically by Praat script at 1/3 and 2/3 of vowel dur. 35 features associated with nasality examined (7 reported here) Surveying the nasal peak: A1 and P0 in nasal and nasalized vowels Will Styler and Rebecca Scarborough University of Colorado Phonetics Lab Research Goals Results and Findings Nasality is commonly measured using "A1-P0", a composite spectral measure which compares the first formant with a low frequency nasal peak We hope to better understand this measure, as well as its acoustic components We're laying the groundwork for deeper perceptual study of nasality We want to know how to artificially manipulate nasalization We're searching for cross-linguistic differences in the realization of nasality Our Approach: Measure various features in actual language data and examine their relationship to coarticulatory and contrastive nasality Acoustical Features P0 Amplitude (dB) - Amplitude of the higher of H1 vs. H2 A1 Amplitude (dB) - Amplitude of the the first formant's highest harmonic A1-P0 (dB) - The difference of amplitude between A1 and P0 A1-P0 Compensated (dB) - A1-P0 adjusted using Chen's Compensation formula, which minimizes the effect of formant variations (from Chen 1997) P0 Prominence (dB) - The difference of P0's Amplitude from the average amplitude of the two surrounding harmonics References Chen, Marilyn Y. (1997) Acoustic correlates of English and French nasalized vowels, J. Acoust. Soc. Am. 102, 2360 || C. Carignan, R. Shosted, C. Shih, and P. Rong. (2011) Compensatory articulation in American English nasalized vowels. Journal of Phonetics, 39(4):668 – 682 || C. Carignan (2014) An acoustic and articulatory examination of the oral in nasal: The oral articulations of French nasal vowels are not arbitrary. Journal of Phonetics, 46(0):22-33 #1aSC8 Data Collection Implications for Measurement and ASR Similar measures are predictive of nasality across both English (with coarticulatory nasality) and French (with contrastive nasality) There are differences with regard to the details of realization (especially in direction of change in P0 relative to oral vowels) This may be under speaker control (e.g. Carignan et al. 2011, Carignan 2014) The best acoustical correlates of nasality - and the perceptual cues they represent - are relative Nasal Peak vs. A1, Nasal Peak vs. Surrounding Spectrum, Nasal vs. Oral With better understanding of acoustical details, we can manipulate and test the perceptual consequences of these features and of types and patterns of nasality A1-P0 remains the most reliable measure for nasality Measurements are improved by using Chen's formant compensation formula A1-P0 performs well across vowels, speakers and languages The degree of expected change will vary by vowel, speaker, and language P0 prominence is an additional reliable feature for nasality Use of formant frequency for classification uses language-specific patterns Statistical Analysis Linear Mixed-Effects Regressions (Feature ~ Nasality) using repetition, vowel and timepoint as factors, speaker and word as random factors Separate analyses for each feature in English and French data Analysis performed using the R Statistics Software Suite Frequency (Hz) Amplitude (dB) English Nasals P0 1.42 dB French Nasals P0 0.935 dB English Nasals A1 2.5 dB French Nasals A1 6.6 dB English Nasals Prominence 1.86 dB French Nasals Prominence 2.663 dB P0 F1 English Nasals A1 32 Hz French Nasals A1 23 Hz (N.S) Table 1: Means, Coefficients and | t | stats showing the change in each feature in "oral" vs. "nasal" vowels 152.795 56.121 +0.787 -1.714 -1.794 -4.173 -2.425 English-French Difference -3.559 -177.572 -1.31 (N.S) -23.612 7.890 2.663 -14.069 -5.703 -13.932 -5.733 -13.116 -6.676 -4.095 -0.935 French | t | Value French Coefficient 1410.198 1587.827 613.365 637.056 5.645 2.995 2.932 8.678 0.719 6.497 38.690 45.604 37.970 39.107 French Nasal Mean French Oral Mean F2 Frequency F1 Frequency P0 Prominence A1-P0 Comp. A1-P0 A1 Amplitude P0 Amplitude Feature 1762.837 1770.088 629.036 601.869 12.083 10.290 1.133 5.099 -0.982 2.932 43.962 46.445 44.944 43.513 English Nasal Mean English Oral Mean -0.92 (N.S) -24.777 3.266 32.509 7.724 1.876 -15.099 -3.989 -14.873 -3.939 -11.155 -2.503 11.018 1.429 English | t | Value English Coefficient ~250 Hz ~600 Hz Frequency (Hz) 0 3000 Sound pressure level (dB / Hz) 0 20 40 0 600 1200 1800 2400 3000 Frequency (Hz) 0 3000 Sound pressure level (dB / Hz) 0 20 40 0 600 1200 1800 2400 3000 Fig 2: FFT at midpoint of "bombes" showing a French nasal vowel Fig 1: FFT at midpoint of "bob" showing a French oral vowel A1-P0 is strongly affected by the coupling with the nasal cavity Lowering of A1 is the primary driver of this effect P0 does not reliably gain amplitude in nasal vowels ... but it very often gains prominence relative to the surrounding harmonics This "Prominence" of the nasal peak is another useful measure of nasality Chen's formant compensation formula enhances the nasal/oral difference Formant frequency changes from nasality are language-specific These are unlikely to be direct consequences of the nasal coupling Linguistic Significance For more information: http://savethevowels.org/asa2014 Ongoing work Evaluate relationship between nasality and the additional 28 features measured Investigate which of these features are used in human perception of nasality Use this understanding to reliably model (and modify) nasality in speech Continue exploring cross-linguistic differences in the realization of nasality

Upload: others

Post on 10-Feb-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

  • • Corpus of nasality in coarticulatory and contrastive contexts

    • 160 English words in oral (CVC) vs. nasal (CVN/NVC/NVN) sets, 10 vowels• 12 college-aged speakers from the US = 3823 tokens

    • 120 French words with phonemically oral vs. nasal vowels, /ɛ, ɑ, ɔ/ & /ɛ,̃ ɑ̃, ɔ̃/• 8 speakers from Northern France = 995 tokens

    • All tokens were machine-aligned with vowel boundaries hand-confirmed

    • Features extracted automatically by Praat script at 1/3 and 2/3 of vowel dur.

    • 35 features associated with nasality examined (7 reported here)

    Surveying the nasal peak: A1 and P0 in nasal and nasalized vowelsWill Styler and Rebecca Scarborough

    University of Colorado Phonetics LabResearch Goals Results and Findings• Nasality is commonly measured using "A1-P0", a composite spectral measure

    which compares the first formant with a low frequency nasal peak

    • We hope to better understand this measure, as well as its acoustic components

    • We're laying the groundwork for deeper perceptual study of nasality

    • We want to know how to artificially manipulate nasalization

    • We're searching for cross-linguistic differences in the realization of nasality

    • Our Approach: Measure various features in actual language data and examine their relationship to coarticulatory and contrastive nasality

    Acoustical Features• P0 Amplitude (dB) - Amplitude of the higher of H1 vs. H2

    • A1 Amplitude (dB) - Amplitude of the the first formant's highest harmonic

    • A1-P0 (dB) - The difference of amplitude between A1 and P0

    • A1-P0 Compensated (dB) - A1-P0 adjusted using Chen's Compensation

    formula, which minimizes the effect of formant variations (from Chen 1997)

    • P0 Prominence (dB) - The difference of P0's Amplitude from the average

    amplitude of the two surrounding harmonics

    ReferencesChen, Marilyn Y. (1997) Acoustic correlates of English and French nasalized vowels, J. Acoust. Soc. Am. 102, 2360 || C. Carignan, R.

    Shosted, C. Shih, and P. Rong. (2011) Compensatory articulation in American English nasalized vowels. Journal of Phonetics, 39(4):668

    – 682 || C. Carignan (2014) An acoustic and articulatory examination of the oral in nasal: The oral articulations of French nasal

    vowels are not arbitrary. Journal of Phonetics, 46(0):22-33

    #1aSC8

    Data Collection

    Implications for Measurement and ASR

    • Similar measures are predictive of nasality across both English (with coarticulatory nasality) and French (with contrastive nasality)

    • There are differences with regard to the details of realization (especially in direction of change in P0 relative to oral vowels)• This may be under speaker control (e.g. Carignan et al. 2011, Carignan 2014)

    • The best acoustical correlates of nasality - and the perceptual cues they represent - are relative• Nasal Peak vs. A1, Nasal Peak vs. Surrounding Spectrum, Nasal vs. Oral

    • With better understanding of acoustical details, we can manipulate and test the perceptual consequences of these features and of types and patterns of nasality

    • A1-P0 remains the most reliable measure for nasality• Measurements are improved by using Chen's formant compensation formula• A1-P0 performs well across vowels, speakers and languages• The degree of expected change will vary by vowel, speaker, and language

    • P0 prominence is an additional reliable feature for nasality• Use of formant frequency for classification uses language-specific patterns

    Statistical Analysis• Linear Mixed-Effects Regressions (Feature ~ Nasality) using repetition,

    vowel and timepoint as factors, speaker and word as random factors

    • Separate analyses for each feature in English and French data

    • Analysis performed using the R Statistics Software Suite

    Frequency (Hz)

    Amplitude (dB)

    English NasalsP0 ↑ 1.42 dB

    French NasalsP0 ↓ 0.935 dB

    English NasalsA1 ↓ 2.5 dB

    French NasalsA1 ↓ 6.6 dB

    English NasalsProminence ↑ 1.86 dB

    French NasalsProminence ↑ 2.663 dB

    P0

    F1

    English NasalsA1 → 32 Hz

    French NasalsA1 ← 23 Hz (N.S)

    Table 1: Means, Coefficients and | t | stats showing the change in each feature in "oral" vs. "nasal" vowels

    152.795

    56.121

    +0.787

    -1.714

    -1.794

    -4.173

    -2.425

    English-FrenchDifference

    -3.559-177.572

    -1.31 (N.S)-23.612

    7.8902.663

    -14.069-5.703

    -13.932-5.733

    -13.116-6.676

    -4.095-0.935

    French | t | Value

    French Coefficient

    1410.1981587.827

    613.365637.056

    5.6452.995

    2.9328.678

    0.7196.497

    38.69045.604

    37.97039.107

    FrenchNasal Mean

    FrenchOral Mean

    F2 Frequency

    F1 Frequency

    P0 Prominence

    A1-P0 Comp.

    A1-P0

    A1 Amplitude

    P0 Amplitude

    Feature

    1762.8371770.088

    629.036601.869

    12.08310.290

    1.1335.099

    -0.9822.932

    43.96246.445

    44.94443.513

    EnglishNasal Mean

    EnglishOral Mean

    -0.92 (N.S)-24.777

    3.26632.509

    7.7241.876

    -15.099-3.989

    -14.873-3.939

    -11.155-2.503

    11.0181.429

    English| t | Value

    EnglishCoefficient

    ~250 Hz ~600 Hz

    Frequency (Hz)0 3000

    Soun

    d pr

    essu

    re le

    vel (

    dB/H

    z)

    0

    20

    40

    0 600 1200 1800 2400 3000

    Frequency (Hz)0 3000

    Soun

    d pr

    essu

    re le

    vel (

    dB/H

    z)

    0

    20

    40

    0 600 1200 1800 2400 3000

    Fig 2: FFT at midpoint of "bombes" showing a French nasal vowel

    Fig 1: FFT at midpoint of "bob" showing a French oral vowel• A1-P0 is strongly affected by the coupling with the nasal cavity

    • Lowering of A1 is the primary driver of this effect• P0 does not reliably gain amplitude in nasal vowels• ... but it very often gains prominence relative to the surrounding harmonics

    • This "Prominence" of the nasal peak is another useful measure of nasality• Chen's formant compensation formula enhances the nasal/oral difference• Formant frequency changes from nasality are language-specific

    • These are unlikely to be direct consequences of the nasal coupling

    Linguistic Significance

    For more information:

    http://savethevowels.org/asa2014

    Ongoing work• Evaluate relationship between nasality and the additional 28 features measured• Investigate which of these features are used in human perception of nasality• Use this understanding to reliably model (and modify) nasality in speech

    • Continue exploring cross-linguistic differences in the realization of nasality