automatic classification of lexical stress errors for german capt - anjana vakil · 2017-08-25 ·...

19
Automatic classification of lexical stress errors for German CAPT Anjana Vakil and J¨ urgen Trouvain Department of Computational Linguistics and Phonetics Saarland University, Saarbr¨ ucken, Germany SLaTE 2015, Leipzig 4 September 2015

Upload: others

Post on 17-Jun-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Automatic classification of lexical stress errorsfor German CAPT

Anjana Vakil and Jurgen Trouvain

Department of Computational Linguistics and PhoneticsSaarland University, Saarbrucken, Germany

SLaTE 2015, Leipzig4 September 2015

Page 2: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Background & motivation

Lexical stress: Accentuation/prominence of syllable(s) in a word

In German:I Variable placement, contrastive function

um·FAHR·en vs. UM·fahr·ento drive around to run over

I Reflected by duration, F0, intensityI Impacts intelligibility of non-native (L2) speech

1 / 14

Page 3: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Background & motivation

I Contrastive lexical stress (LS) difficult for French speakersI CAPT can help; requires automatic diagnosisI Classification of LS errors in L2 German unexplored

Classification of LS errors by French learners of GermanHow feasible is it?Which features are most useful?

2 / 14

Page 4: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Data

Subset of IFCASL corpus of French-German speech(Fauth et al. 2014)

Extracted utterances of 12 bisyllabic, initial-stress wordsI 668 tokens from 56 French speakers - manually annotatedI 477 tokens from 40 German speakers - assumed correct

3 / 14

Page 5: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Data annotation

I Each token assigned a class label:[correct], [incorrect], [none][bad nsylls], [bad audio]

I 15 annotators (12 native), each token labeled by ≥2I Varying phonetics/phonology expertise

4 / 14

Page 6: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Data annotation

Overall pairwise inter-annotator agreement

Mean Maximum Median Minimum% Agreement 54.92% 83.93% 55.36% 23.21%Cohen’s κ 0.23 0.61 0.26 -0.01

I Variability not explained by annotator L1 or expertiseI Single gold-standard label selected for each token

5 / 14

Page 7: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Data annotation results

6 / 14

Page 8: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Method

Train & evaluate CART classifiers using WEKA toolkit

Training dataI Manually annotated L2 utterancesI Automatically annotated L1 utterances (all [correct])

Held-out testing dataI Feature comparison: 1/10 of L2 utterances (random)I Unseen speakers: all utterances from 1 of 56 L2 speakers

EvaluationI Compute agreement (% and κ) with gold standardI Cross-validation (10 or 56 folds)

7 / 14

Page 9: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Feature sets

Prosodic feature setsI DUR - Duration (relative syllable & nucleus lengths)I F0 - Fundamental frequency (mean, max., min., range)I INT - Intensity (mean, max.)

Pitch and energy contours calculated using JSnoori software(http://jsnoori.loria.fr)

8 / 14

Page 10: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Feature sets

Other featuresI WD - Word uttered (e.g. Flagge)I LV - Speaker’s CEFR skill level (A2|B1|B2|C1)I AG - Speaker’s age/gender (Girl|Boy|Woman|Man)

9 / 14

Page 11: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Results

10 / 14

Page 12: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Results

10 / 14

Page 13: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Results

11 / 14

Page 14: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Results

11 / 14

Page 15: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Results

11 / 14

Page 16: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Results

12 / 14

Page 17: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Results

12 / 14

Page 18: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Results

% agreement κ

Best classifier vs. gold standardRandom test set 71.87% 0.34Unseen speakers 70.22% 0.24

Majority ([correct]) classifier vs. gold 63.77% 0.00Human vs. human 54.92% 0.23

I Results are encouraging in this contextI Still want better performance for real-world use

13 / 14

Page 19: Automatic classification of lexical stress errors for German CAPT - Anjana Vakil · 2017-08-25 · Automatic classification of lexical stress errors for German CAPT Anjana Vakil

Conclusion

I Classification-based diagnosis of lexical stress errorsnovel approach in German CAPT

I Results of >70% accuracy encouraging(especially considering low human-human agreement)

I Still much room for improvement

Future directionsI More powerful machine learning algorithmsI Additional features (e.g. vowel quality, phrase information)I Online, semi-supervised learning/active learning

14 / 14