proceedings - gbv · 2007-03-16 · proceedings october 3-6, 1996 wyndham franklin plaza hotel...
TRANSCRIPT
Proceedings
October 3-6, 1996Wyndham Franklin Plaza Hotel
Philadelphia, PA, USA
ICSLP 96
Sponsored by
University of DelawareAlfred I. duPont Institute
in conjunction withAcoustical Society of America
Acoustical Society of JapanAmerican Speech-Language-Hearing Association
Australian Speech Science and Technology AssociationEuropean Speech Communication Association
IEEE Signal Processing SocietyIncorporated Canadian Acoustical Association
International Phonetic AssociationLinguistic Society of America
UB/TIB Hannover116 238 895
89
Fourth International Conference on Spoken Language Processing
1229 Formant Analysis Using Mixtures of GaussiansParham Zolfaghari, Tony Robinson
1233 Deriving Articulatory Representations from Speech withVarious Excitation ModesHywel B. Richards, John S. Mason, Melvyn J. Hun:,John S. Bridle
1237 "Blind" Speech Segmentation: Automatic Segmentationof Speech Without Linguistic KnowledgeManish Sharma, Richard J. Mammone
1241 Speech Synthesis Using a Nonlinear Energy DampingModel for the Vocal Folds Vibration EffectHiroshi Ohmura, Kazuyo Tanaka
1245 Neural Networks Learning with LI Criteria and ItsEfficiency in Linear Prediction of Speech SignalsMunehiro Namba, Hiroyuki Kamata, Yoshihisa Ishida
1249 Preprocessing and Neural Classification of English StopConsonants [b,d,g,p,Uc]A. Esposito, C. E. Ezin, M. Ceccarelli
1253 A Comparison of Modified k-means(MKM) and NNbased Real Time Adaptive Clustering Algorithms forArticulatory Space Codebook FormationK.S. Ananthakrishnan
1257 A Novel Approach to the Estimation of Voice Sourceand Vocal Tract Parameters from Speech SignalsWen Ding, Hideki Kasuya
1261 Syllable Detection in Read and Spontaneous SpeechHanmut R. Pfitzinger, Susanne Burger, Sebastian Held
1265 Maximum Likelihood Learning of Auditory FeatureMaps for Stationary VowelsKuansan Wang, Chin-Hui Lee, Biing-Hwang Juang
1269 Explicit Segmentation of Speech using Gaussian ModelsAntonio Bonafonte, Albino Nogueiras, AntonioRodriguez-Garrido
1273 A Comparison of Several Recent Methods ofFundamental Frequency and Voicing DecisionEstimationE. Mousset, W.A. Ainsworth, Jose A. R. Fonollosa
1277 Robust Pitch Estimation with Harmonics Enhancementin Noisy Environments Based on InstantaneousFrequencyToshihiko Abe, Takao Kobayashi, Satoshi Imai
1281 Integrated Polispectrum on Speech RecognitionAsuncidn Moreno, Miquel Rutlldn
FrP2Sl - Physics and Simulation of the Vocal Tract
1285 Analysis of Acoustic Properties of the Nasal Tract Using3-DFEMHisayoshi Suzuki, Takayoshi Nakai, Hirosi Sakakibara
1289 Experiments with Analysis By Synthesis of GlottalAirflowJohan Liljencrants
SaAJLl - Speech Recognition Using HMMsandfWs
1293 An Incremental Speaker-Adaptation Technique forHybrid HMM-MLP RecognizerJoao P. Neto, Ciro A. Martins, Luis B. Almeida
1297 Phoneme Segmentation of Continuous Speech usingMulti-layer PerceptronYoungjoo Suh, Youngjik Lee
1301 Stochastic Perceptual Speech Models with DurauonalDependenceJeffBilmes, Nelson Morgan, Su-Lin Wu, Herve Bourlard
1305 Boosting the Performance of Connectionist LargeVocabulary Speech RecognitionCD. Cook, A.J. Robinson
1309 HMMs and OWE Neural Network for ContinuousSpeech RecognitionNicolas Picon, Dominique Fohr, Jean-Frangois Mori
1313 Smoothed Local Adaptation of Connectionist SystemsSteve Waterhouse, Dan Kershaw, Tony Robinson
Microphones
1317 Robust Speech Recognition with Speaker Localizationby a Microphone ArrayTakeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano
1321 Sound Source Localization in Reverberant Environmentsusing an Outlier Elimination AlgorithmEa-Ee Jan, James L. Flanagan
1325 The 1995 Abbot LVCSR System for Multiple UnknownMicrophonesDan Kershaw, Tony Robinson, Steve Renals
1329 Experiments of Speech Recognition in a Noisy andReverberant Environment using a Microphone Arrayand HMM AdaptationD. Giuliani, M. Omologo, P. Svaizer
1333 Increasing Robustness in GMM Speaker RecognitionSystems for Noisy and Reverberant Speech with LowComplexity Microphone ArraysJoaquin Gonzalez-Rodriguez. Javier Ortega-Garcia,Cisar Martin, Luis Hemdndez
1337 Robust Automatic Speech Recognition Using a Multi-channel Signal Separation Front-EndKuan-Chieh Yen, Yunxin Zhao
1341 Prosody Generation in Text-to-Speech ConversionUsing Dependency GraphsAnders Lindstrom, Ivan Breton, Mats Ljwigqvist
1345 Extraction Method of Non-restrictive Modification inJapanese as a Marked Factor of ProsodyHisako Asano, Hisashi Ohara, Yoshifumi Ooyama
1349 Modeling Contrast in the Generation and Synthesis ofSpoken LanguageScott Prevost
1353 A Left-to-right Processing Model of Pausing in JapaneseBased on Limited Syntactic InformationHajime Tsukada
1357 Modeling of Intonation Bearing Emphasis for TTS-Synthesis of Greek DialoguesD. Galanis, V. Darsinos, G. Kokkinakis
1361 Synthesizing Prosody: a Prominence-based ApproachBarbara Heuft, Thomas Portele
1365 Multilingual Text Analysis for Text-to-Speech SynthesisRichard Sproat
1369 Spoken-style Explanation Generator for Japanese Kanjiusing a Text-to-speech SystemYoshifumi Ooyama, Hisako Asano, Koji Matsuoka
1373 A Method for Estimating Prosodic Symbol from Text forJapanese Text-To-Speech SynthesisKen-ichi Magata, Tomoki Hamagami, Mitsuo Komura
1377 Statistical Methods in Data-driven Modeling of SpanishProsody for Text to SpeechE. Ldpez-Gonzalo, J.M. Rodriguez-Garcia
1381 Intonation Processing for TTS Using Stylization andNeural Network Learning MethodJung-Chid Lee, Youngjik Lee, Song-Hun Kim, MinsooHahn
1385 Generating FO Contours from ToBI Labels using LinearRegressionAlan W. Black, Andrew J. Hunt
1389 The Broad Study of Homograph Disambiguity forMandarin Speech SynthesisWem-Jun Wang, Shaw-Hwa Hwang, Sin-Homg Chen
1393 The MBROLA project: Towards a Set of High QualitySpeech Synthesizers Free of Use for Non CommercialPurposesT. Dutoit, V. Pagel N. Pierret, F. Bataille, O. Van derVrecken
1397 Training Data Selection for Voice Conversion UsingSpeaker Selection and Vector Field SmoothingMakoto Hashimoto, Norio Higuchi
1401 A New Voice Transformation Method Based on BothLinear and Nonlinear Prediction AnalysisKi Seung Let, Dae Hee Youn, II Whan Cha
1405 On the Transformation of the Speech Spectrum forVoice ConversionG. Baudoin, Yannis Stylianou
1409 Spectral Analysis of Synthetic Speech and NaturalSpeech with Noise over the Telephone LineCristina Delogu, Andrea Paoloni, Susanna Ragazzini,Paola Ridolfi
1413 A New Speech Synthesis System Based on the ARXSpeech Production ModelWeizhong Zhu, Hideki Kasuya
1417 Speech Synthesis Using the CELP AlgorithmGeraldo Lino de Campos, Evandro Bacci Gouvia
1421 A Mandarin Text-to-Speech SystemShaw-Hwa Hwang, Sin-Homg Chen, Yih-Ru Wang
1425 Residual-based Speech Modification Algorithms forText-to-Speech SynthesisM.D. Edgington, A. Lowry
1429 A Generalized LR Parser for Text-to-speech SynthesisPer Olav Heggtveit
1433 Enhanced Shape-invariant Pitch and Time-scaleModification for Concatenative Speech SynthesisM.P. Pollard, B.M.G. Cheetham, C.C. Goodyear, M.D.Edgington, A. Lowry
1437 An Excitation Synchronous Pitch Waveform ExtractionMethod and its Application to the VCV-concatenationSynthesis of Japanese Spoken WordsYasuhiko Arai, Ryo Mochizuki, Hirofumi Nishimura,Takashi Honda
1441 A New Chinese Text-to-Speech System with HighNaturalnessRen-Hua Wang, Qinfeng Liu, Difei Tang
1445 Voice Conversion Based on Topological Feature Mapsand Time-variant FilteringAnsgar Rinscheid
SaAlP2 - Instructional Technology for Spoken '
1449 Language Traimng System Utilizing SpeechModificationMeron Yoram, Keikichi Hirose
1453 Perception of English It/ and /I/ Speech Contrasts byNative Korean Listeners with Extensive English-language ExperienceD.G. Jamieson, K. Yu
1457 Automatic Text-independent Pronunciation Scoring ofForeign Language Student SpeechLeonardo Neumeyer, Horacio Franco, MitchelWeintraub, Path Price
1461 Assessing the Contribution of Instructional Technologyin the Teaching of PronunciationAntonio Simoes
1465 Detection of Foreign Speakers' Pronunciation Errors forSecond Language Training - Preliminary ResultsMaxine Eskenazi
1469 Foreign Accent in Intonation Patterns - A ContrastiveStudy Applying a Quantitative Model of the FO ContourHansjorg Mixdorff
1473 Input Modality Effects in Foreign AccentDuncan J. Markham, Yasuko Nagano-Madsen
SaAlSl — Mnliiinoda] Spoken Language Processing I
1477 For Speech Perception by Humans or Machines, ThreeSenses are Better than OneLynne E. Bernstein, Christian Benoit
1481 A Few Factors Which Affect the Degree ofIncorporating Lip-read Information into SpeechPerceptionKaoru Sekiyama, Yoh'ichi Tohkura, Michio Umeda
1485 Characterizing Audiovisual Information During SpeechE. Vatikiotis-Bateson, K.G. Munhall, Y. Kasahara, F.Garcia, H. Yehia
1489 The Implications of the Tadoma Method ofSpeechreading for Spoken Language ProcessingCharlotte M. Reed
1493 Seeing Speech in Space and Time: Psychological andNeurological FindingsRuth Campbell
SaA2Ll - Prosody -Phonological/Phonetit MeaOTres
1497 What's in the "Pure" Prosody?Volker Strom, Christina Widera
1501 FO Declination in Read-aloud and Spontaneous SpeechMarc Swerts, Eva Stronger!, Manias Helaner
1505 Prediction of Prosodic Phrase Boundaries ConsideringVariable Speaking RateYeon-jun Kim, Yung-hwan Oh
1509 Prediction of FO Parameter of Contextualized Utterancesin DialogueYoichi Yamashita, Riichiro Mizoguchi
1513 The Production and Perception of PotentiallyAmbiguous Intonation Contours by Speakers of Russianand JapaneseV. Makarova, J. Matsui
1517 What is Invariant and What is Optional in theRealization of a FOCUSED Word? A Cross-dialectalStudy of Swedish Sentences With Moving FocusRobert Eklund
SaA2L2 —Phonetks^nAPerception
1521 Quantifying Spectral Characteristics of FricativesChristine H. Shadle, Sheila J. Mair
1525 Acoustic Characteristics of Ejectives in IngushNatasha Warner
1529 An Acoustic Profile of Consonant ReductionR.J.J.H. van Son, Louis C. W. Pols
1533 Devoicing in Post-vocalic Canadian-French ObstruantsDaniele Archambault, Blagovesta Maneva
1537 Paying Attention to Speaking RateAlexander L Francis, Howard C. Nusbaum
1541 The Lack of Invariance Problem and the Goal of SpeechPerceptionIrene Appelbaum
SaA2L3 — language Acquisition
— , ' • ' - . ' -
1545 The Acoustic Structure of Vowels in Mothers' Speech toInfants and AdultsJean E. Andruski, Patricia K. Kuhl
1549
1553
1557
1561
1565
Acoustical Characteristics of Sound Production of Deafand Normally Hearing InfantsChris J. Clement, Florien J. Koopmans-van Beinum,Louis C. W. Pols
Learning Non-native Vowel CategoriesJohn Kingston, Christine Bartels, Josi Benkx, DeannaMoore, Jeremy Rice, Rachel Thorbum, Neil MacmiUan
Word Recognition by Japanese InfantsPA. Halle, Toshisada Deguchi, Yuji Tamekawa, B.Boysson-Bardies, Shigeru Kiritani
Investigations of the Word Segmentation Abilities ofInfantsPeter W. Jusczyk
Developmental Change in Perception of ClauseBoundaries by 6- and 10-Month-old Japanese InfantsAioko Hayashi, Yuji Tamekawa, Toshisada Deguchi,Shigeru Kiritani
• * * •
OCIM iswIySVt
1569
1573
1577
1581
1585
1589
1593
1597
1601
A Frequency Domain Method for Parametrization of theVoice' SourcePaavo Alku, Erkki Vilkman
Glottal Correlates of the Word Stress and the Tense/LaxOpposition in GermanKrzysztof Marasek
Coarticulatory Stability in American English ItlSuzanne Boyce, Carol Y. Espy-Wilson
An MRI-based Analysis of the English Ixl and DlArticulationsShinobu MasaJd, Reiko Akahane-Yamada, Mark K.Tiede, Yasuhiro Shimada, Ichiro Fujimoto
Does Lexical Stress or Metrical Stress Better PredictWord Boundaries in Dutch?David van Kuijk
Optopalatograph (OPG): A New Apparatus for SpeechProduction AnalysisA. A. Wrench, A. D. Mclntosh, W. J. Hardcastle
Prediction of Vowel Systems using a DeductiveApproachRene Carre
Distinctions Between [t] and [tch] usingElectropalatography DataSheila J. Mair, Celia Scully, Christine H. Shadle
Relating Formants and Articulation in Intelligibility TestWordsMichiko Hashi, Raymond D. Kent, John R. Westbury,Mary J. Lindstrom
1605 The Role of Coarticulation in the Perception of VowelQuality in Modern Standard ArabicImad Znagui, Mohamed Yeou
1609 Updating the Reading EPGSimon Amfield, Wilf Jones
1612 Lexical Stress Detection on Stress-minimal Word PairsGoangshiuan S. Ying, Leah H. Jamieson, Ruxm Chen,Carl D. Mitchell
1616 An Acoustic Study of the Interaction Between Stressedand Unstressed Syllables in Spoken MandarinJing Wang
1620 Automatic Detection of Accent Nuclei at the Head ofWords for Speech RecognitionNobuaki Minematsu, Seiichi Nakagawa
1624 Automatic Generation of Prosodic Structure for HighQuality Mandarin Speech SynthesisFu-chiang Chou, Chiu-yu Tseng, Lin-shan Lee
1628 A Study on Japanese Prosodic Pattern and its Modelingin Restricted SpeechTomoki Hamagami, Ken-ichi Magata, Mitsuo Komura
1632 A Phonetic Study of Focus in Intransitive VerbSentencesSteve Hoskins
' Variation in Vocal Fold Vibration Associated withProsodic ConditionsShigeru Kiritani, Hiroshi Imagawa, Seiji Niimi
1636 Goethe for ProsodyStefan Rapp
1640 Prosodic Cues in Syntactically Ambiguous Strings; AnInteractive Speech Planning MechanismKA. Straub
1644 A Functional Model for Generation of the LocalComponents of F0 Contours in ChineseJinfu Ni, Ren-Hua Wang, Deyu Xia
1648 The Acquisition of Voiceless Stops in the Interlanguageof Second Language Learners of English and SpanishMarie Fellbaum
iZi „. s '—"—*
-K SS^Srfi! •-*-? ^**- * *• W* » Hj.r*-?^^^^^!?®,' '(»»»**«
1652 Studies of the McGurk Effect: Implications for Theoriesof Speech PerceptionKerry P. Green
1656 Using the Visual Component in Automatic SpeechRecognitionN. M. Brooke
1660 Perceptual Organization of Speech in One and SeveralModalities: Common Functions, Common ResourcesRobert E. Remez
1664 Multi-modal Encoding of Speech in Memory: A FirstReportDavid B. Pisoni, Helena M. Saldaha, Sonya M. Sheffert
1668 Evaluating Automatic Speech Recognition as aComponent of a Multi-input Device Human-computerInterfaceB.A. Mellor, C. Baber, C. Tunley
1672 Data Collection for the MASK Kiosk: WOz vs PrototypeSystemA. Life, I. Salter, J.N. Temem, F. Bernard, S. Rosset,S.K. Bennacef, Lori Lamel
1676 An Experimental Japanese/English Interpreting VideoPhone SystemM. Karaorman, T.H. Applebaum, T. Itoh, M. Endo, Y.Ohno, M. Hoshimi, T. Kamai, K. Matsui, K. Hata, S.Pearson, J.-C. Janqua
1680 User Participation and Compliance in Speech AutomatedTelecommunications ApplicationsSara Basson, Stephen Springer, Cynthia Fong, HongLeung, Ed Man, Michele Olson, John Pitrelli, RanvirSingh, Suk Wong
1684 Embedding Speech in Web InterfacesSamuel Bayer
1688 Voice-activated Home Banking System and its FieldTrialToshihiro Isobe, Masatoshi Morishima, FuminoriYoshitani, Nobuo Koizumi, Ken'ya Murakami
l :" V—TlSSysterasand Rules CT „-
1692 A Text Analyzer for Korean Text-to-Speech SystemsSangho Lee, Yung-Hwan Oh
1696 Design and Evaluation of a Phonological Phrase Parserfor Spanish Text-to-SpeechHelen E. Karn
1700 Comparison of Two Tree-Structured Approaches forGrapheme-to-Phoneme ConversionOve Andersen, Roland Kuhn, Ariane Lazarides, PaulDalsgaard, Ju'rgen Haas, Elmar Noth
1704 A Recurrent Network that Leams to Pronounce EnglishTextM.J. Adamson, R.I. Damper
1708 Archisegment-based Letter-to-Phone Conversion forConcatenative Speech Synthesis in PortugueseEleonora Cavalcante Albano, Agnaldo Antonio Moreira
1712 A New Method of Generating Speech Synthesis UnitsBased on Phonological Knowledge and ClusteringTechniqueYuki Yoshida, Shin'ya Nakajima, Kazuo Hakoda,Tomohisa Hirokawa
1716 Consistency in Transcription and Labelling of GermanIntonation with GToBIMartine Grice, Matthias Reyelt, Ralf BenzmiUler, Jo'rgMayer, Anton Ballmer
1720 Syntactic-prosodic Labeling of Large SpontaneousSpeech Data-basesAnton Batliner, R. Kompe, A. Kiessling, H. Niemann, E.Noth
1724 Relationship Between Discourse Structure and DynamicSpeech RateFlorien J. Koopmans-van Beinum, Monique E. vanDonzel
1728 Using Prosodic Clues to Decide When to Produce Back-channel UtterancesNigel Ward
1732 Dialog Act Classification with the Help of ProsodyMarion Mast, Ralf Kompe, Stefan Harbeck, AndreasKiessling, Heinrich Niemann, Elmar Noth, E. G.Schukat-Talamazzini, V. Wamke
1736 Using Lexical Stress in Continuous Speech Recognitionfor DutchDavid van Kuijk, Henk van den Heuvel, Louis Boves
1740 Automatic Accent Classification of Foreign AccentedAustralian English SpeechKarsten Kumpf, Robin W. King
1744 Discriminative Adaptation for Speaker VerificationF. Korkmazskiy, Biing-Hwang Juang
1746 Perceptual Features of Unknown Foreign Languages asRevealed by Multi-dimensional ScalingV. Stockmal, D. Muljani, Z.S. Bond
1752 On-line Incremental Adaptation for Speaker Verificationusing Maximum Likelihood Estimates of CDHMMParametersKin Yu, John S. Mason
1756 Combining Methods to Improve Speaker VerificationDecisionDominique Genoud, Frederic Bimbot, GuillaumeGravier, Girard Chollet
1760 Incremental Speaker Adaptation with Minimum ErrorDiscriminative Training for Speaker IdentificationCesar Martin del Alamo, J. Alvarez, C.aela Torre, F.J.Poyatos, L Hemdndez
1764 Frame Level Likelihood Normalization for Text-independent Speaker Identification using GaussianMixture ModelsKonstantin P. Markov, Seiichi Nakagawa
1768 On Using Prosodic Cues in Automatic LanguageIdentificationAnn E. Thyme-Gobbel, Sandra E. Hutchins
1772 Speaker Recognition Model using Two-dimensional Mel-Cepstrum and Predictive Neural NetworkTaaashi Kitamura, Shinsai Takei
1776 Unknown Language Rejection in LanguageIdentification SystemHingkeung Kwan, Keikichi Hirose
1780 Spoken Language Identification using Large VocabularySpeech RecognitionJames L Hieronymus, Shubha Kadambe
1784 Accent IdentificationCarlos Teixeira, Isabel M. Trancoso, AntdnioSerralheiro
1788 Comparison of Text-independent Speaker RecognitionMethods on Telephone Speech with Acoustic MismatchSarel van Vuuren
1792 On the Sources of Inter- and Intra-speaker Variability inthe Acoustic Dynamics of SpeechXue Yang, J. Bruce Millar, Iain Macleod
1796 Language Identification with Inaccurate String MatchingKay M. Berkling, Etienne Barnard
1800 Robust Prosodic Features for Speaker IdentificationM.J. Carey, E.S. Parris, H. Lloyd-Thomas, S.J. Bennett
1804 Text Independent Speaker Identification on NoisyEnvironments by Means of Self Organizing MapsE. Monte, J. Hemando, X. Miro, A. Adolf
1808 Language-identification Using Language-dependentPhonemes and Language-independent Speech UnitsPaul Dalsgaard, Ove Andersen, Hanne Hesselager,Bojan Petek
* Large Vocabulary Speech Recognition: The SwitchboardDomainRonald Rosenfeld, Hervi Bourlard
-Emotfbn in Recognit ion^
* Adding the Affective Dimension: A New Look inSpeech Analysis and SynthesisKlaus R. Scherer
1812 Ethological Theory and the Expression of Emotion inthe VoiceJohn J. Ohala
1816 Synthesizing Emotions in Speech: Is it Time to GetExcited?Iain R. Murray, John L Amott
SaP2Ll - Stochastic T e c h i i k ^ in Robast Speech
1820 A Study on Task-independent Subword Selection andModeling for Speech RecognitionChin-Hui Lee, Biing-Hwang Juang, Wu Chou, J.J.Molina-Perez
1824 Simultaneous ANN Feature and HMM RecognizerDesign using String-based Minimum ClassificationError (MCE) TrainingMazin G. Rahim, Chin-Hui Lee
1828 Quantizing Mixture-weights in a Tied-mixture HMMSunil K. Gupta, Frank K. Soong, Raziel Haimi-Cohen
1832 Variance Compensation within the MLLR Frameworkfor Robust Speech Recognition and Speaker AdaptationM.J.F. Gales, D. Pye, P.C. Woodland
1836 Maximum-likelihood Stochastic Matching Approach toNon-linear Equalization for Robust Speech RecognitionA.C. Surendran, Chin-Hui Lee, Mazin G. Rahim
1840 Estimation of Channel Bias for Telephone SpeechRecognitionJen-Tzung Chien, Hsiao-Chuan Wang, Lee-Min Lee
:SaP2L2-Prosodic Synthesis in Text to Speech
1844 Synthesis of English Intonation using Explicit Models ofReading and Spontaneous SpeechM. E. Johnson
1848 Implementation and Evaluation of a Model for Synthesisof Swedish IntonationMerle Home, Marcus Filipsson
1852 Natural Prosody Generation for Domain Specific Text-to-Speech SystemsNobuyuki Katae, Shinta Kimura
1856 Improving Text-to-Speech SynthesisMark Tatham, Eric Lewis
1860 Synthesis of Stressed Speech from Isolated NeutralSpeech Using HMM-based ModelsSahar E. Bou-Ghazale, John H.L Hansen
1864 Modeling Segment Intonation for Slovene TTS SystemAles Dobnikar
SaP2L3 - DialogueJEvents
1868 Word Predictability After Hesitations: A Corpus-basedStudyElizabeth Shriberg, Andreas Stolcke
1872 Interruptions and IntonationLi-chiung Yang
1876 On not Recognizing Disfluencies in DialogueRobin J. Lickley, Ellen Gurman Bard
1880 A Theory of Word Frequencies and its Application toDialogue Move RecognitionPhil Gamer, Sue Browning, Roger Moore, MartinRussell
1884 Utterance Units and Grounding in Spoken DialogueDavid R. Traum, Peter A. Heeman
1888 Coordinating Turn-taking with GazeDavid G. Novick, Brian Hansen, Karen Ward
SaE2Pl -Databases and Took
1892 BABEL: An Eastern European Multi-language DatabasePeter Roach, Simon Arnfield, W. Barry, J. Baltova, M.Boldea, A. Fourcin, W. Gonet, R. Gubrynowicz, E.Hallum, L Lamel, K. Marasek, A. Marchal, E. Meister,K. Vicsi
1894 USTC95—A Putonghua CorpusRen-Hua Wang, Deyu Xia, Jinfu Ni, Bicheng Liu
1898 Telephone Data Collection using the World Wide WebEdward Hurley, Joseph Polifroni, James Glass
1902 The "SIVA" Speech Database for Speaker Verification:Description and EvaluationM. Falcone, A. Callo
1906 A Multi-level Description of Date Expressions inGerman Telephone SpeechChristoph Draxler
1910 Viterbi Search Visualization Using Vista: A GenericPerformance Visualization ToolRobert H. Halstead Jr., Ben Serridge, Jean-Manuel VanThong, William Goldenthal
1914 A Multilingual Phonetic Representation and AnalysisSystem for Different Speech DatabasesToomas Altosaar, Matti Karjalainen, Martti Vainio
1918 FRESCO: The French Telephone Speech DataCollection - Part of the European SpeechDat(M) ProjectD. Langmann, R. Haeb-Umbach, Louis Boves, E. den Os
1922 Predicting the Out-of-Vocabulary Rate and the RequiredVocabulary Size for Speech Processing ApplicationsJohannes Miiller, Holger StahL Manfred Lang
1926 AMULET: Automatic MUltisensor Speech Labellingand Event Tracking: Study of the Spatio-temporalCorrelations in Voiceless Plosive ProductionNathalie Parlangeau, Alain Marchal
1930 Constructing Multi-level Speech Database forSpontaneous Speech ProcessingMinsoo Hahn, Sanghun Kim, Jung-Chul Lee, Yong-JuLee
1934 Preliminaries to a Romanian Speech DatabaseMarian Boldea, Alin Doroga, Tiberiu Dumitrescu,Maria Pescaru
1938 Labelled Data Bank of Spoken Standard German TheKiel Corpus of Read/Spontaneous SpeechKlaus J. Kohler
1942 SAPPHIRE: An Extensible Speech Analysis andRecognition Tool Based on Tcl/TkLee Hetherington, Michael McCandless
1946 Automatic Detection of Topic Boundaries and Keywordsin Arbitrary Speech Using Incremental ReferenceInterval-free Continuous DPJiro Kiyama, Yoshiaki Itoh, Ryuichi Oka
1950 Very-large-vocabulary Mandarin Voice Message FileRetrieval using Speech QueriesBo-Ren Bai, Lee-Feng Chien, Lin-Shan Lee
1954 Gandalf - A Swedish Telephone Speaker VerificationDatabaseH.Melin
1958 The DCIEM Map Task Corpus: Spontaneous DialogueUnder Sleep Deprivation and Drug TreatmentEllen Gurman Bard, C. Sotillo, A. H. Anderson, M. M.Taylor
1962 The Nemours Database of Dysarthric Speech
Xavier Menendez-PidaL James B. Polikoff, Shirley M.Peters, Jennie E. Leonzio, H.T. Bunnell
1966 POST: Parallel Object-oriented Speech ToolkitJean Hennebert, Dijana Petrovska Delacritaz
2013 Lombard Effect Compensation and Noise Suppressionfor Noisy Lombard Speech RecognitionSang-mun Chi, Yung-Hwan Oh
1970 Recognizing Emotion in SpeechFrank Dellaert, Thomas Polzin, Alex Waibel
1974 Emotions in Time Domain SynthesisBarbara Heuft, Thomas Portele, Monika Rauth
1978 Word Class Driven Synthesis of Prosodic AnnotationsSimon Amfield
1981 Dynamical Modelling of Vowel Sounds as a SynthesisToolM. Banbrook, S. McLaughlin
1985 Emotional Speech Elicited using Computer GamesTom Johnstone
1989 Automatic Statistical Analysis of the Signal andProsodic Signs of Emotion in SpeechRoddy Cowie, Ellen Douglas-Cowie
'^ "•" T ^
£f r4r"-* ? *
— Ridmst
* ^ . r 1 1 _ • > •
%KedliPrw-'* f
n o
1993
1997
2001
2005
2009
Channel and Noise Normalization Using AffineTransformed CepstrumXiaoyu Zhang, Richard J. Mammone
Spectral Estimation and Normalisation for RobustSpeech RecognitionTom Claes, Fei Xie, Dirk Van Compemolle
Trellis Encoded Vector Quantization for Robust SpeechRecognitionWu Chou, Nambi Seshadri, Mazin Rahim
Phone Clustering using the Bhattacharyya DistanceBrian Mak, Etienne Barnard
Variability of Lombard Effects Under Different NoiseConditionsAtsushi Wakao, Kazuya Takeda, Fumitada Itakura
2017 The Use of Shibboleth Words for AutomaticallyClassifying Speakers by DialectA.W.F. Huggins, Yogen Patel
' The Organization of Dialect Diversity in North AmericaWilliam Labov
2021 Data Collection of Japanese Dialects and its Influenceinto Speech RecognitionIkuo Kudo, Takao Nakama, Tomoko Watanabe, ReikoKameyama
2025 Statistical Dialect Classification Based on MeanPhonetic FeaturesDavid R. Miller, James Trischitta
2028 Norwegian Numerals: a Challenge to Automatic SpeechRecognitionKnut Kvale
2032 Evaluation of the Telefdnica I+D Natural NumbersRecognizer over Different Dialects of Spanish fromSpain and AmericaC. de la Torre, J. Caminero-Gil, J. Alvarez. C. Martindel Alamo, L Hemdndez-Gdmez
2036 Rhythmic Constraints on English Stress TimingFred Cummins, Robert F. Port
2040 On the Interaction of Clash, Focus and PhonologicalPhrasingIrene Vogel, Steve Ho skins
2044 On the Quantal Nature of Speech TimingGunnar Font, Anita Kruckenberg
2048 Differential Perception of Tonal Contours Through theSyllableDavid House
2052 Pitch, Loudness, and Segmental Duration Correlates:Towards a Model for the Phonetic Aspects of FinnishProsodyMartti Vainio, Toomas Altosaar
2056 Prosodic Manipulation System of Speech Material forPerceptual ExperimentsNobuaki Minematsu, Seiichi Nakagawa, Keikichi Hirose