proceedings - gbv · 2007-03-16 · proceedings october 3-6, 1996 wyndham franklin plaza hotel...

Proceedings

October 3-6, 1996Wyndham Franklin Plaza Hotel

Philadelphia, PA, USA

ICSLP 96

Sponsored by

University of DelawareAlfred I. duPont Institute

in conjunction withAcoustical Society of America

Acoustical Society of JapanAmerican Speech-Language-Hearing Association

Australian Speech Science and Technology AssociationEuropean Speech Communication Association

IEEE Signal Processing SocietyIncorporated Canadian Acoustical Association

International Phonetic AssociationLinguistic Society of America

UB/TIB Hannover116 238 895

89

Fourth International Conference on Spoken Language Processing

1229 Formant Analysis Using Mixtures of GaussiansParham Zolfaghari, Tony Robinson

1233 Deriving Articulatory Representations from Speech withVarious Excitation ModesHywel B. Richards, John S. Mason, Melvyn J. Hun:,John S. Bridle

1237 "Blind" Speech Segmentation: Automatic Segmentationof Speech Without Linguistic KnowledgeManish Sharma, Richard J. Mammone

1241 Speech Synthesis Using a Nonlinear Energy DampingModel for the Vocal Folds Vibration EffectHiroshi Ohmura, Kazuyo Tanaka

1245 Neural Networks Learning with LI Criteria and ItsEfficiency in Linear Prediction of Speech SignalsMunehiro Namba, Hiroyuki Kamata, Yoshihisa Ishida

1249 Preprocessing and Neural Classification of English StopConsonants [b,d,g,p,Uc]A. Esposito, C. E. Ezin, M. Ceccarelli

1253 A Comparison of Modified k-means(MKM) and NNbased Real Time Adaptive Clustering Algorithms forArticulatory Space Codebook FormationK.S. Ananthakrishnan

1257 A Novel Approach to the Estimation of Voice Sourceand Vocal Tract Parameters from Speech SignalsWen Ding, Hideki Kasuya

1261 Syllable Detection in Read and Spontaneous SpeechHanmut R. Pfitzinger, Susanne Burger, Sebastian Held

1265 Maximum Likelihood Learning of Auditory FeatureMaps for Stationary VowelsKuansan Wang, Chin-Hui Lee, Biing-Hwang Juang

1269 Explicit Segmentation of Speech using Gaussian ModelsAntonio Bonafonte, Albino Nogueiras, AntonioRodriguez-Garrido

1273 A Comparison of Several Recent Methods ofFundamental Frequency and Voicing DecisionEstimationE. Mousset, W.A. Ainsworth, Jose A. R. Fonollosa

1277 Robust Pitch Estimation with Harmonics Enhancementin Noisy Environments Based on InstantaneousFrequencyToshihiko Abe, Takao Kobayashi, Satoshi Imai

1281 Integrated Polispectrum on Speech RecognitionAsuncidn Moreno, Miquel Rutlldn

FrP2Sl - Physics and Simulation of the Vocal Tract

1285 Analysis of Acoustic Properties of the Nasal Tract Using3-DFEMHisayoshi Suzuki, Takayoshi Nakai, Hirosi Sakakibara

1289 Experiments with Analysis By Synthesis of GlottalAirflowJohan Liljencrants

SaAJLl - Speech Recognition Using HMMsandfWs

1293 An Incremental Speaker-Adaptation Technique forHybrid HMM-MLP RecognizerJoao P. Neto, Ciro A. Martins, Luis B. Almeida

1297 Phoneme Segmentation of Continuous Speech usingMulti-layer PerceptronYoungjoo Suh, Youngjik Lee

1301 Stochastic Perceptual Speech Models with DurauonalDependenceJeffBilmes, Nelson Morgan, Su-Lin Wu, Herve Bourlard

1305 Boosting the Performance of Connectionist LargeVocabulary Speech RecognitionCD. Cook, A.J. Robinson

1309 HMMs and OWE Neural Network for ContinuousSpeech RecognitionNicolas Picon, Dominique Fohr, Jean-Frangois Mori

1313 Smoothed Local Adaptation of Connectionist SystemsSteve Waterhouse, Dan Kershaw, Tony Robinson

Microphones

1317 Robust Speech Recognition with Speaker Localizationby a Microphone ArrayTakeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano

1321 Sound Source Localization in Reverberant Environmentsusing an Outlier Elimination AlgorithmEa-Ee Jan, James L. Flanagan

1325 The 1995 Abbot LVCSR System for Multiple UnknownMicrophonesDan Kershaw, Tony Robinson, Steve Renals

1329 Experiments of Speech Recognition in a Noisy andReverberant Environment using a Microphone Arrayand HMM AdaptationD. Giuliani, M. Omologo, P. Svaizer

1333 Increasing Robustness in GMM Speaker RecognitionSystems for Noisy and Reverberant Speech with LowComplexity Microphone ArraysJoaquin Gonzalez-Rodriguez. Javier Ortega-Garcia,Cisar Martin, Luis Hemdndez

1337 Robust Automatic Speech Recognition Using a Multi-channel Signal Separation Front-EndKuan-Chieh Yen, Yunxin Zhao

1341 Prosody Generation in Text-to-Speech ConversionUsing Dependency GraphsAnders Lindstrom, Ivan Breton, Mats Ljwigqvist

1345 Extraction Method of Non-restrictive Modification inJapanese as a Marked Factor of ProsodyHisako Asano, Hisashi Ohara, Yoshifumi Ooyama

1349 Modeling Contrast in the Generation and Synthesis ofSpoken LanguageScott Prevost

1353 A Left-to-right Processing Model of Pausing in JapaneseBased on Limited Syntactic InformationHajime Tsukada

1357 Modeling of Intonation Bearing Emphasis for TTS-Synthesis of Greek DialoguesD. Galanis, V. Darsinos, G. Kokkinakis

1361 Synthesizing Prosody: a Prominence-based ApproachBarbara Heuft, Thomas Portele

1365 Multilingual Text Analysis for Text-to-Speech SynthesisRichard Sproat

1369 Spoken-style Explanation Generator for Japanese Kanjiusing a Text-to-speech SystemYoshifumi Ooyama, Hisako Asano, Koji Matsuoka

1373 A Method for Estimating Prosodic Symbol from Text forJapanese Text-To-Speech SynthesisKen-ichi Magata, Tomoki Hamagami, Mitsuo Komura

1377 Statistical Methods in Data-driven Modeling of SpanishProsody for Text to SpeechE. Ldpez-Gonzalo, J.M. Rodriguez-Garcia

1381 Intonation Processing for TTS Using Stylization andNeural Network Learning MethodJung-Chid Lee, Youngjik Lee, Song-Hun Kim, MinsooHahn

1385 Generating FO Contours from ToBI Labels using LinearRegressionAlan W. Black, Andrew J. Hunt

1389 The Broad Study of Homograph Disambiguity forMandarin Speech SynthesisWem-Jun Wang, Shaw-Hwa Hwang, Sin-Homg Chen

1393 The MBROLA project: Towards a Set of High QualitySpeech Synthesizers Free of Use for Non CommercialPurposesT. Dutoit, V. Pagel N. Pierret, F. Bataille, O. Van derVrecken

1397 Training Data Selection for Voice Conversion UsingSpeaker Selection and Vector Field SmoothingMakoto Hashimoto, Norio Higuchi

1401 A New Voice Transformation Method Based on BothLinear and Nonlinear Prediction AnalysisKi Seung Let, Dae Hee Youn, II Whan Cha

1405 On the Transformation of the Speech Spectrum forVoice ConversionG. Baudoin, Yannis Stylianou

1409 Spectral Analysis of Synthetic Speech and NaturalSpeech with Noise over the Telephone LineCristina Delogu, Andrea Paoloni, Susanna Ragazzini,Paola Ridolfi

1413 A New Speech Synthesis System Based on the ARXSpeech Production ModelWeizhong Zhu, Hideki Kasuya

1417 Speech Synthesis Using the CELP AlgorithmGeraldo Lino de Campos, Evandro Bacci Gouvia

1421 A Mandarin Text-to-Speech SystemShaw-Hwa Hwang, Sin-Homg Chen, Yih-Ru Wang

1425 Residual-based Speech Modification Algorithms forText-to-Speech SynthesisM.D. Edgington, A. Lowry

1429 A Generalized LR Parser for Text-to-speech SynthesisPer Olav Heggtveit

1433 Enhanced Shape-invariant Pitch and Time-scaleModification for Concatenative Speech SynthesisM.P. Pollard, B.M.G. Cheetham, C.C. Goodyear, M.D.Edgington, A. Lowry

1437 An Excitation Synchronous Pitch Waveform ExtractionMethod and its Application to the VCV-concatenationSynthesis of Japanese Spoken WordsYasuhiko Arai, Ryo Mochizuki, Hirofumi Nishimura,Takashi Honda

1441 A New Chinese Text-to-Speech System with HighNaturalnessRen-Hua Wang, Qinfeng Liu, Difei Tang

1445 Voice Conversion Based on Topological Feature Mapsand Time-variant FilteringAnsgar Rinscheid

SaAlP2 - Instructional Technology for Spoken '

1449 Language Traimng System Utilizing SpeechModificationMeron Yoram, Keikichi Hirose

1453 Perception of English It/ and /I/ Speech Contrasts byNative Korean Listeners with Extensive English-language ExperienceD.G. Jamieson, K. Yu

1457 Automatic Text-independent Pronunciation Scoring ofForeign Language Student SpeechLeonardo Neumeyer, Horacio Franco, MitchelWeintraub, Path Price

1461 Assessing the Contribution of Instructional Technologyin the Teaching of PronunciationAntonio Simoes

1465 Detection of Foreign Speakers' Pronunciation Errors forSecond Language Training - Preliminary ResultsMaxine Eskenazi

1469 Foreign Accent in Intonation Patterns - A ContrastiveStudy Applying a Quantitative Model of the FO ContourHansjorg Mixdorff

1473 Input Modality Effects in Foreign AccentDuncan J. Markham, Yasuko Nagano-Madsen

SaAlSl — Mnliiinoda] Spoken Language Processing I

1477 For Speech Perception by Humans or Machines, ThreeSenses are Better than OneLynne E. Bernstein, Christian Benoit

1481 A Few Factors Which Affect the Degree ofIncorporating Lip-read Information into SpeechPerceptionKaoru Sekiyama, Yoh'ichi Tohkura, Michio Umeda

1485 Characterizing Audiovisual Information During SpeechE. Vatikiotis-Bateson, K.G. Munhall, Y. Kasahara, F.Garcia, H. Yehia

1489 The Implications of the Tadoma Method ofSpeechreading for Spoken Language ProcessingCharlotte M. Reed

1493 Seeing Speech in Space and Time: Psychological andNeurological FindingsRuth Campbell

SaA2Ll - Prosody -Phonological/Phonetit MeaOTres

1497 What's in the "Pure" Prosody?Volker Strom, Christina Widera

1501 FO Declination in Read-aloud and Spontaneous SpeechMarc Swerts, Eva Stronger!, Manias Helaner

1505 Prediction of Prosodic Phrase Boundaries ConsideringVariable Speaking RateYeon-jun Kim, Yung-hwan Oh

1509 Prediction of FO Parameter of Contextualized Utterancesin DialogueYoichi Yamashita, Riichiro Mizoguchi

1513 The Production and Perception of PotentiallyAmbiguous Intonation Contours by Speakers of Russianand JapaneseV. Makarova, J. Matsui

1517 What is Invariant and What is Optional in theRealization of a FOCUSED Word? A Cross-dialectalStudy of Swedish Sentences With Moving FocusRobert Eklund

SaA2L2 —Phonetks^nAPerception

1521 Quantifying Spectral Characteristics of FricativesChristine H. Shadle, Sheila J. Mair

1525 Acoustic Characteristics of Ejectives in IngushNatasha Warner

1529 An Acoustic Profile of Consonant ReductionR.J.J.H. van Son, Louis C. W. Pols

1533 Devoicing in Post-vocalic Canadian-French ObstruantsDaniele Archambault, Blagovesta Maneva

1537 Paying Attention to Speaking RateAlexander L Francis, Howard C. Nusbaum

1541 The Lack of Invariance Problem and the Goal of SpeechPerceptionIrene Appelbaum

SaA2L3 — language Acquisition

— , ' • ' - . ' -

1545 The Acoustic Structure of Vowels in Mothers' Speech toInfants and AdultsJean E. Andruski, Patricia K. Kuhl

1549

1553

1557

1561

1565

Acoustical Characteristics of Sound Production of Deafand Normally Hearing InfantsChris J. Clement, Florien J. Koopmans-van Beinum,Louis C. W. Pols

Learning Non-native Vowel CategoriesJohn Kingston, Christine Bartels, Josi Benkx, DeannaMoore, Jeremy Rice, Rachel Thorbum, Neil MacmiUan

Word Recognition by Japanese InfantsPA. Halle, Toshisada Deguchi, Yuji Tamekawa, B.Boysson-Bardies, Shigeru Kiritani

Investigations of the Word Segmentation Abilities ofInfantsPeter W. Jusczyk

Developmental Change in Perception of ClauseBoundaries by 6- and 10-Month-old Japanese InfantsAioko Hayashi, Yuji Tamekawa, Toshisada Deguchi,Shigeru Kiritani

• * * •

OCIM iswIySVt

1569

1573

1577

1581

1585

1589

1593

1597

1601

A Frequency Domain Method for Parametrization of theVoice' SourcePaavo Alku, Erkki Vilkman

Glottal Correlates of the Word Stress and the Tense/LaxOpposition in GermanKrzysztof Marasek

Coarticulatory Stability in American English ItlSuzanne Boyce, Carol Y. Espy-Wilson

An MRI-based Analysis of the English Ixl and DlArticulationsShinobu MasaJd, Reiko Akahane-Yamada, Mark K.Tiede, Yasuhiro Shimada, Ichiro Fujimoto

Does Lexical Stress or Metrical Stress Better PredictWord Boundaries in Dutch?David van Kuijk

Optopalatograph (OPG): A New Apparatus for SpeechProduction AnalysisA. A. Wrench, A. D. Mclntosh, W. J. Hardcastle

Prediction of Vowel Systems using a DeductiveApproachRene Carre

Distinctions Between [t] and [tch] usingElectropalatography DataSheila J. Mair, Celia Scully, Christine H. Shadle

Relating Formants and Articulation in Intelligibility TestWordsMichiko Hashi, Raymond D. Kent, John R. Westbury,Mary J. Lindstrom

1605 The Role of Coarticulation in the Perception of VowelQuality in Modern Standard ArabicImad Znagui, Mohamed Yeou

1609 Updating the Reading EPGSimon Amfield, Wilf Jones

1612 Lexical Stress Detection on Stress-minimal Word PairsGoangshiuan S. Ying, Leah H. Jamieson, Ruxm Chen,Carl D. Mitchell

1616 An Acoustic Study of the Interaction Between Stressedand Unstressed Syllables in Spoken MandarinJing Wang

1620 Automatic Detection of Accent Nuclei at the Head ofWords for Speech RecognitionNobuaki Minematsu, Seiichi Nakagawa

1624 Automatic Generation of Prosodic Structure for HighQuality Mandarin Speech SynthesisFu-chiang Chou, Chiu-yu Tseng, Lin-shan Lee

1628 A Study on Japanese Prosodic Pattern and its Modelingin Restricted SpeechTomoki Hamagami, Ken-ichi Magata, Mitsuo Komura

1632 A Phonetic Study of Focus in Intransitive VerbSentencesSteve Hoskins

' Variation in Vocal Fold Vibration Associated withProsodic ConditionsShigeru Kiritani, Hiroshi Imagawa, Seiji Niimi

1636 Goethe for ProsodyStefan Rapp

1640 Prosodic Cues in Syntactically Ambiguous Strings; AnInteractive Speech Planning MechanismKA. Straub

1644 A Functional Model for Generation of the LocalComponents of F0 Contours in ChineseJinfu Ni, Ren-Hua Wang, Deyu Xia

1648 The Acquisition of Voiceless Stops in the Interlanguageof Second Language Learners of English and SpanishMarie Fellbaum

iZi „. s '—"—*

-K SS^Srfi! •-*-? ^**- * *• W* » Hj.r*-?^^^^^!?®,' '(»»»**«

1652 Studies of the McGurk Effect: Implications for Theoriesof Speech PerceptionKerry P. Green

1656 Using the Visual Component in Automatic SpeechRecognitionN. M. Brooke

1660 Perceptual Organization of Speech in One and SeveralModalities: Common Functions, Common ResourcesRobert E. Remez

1664 Multi-modal Encoding of Speech in Memory: A FirstReportDavid B. Pisoni, Helena M. Saldaha, Sonya M. Sheffert

1668 Evaluating Automatic Speech Recognition as aComponent of a Multi-input Device Human-computerInterfaceB.A. Mellor, C. Baber, C. Tunley

1672 Data Collection for the MASK Kiosk: WOz vs PrototypeSystemA. Life, I. Salter, J.N. Temem, F. Bernard, S. Rosset,S.K. Bennacef, Lori Lamel

1676 An Experimental Japanese/English Interpreting VideoPhone SystemM. Karaorman, T.H. Applebaum, T. Itoh, M. Endo, Y.Ohno, M. Hoshimi, T. Kamai, K. Matsui, K. Hata, S.Pearson, J.-C. Janqua

1680 User Participation and Compliance in Speech AutomatedTelecommunications ApplicationsSara Basson, Stephen Springer, Cynthia Fong, HongLeung, Ed Man, Michele Olson, John Pitrelli, RanvirSingh, Suk Wong

1684 Embedding Speech in Web InterfacesSamuel Bayer

1688 Voice-activated Home Banking System and its FieldTrialToshihiro Isobe, Masatoshi Morishima, FuminoriYoshitani, Nobuo Koizumi, Ken'ya Murakami

l :" V—TlSSysterasand Rules CT „-

1692 A Text Analyzer for Korean Text-to-Speech SystemsSangho Lee, Yung-Hwan Oh

1696 Design and Evaluation of a Phonological Phrase Parserfor Spanish Text-to-SpeechHelen E. Karn

1700 Comparison of Two Tree-Structured Approaches forGrapheme-to-Phoneme ConversionOve Andersen, Roland Kuhn, Ariane Lazarides, PaulDalsgaard, Ju'rgen Haas, Elmar Noth

1704 A Recurrent Network that Leams to Pronounce EnglishTextM.J. Adamson, R.I. Damper

1708 Archisegment-based Letter-to-Phone Conversion forConcatenative Speech Synthesis in PortugueseEleonora Cavalcante Albano, Agnaldo Antonio Moreira

1712 A New Method of Generating Speech Synthesis UnitsBased on Phonological Knowledge and ClusteringTechniqueYuki Yoshida, Shin'ya Nakajima, Kazuo Hakoda,Tomohisa Hirokawa

1716 Consistency in Transcription and Labelling of GermanIntonation with GToBIMartine Grice, Matthias Reyelt, Ralf BenzmiUler, Jo'rgMayer, Anton Ballmer

1720 Syntactic-prosodic Labeling of Large SpontaneousSpeech Data-basesAnton Batliner, R. Kompe, A. Kiessling, H. Niemann, E.Noth

1724 Relationship Between Discourse Structure and DynamicSpeech RateFlorien J. Koopmans-van Beinum, Monique E. vanDonzel

1728 Using Prosodic Clues to Decide When to Produce Back-channel UtterancesNigel Ward

1732 Dialog Act Classification with the Help of ProsodyMarion Mast, Ralf Kompe, Stefan Harbeck, AndreasKiessling, Heinrich Niemann, Elmar Noth, E. G.Schukat-Talamazzini, V. Wamke

1736 Using Lexical Stress in Continuous Speech Recognitionfor DutchDavid van Kuijk, Henk van den Heuvel, Louis Boves

1740 Automatic Accent Classification of Foreign AccentedAustralian English SpeechKarsten Kumpf, Robin W. King

1744 Discriminative Adaptation for Speaker VerificationF. Korkmazskiy, Biing-Hwang Juang

1746 Perceptual Features of Unknown Foreign Languages asRevealed by Multi-dimensional ScalingV. Stockmal, D. Muljani, Z.S. Bond

1752 On-line Incremental Adaptation for Speaker Verificationusing Maximum Likelihood Estimates of CDHMMParametersKin Yu, John S. Mason

1756 Combining Methods to Improve Speaker VerificationDecisionDominique Genoud, Frederic Bimbot, GuillaumeGravier, Girard Chollet

1760 Incremental Speaker Adaptation with Minimum ErrorDiscriminative Training for Speaker IdentificationCesar Martin del Alamo, J. Alvarez, C.aela Torre, F.J.Poyatos, L Hemdndez

1764 Frame Level Likelihood Normalization for Text-independent Speaker Identification using GaussianMixture ModelsKonstantin P. Markov, Seiichi Nakagawa

1768 On Using Prosodic Cues in Automatic LanguageIdentificationAnn E. Thyme-Gobbel, Sandra E. Hutchins

1772 Speaker Recognition Model using Two-dimensional Mel-Cepstrum and Predictive Neural NetworkTaaashi Kitamura, Shinsai Takei

1776 Unknown Language Rejection in LanguageIdentification SystemHingkeung Kwan, Keikichi Hirose

1780 Spoken Language Identification using Large VocabularySpeech RecognitionJames L Hieronymus, Shubha Kadambe

1784 Accent IdentificationCarlos Teixeira, Isabel M. Trancoso, AntdnioSerralheiro

1788 Comparison of Text-independent Speaker RecognitionMethods on Telephone Speech with Acoustic MismatchSarel van Vuuren

1792 On the Sources of Inter- and Intra-speaker Variability inthe Acoustic Dynamics of SpeechXue Yang, J. Bruce Millar, Iain Macleod

1796 Language Identification with Inaccurate String MatchingKay M. Berkling, Etienne Barnard

1800 Robust Prosodic Features for Speaker IdentificationM.J. Carey, E.S. Parris, H. Lloyd-Thomas, S.J. Bennett

1804 Text Independent Speaker Identification on NoisyEnvironments by Means of Self Organizing MapsE. Monte, J. Hemando, X. Miro, A. Adolf

1808 Language-identification Using Language-dependentPhonemes and Language-independent Speech UnitsPaul Dalsgaard, Ove Andersen, Hanne Hesselager,Bojan Petek

* Large Vocabulary Speech Recognition: The SwitchboardDomainRonald Rosenfeld, Hervi Bourlard

-Emotfbn in Recognit ion^

* Adding the Affective Dimension: A New Look inSpeech Analysis and SynthesisKlaus R. Scherer

1812 Ethological Theory and the Expression of Emotion inthe VoiceJohn J. Ohala

1816 Synthesizing Emotions in Speech: Is it Time to GetExcited?Iain R. Murray, John L Amott

SaP2Ll - Stochastic T e c h i i k ^ in Robast Speech

1820 A Study on Task-independent Subword Selection andModeling for Speech RecognitionChin-Hui Lee, Biing-Hwang Juang, Wu Chou, J.J.Molina-Perez

1824 Simultaneous ANN Feature and HMM RecognizerDesign using String-based Minimum ClassificationError (MCE) TrainingMazin G. Rahim, Chin-Hui Lee

1828 Quantizing Mixture-weights in a Tied-mixture HMMSunil K. Gupta, Frank K. Soong, Raziel Haimi-Cohen

1832 Variance Compensation within the MLLR Frameworkfor Robust Speech Recognition and Speaker AdaptationM.J.F. Gales, D. Pye, P.C. Woodland

1836 Maximum-likelihood Stochastic Matching Approach toNon-linear Equalization for Robust Speech RecognitionA.C. Surendran, Chin-Hui Lee, Mazin G. Rahim

1840 Estimation of Channel Bias for Telephone SpeechRecognitionJen-Tzung Chien, Hsiao-Chuan Wang, Lee-Min Lee

:SaP2L2-Prosodic Synthesis in Text to Speech

1844 Synthesis of English Intonation using Explicit Models ofReading and Spontaneous SpeechM. E. Johnson

1848 Implementation and Evaluation of a Model for Synthesisof Swedish IntonationMerle Home, Marcus Filipsson

1852 Natural Prosody Generation for Domain Specific Text-to-Speech SystemsNobuyuki Katae, Shinta Kimura

1856 Improving Text-to-Speech SynthesisMark Tatham, Eric Lewis

1860 Synthesis of Stressed Speech from Isolated NeutralSpeech Using HMM-based ModelsSahar E. Bou-Ghazale, John H.L Hansen

1864 Modeling Segment Intonation for Slovene TTS SystemAles Dobnikar

SaP2L3 - DialogueJEvents

1868 Word Predictability After Hesitations: A Corpus-basedStudyElizabeth Shriberg, Andreas Stolcke

1872 Interruptions and IntonationLi-chiung Yang

1876 On not Recognizing Disfluencies in DialogueRobin J. Lickley, Ellen Gurman Bard

1880 A Theory of Word Frequencies and its Application toDialogue Move RecognitionPhil Gamer, Sue Browning, Roger Moore, MartinRussell

1884 Utterance Units and Grounding in Spoken DialogueDavid R. Traum, Peter A. Heeman

1888 Coordinating Turn-taking with GazeDavid G. Novick, Brian Hansen, Karen Ward

SaE2Pl -Databases and Took

1892 BABEL: An Eastern European Multi-language DatabasePeter Roach, Simon Arnfield, W. Barry, J. Baltova, M.Boldea, A. Fourcin, W. Gonet, R. Gubrynowicz, E.Hallum, L Lamel, K. Marasek, A. Marchal, E. Meister,K. Vicsi

1894 USTC95—A Putonghua CorpusRen-Hua Wang, Deyu Xia, Jinfu Ni, Bicheng Liu

1898 Telephone Data Collection using the World Wide WebEdward Hurley, Joseph Polifroni, James Glass

1902 The "SIVA" Speech Database for Speaker Verification:Description and EvaluationM. Falcone, A. Callo

1906 A Multi-level Description of Date Expressions inGerman Telephone SpeechChristoph Draxler

1910 Viterbi Search Visualization Using Vista: A GenericPerformance Visualization ToolRobert H. Halstead Jr., Ben Serridge, Jean-Manuel VanThong, William Goldenthal

1914 A Multilingual Phonetic Representation and AnalysisSystem for Different Speech DatabasesToomas Altosaar, Matti Karjalainen, Martti Vainio

1918 FRESCO: The French Telephone Speech DataCollection - Part of the European SpeechDat(M) ProjectD. Langmann, R. Haeb-Umbach, Louis Boves, E. den Os

1922 Predicting the Out-of-Vocabulary Rate and the RequiredVocabulary Size for Speech Processing ApplicationsJohannes Miiller, Holger StahL Manfred Lang

1926 AMULET: Automatic MUltisensor Speech Labellingand Event Tracking: Study of the Spatio-temporalCorrelations in Voiceless Plosive ProductionNathalie Parlangeau, Alain Marchal

1930 Constructing Multi-level Speech Database forSpontaneous Speech ProcessingMinsoo Hahn, Sanghun Kim, Jung-Chul Lee, Yong-JuLee

1934 Preliminaries to a Romanian Speech DatabaseMarian Boldea, Alin Doroga, Tiberiu Dumitrescu,Maria Pescaru

1938 Labelled Data Bank of Spoken Standard German TheKiel Corpus of Read/Spontaneous SpeechKlaus J. Kohler

1942 SAPPHIRE: An Extensible Speech Analysis andRecognition Tool Based on Tcl/TkLee Hetherington, Michael McCandless

1946 Automatic Detection of Topic Boundaries and Keywordsin Arbitrary Speech Using Incremental ReferenceInterval-free Continuous DPJiro Kiyama, Yoshiaki Itoh, Ryuichi Oka

1950 Very-large-vocabulary Mandarin Voice Message FileRetrieval using Speech QueriesBo-Ren Bai, Lee-Feng Chien, Lin-Shan Lee

1954 Gandalf - A Swedish Telephone Speaker VerificationDatabaseH.Melin

1958 The DCIEM Map Task Corpus: Spontaneous DialogueUnder Sleep Deprivation and Drug TreatmentEllen Gurman Bard, C. Sotillo, A. H. Anderson, M. M.Taylor

1962 The Nemours Database of Dysarthric Speech

Xavier Menendez-PidaL James B. Polikoff, Shirley M.Peters, Jennie E. Leonzio, H.T. Bunnell

1966 POST: Parallel Object-oriented Speech ToolkitJean Hennebert, Dijana Petrovska Delacritaz

2013 Lombard Effect Compensation and Noise Suppressionfor Noisy Lombard Speech RecognitionSang-mun Chi, Yung-Hwan Oh

1970 Recognizing Emotion in SpeechFrank Dellaert, Thomas Polzin, Alex Waibel

1974 Emotions in Time Domain SynthesisBarbara Heuft, Thomas Portele, Monika Rauth

1978 Word Class Driven Synthesis of Prosodic AnnotationsSimon Amfield

1981 Dynamical Modelling of Vowel Sounds as a SynthesisToolM. Banbrook, S. McLaughlin

1985 Emotional Speech Elicited using Computer GamesTom Johnstone

1989 Automatic Statistical Analysis of the Signal andProsodic Signs of Emotion in SpeechRoddy Cowie, Ellen Douglas-Cowie

'^ "•" T ^

£f r4r"-* ? *

— Ridmst

* ^ . r 1 1 _ • > •

%KedliPrw-'* f

n o

1993

1997

2001

2005

2009

Channel and Noise Normalization Using AffineTransformed CepstrumXiaoyu Zhang, Richard J. Mammone

Spectral Estimation and Normalisation for RobustSpeech RecognitionTom Claes, Fei Xie, Dirk Van Compemolle

Trellis Encoded Vector Quantization for Robust SpeechRecognitionWu Chou, Nambi Seshadri, Mazin Rahim

Phone Clustering using the Bhattacharyya DistanceBrian Mak, Etienne Barnard

Variability of Lombard Effects Under Different NoiseConditionsAtsushi Wakao, Kazuya Takeda, Fumitada Itakura

2017 The Use of Shibboleth Words for AutomaticallyClassifying Speakers by DialectA.W.F. Huggins, Yogen Patel

' The Organization of Dialect Diversity in North AmericaWilliam Labov

2021 Data Collection of Japanese Dialects and its Influenceinto Speech RecognitionIkuo Kudo, Takao Nakama, Tomoko Watanabe, ReikoKameyama

2025 Statistical Dialect Classification Based on MeanPhonetic FeaturesDavid R. Miller, James Trischitta

2028 Norwegian Numerals: a Challenge to Automatic SpeechRecognitionKnut Kvale

2032 Evaluation of the Telefdnica I+D Natural NumbersRecognizer over Different Dialects of Spanish fromSpain and AmericaC. de la Torre, J. Caminero-Gil, J. Alvarez. C. Martindel Alamo, L Hemdndez-Gdmez

2036 Rhythmic Constraints on English Stress TimingFred Cummins, Robert F. Port

2040 On the Interaction of Clash, Focus and PhonologicalPhrasingIrene Vogel, Steve Ho skins

2044 On the Quantal Nature of Speech TimingGunnar Font, Anita Kruckenberg

2048 Differential Perception of Tonal Contours Through theSyllableDavid House

2052 Pitch, Loudness, and Segmental Duration Correlates:Towards a Model for the Phonetic Aspects of FinnishProsodyMartti Vainio, Toomas Altosaar

2056 Prosodic Manipulation System of Speech Material forPerceptual ExperimentsNobuaki Minematsu, Seiichi Nakagawa, Keikichi Hirose

proceedings - gbv · 2007-03-16 · proceedings october 3-6, 1996 wyndham franklin plaza hotel...

Documents