chinese-english mixlingual automatic speech recognition system

Introduction

”

Chinese-English Mixlingual Automatic Speech Recognition SystemEmily Hua, Kelly Chen, Wendy Wang

Department of Computer Science, Engineering, QMSS @ Columbia University in the City of New York

GMM-HMM

Hybrid System

Data Processing (Acoustic Data, Language Data)

Features and Network Setup

Data Preprocessing (Transcript Clean up) & Data Description Results

GMM-HMM Acoustic Features & Language Model Further Thoughts

Hybrid System DNN References

40 x 1features

Fig 1.University Of Edinburgh-ASR Course Slides, 2013Fig 2.GMM-HMM语音识别模型-原理篇, Rachel Zhang, 2014Fig 3. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition, Dahl et al., 2011Fig 6.Decoding graph construction in Kaldi: A visual walkthrough, Vassil Panayotov, 2012Some Kaldi Notes, Josh Meyer, 2016[1][2]Improving Deep Neural Network Acoustic Models Using Generalized Maxout Networks, Povey et al, 2014[3]A First Speech Recognition System For Mandarin-English Code-switch Conversational Speech, Vu et al., 2012[4]Audio Augmentation for Speech Recognition, Ko et al., 2015[5]Vocal Tract Length Perturbation (VTLP) improves speech recognition, Jaitly & Hinton, 2013

Deep learning project presentation 今晚 due

\2-gram

\3-gram

http://www.inf.ed.ac.uk/teaching/courses/asr/2012-13/asr03-hmmgmm-4up.pdf

http://www.inf.ed.ac.uk/teaching/courses/asr/2012-13/asr03-hmmgmm-4up.pdf

http://blog.csdn.net/abcjennifer/article/details/27346787

http://blog.csdn.net/abcjennifer/article/details/27346787

https://blog.acolyer.org/2016/04/19/context-dependent-pre-trained-deep-neural-networks-for-large-vocabulary-speech-recognition/

https://blog.acolyer.org/2016/04/19/context-dependent-pre-trained-deep-neural-networks-for-large-vocabulary-speech-recognition/

http://vpanayotov.blogspot.com/2012/06/kaldi-decoding-graph-construction.html

http://vpanayotov.blogspot.com/2012/06/kaldi-decoding-graph-construction.html

http://jrmeyer.github.io/kaldi/2016/02/01/Kaldi-notes.html

http://jrmeyer.github.io/kaldi/2016/02/01/Kaldi-notes.html

http://www.danielpovey.com/files/2014_icassp_dnn.pdf

http://www.danielpovey.com/files/2014_icassp_dnn.pdf

https://csl.anthropomatik.kit.edu/downloads/vu_CodeSwitch_icassp2012.pdf

https://csl.anthropomatik.kit.edu/downloads/vu_CodeSwitch_icassp2012.pdf

http://www.danielpovey.com/files/2015_interspeech_augmentation.pdf

http://www.danielpovey.com/files/2015_interspeech_augmentation.pdf

http://www.cs.toronto.edu/~ndjaitly/jaitly-icml13.pdf

http://www.cs.toronto.edu/~ndjaitly/jaitly-icml13.pdf

chinese-english mixlingual automatic speech recognition system

Documents