effective use of linguistic and contextual information for statistical machine translation libin...
TRANSCRIPT
![Page 1: Effective Use of Linguistic and Contextual Information for Statistical Machine Translation Libin Shen and Jinxi Xu and Bing Zhang and Spyros Matsoukas](https://reader036.vdocument.in/reader036/viewer/2022083005/56649f275503460f94c3e865/html5/thumbnails/1.jpg)
Effective Use of Linguistic and Contextual Informationfor Statistical Machine
TranslationLibin Shen and Jinxi Xu and Bing Zhang and
Spyros Matsoukas and RalphWeischedelBBN Technologies
EMNLP2009Presented by Cai
![Page 2: Effective Use of Linguistic and Contextual Information for Statistical Machine Translation Libin Shen and Jinxi Xu and Bing Zhang and Spyros Matsoukas](https://reader036.vdocument.in/reader036/viewer/2022083005/56649f275503460f94c3e865/html5/thumbnails/2.jpg)
Question
Lexical features are useful in MT But parameter’s number is large How to effectively use these features?
![Page 3: Effective Use of Linguistic and Contextual Information for Statistical Machine Translation Libin Shen and Jinxi Xu and Bing Zhang and Spyros Matsoukas](https://reader036.vdocument.in/reader036/viewer/2022083005/56649f275503460f94c3e865/html5/thumbnails/3.jpg)
Previous Work
Discriminative training the parameters : the need of scalable development set and careful selection
Estimate a single score or likelihood of a translation with rich features (using ME): feature space too large, not practical
![Page 4: Effective Use of Linguistic and Contextual Information for Statistical Machine Translation Libin Shen and Jinxi Xu and Bing Zhang and Spyros Matsoukas](https://reader036.vdocument.in/reader036/viewer/2022083005/56649f275503460f94c3e865/html5/thumbnails/4.jpg)
Main Contribution
Design effective and efficient statistical models (simple probabilistic models) to capture useful linguistic and context information for MT decoding
Features: robust and ideal
![Page 5: Effective Use of Linguistic and Contextual Information for Statistical Machine Translation Libin Shen and Jinxi Xu and Bing Zhang and Spyros Matsoukas](https://reader036.vdocument.in/reader036/viewer/2022083005/56649f275503460f94c3e865/html5/thumbnails/5.jpg)
Features introduced
non-terminal labels (+performance) Length distribution of non-terminals
(+performance) Source-side context information
(+performance) Source-side structural information
(dependency information) no performance gain, surprisingly
![Page 6: Effective Use of Linguistic and Contextual Information for Statistical Machine Translation Libin Shen and Jinxi Xu and Bing Zhang and Spyros Matsoukas](https://reader036.vdocument.in/reader036/viewer/2022083005/56649f275503460f94c3e865/html5/thumbnails/6.jpg)
What’s special
Assume the distribution of length of non-terminal is Gaussian (sampling,estimation, smoothing)
Soft dependency constraints by introducing labels of non-terminals
Context language model String-to-dependency rule-> dependency-to-
dependency rule
![Page 7: Effective Use of Linguistic and Contextual Information for Statistical Machine Translation Libin Shen and Jinxi Xu and Bing Zhang and Spyros Matsoukas](https://reader036.vdocument.in/reader036/viewer/2022083005/56649f275503460f94c3e865/html5/thumbnails/7.jpg)
Experiments
Baseline: string-to-dependency system presented in (Shen et.al 2008)
Test each feature and their combinations Arabic-to-English and Chinese-to-English Measure: Bleu and TER Results: 2 points of BLEU in A-E and 1 points of B
LEU in C-E (nist06); 1.7 points of BLEU in A-E and 0.8 points of BLEU in C-E (nist06); 1.7 poi
![Page 8: Effective Use of Linguistic and Contextual Information for Statistical Machine Translation Libin Shen and Jinxi Xu and Bing Zhang and Spyros Matsoukas](https://reader036.vdocument.in/reader036/viewer/2022083005/56649f275503460f94c3e865/html5/thumbnails/8.jpg)
Main Related Work
Z. He, Q. Liu, and S. Lin. 2008. Improving statistical machine translation using lexicalized rule, COLING ’08
A. Ittycheriah and S. Roukos. 2007. Direct translation model 2. NACCL 07
L. Shen, J. Xu, and R. Weischedel. 2008. A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model. ACL 2008