
Effective Use of Linguistic and Contextual Information for Statistical Machine Translation
Libin Shen, Jinxi Xu, Bing Zhang, Spyros Matsoukas, and Ralph Weischedel
BBN Technologies
EMNLP 2009
Presented by Cai




Question

Lexical features are useful in MT
But the number of parameters is large
How can these features be used effectively?


Previous Work

Discriminative training of the parameters: needs a scalable development set and careful selection

Estimating a single score or likelihood of a translation with rich features (using maximum entropy, ME): the feature space is too large to be practical


Main Contribution

Design effective and efficient statistical models (simple probabilistic models) that capture useful linguistic and contextual information for MT decoding

Features: robust and ideal


Features introduced

Non-terminal labels (+performance)

Length distribution of non-terminals (+performance)

Source-side context information (+performance)

Source-side structural information (dependency information): no performance gain, surprisingly


What’s special

Assume the length distribution of non-terminals is Gaussian (sampling, estimation, smoothing)

Soft dependency constraints by introducing labels of non-terminals

Context language model

String-to-dependency rule -> dependency-to-dependency rule
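
As a rough illustration of the Gaussian length assumption above, the following Python sketch fits a mean and variance to sampled non-terminal span lengths and scores a candidate length by its log density. It is only a sketch under stated assumptions: the slides do not give the estimation or smoothing formulas, so the shrinkage toward a prior (prior_mean, prior_var, prior_weight), the variance floor min_var, and the function names are all hypothetical.

```python
import math


def fit_gaussian(lengths, prior_mean=0.0, prior_var=1.0,
                 prior_weight=0.0, min_var=0.01):
    """Estimate (mean, variance) of span lengths.

    Smoothing here is an assumed scheme: the prior acts like
    `prior_weight` pseudo-observations, and the variance is floored.
    """
    total = len(lengths) + prior_weight
    if total <= 0:
        raise ValueError("need at least one sample or a positive prior_weight")
    mean = (sum(lengths) + prior_weight * prior_mean) / total
    sq_dev = sum((x - mean) ** 2 for x in lengths) + prior_weight * prior_var
    var = max(sq_dev / total, min_var)
    return mean, var


def log_density(length, mean, var):
    """Log of the Gaussian density at `length`."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (length - mean) ** 2 / var)


# Sampled lengths of one non-terminal slot in one (hypothetical) rule.
mean, var = fit_gaussian([2, 4, 6])
typical = log_density(4, mean, var)
unusual = log_density(8, mean, var)
```

A score like this could serve as one feature value during decoding, where a span whose length is far from the mean is penalized; with few samples for a rule, a positive prior_weight keeps the estimate from collapsing onto those samples.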


Experiments

Baseline: string-to-dependency system presented in (Shen et al., 2008)

Test each feature and their combinations

Arabic-to-English (A-E) and Chinese-to-English (C-E)

Measures: BLEU and TER

Results: improvements of 2 points of BLEU in A-E and 1 point of BLEU in C-E (nist06); 1.7 points of BLEU in A-E and 0.8 points of BLEU in C-E (nist06)


Main Related Work

Z. He, Q. Liu, and S. Lin. 2008. Improving Statistical Machine Translation Using Lexicalized Rule Selection. COLING 2008

A. Ittycheriah and S. Roukos. 2007. Direct Translation Model 2. NAACL 2007

L. Shen, J. Xu, and R. Weischedel. 2008. A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model. ACL 2008
