modified distortion matrices for phrase-based smt arianna bisazza & marcello federico – fbk...

Post on 01-Apr-2015

213 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Modified Distortion Matrices for Phrase-Based SMT

Arianna Bisazza & Marcello Federico – FBK (Italy)

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9w1

0

<s> 0 1 2 3 4 5 6 7 8 9 10w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 5 6 7 8w2 3 2 0 1 2 3 4 5 6 7w3 4 3 2 0 1 2 3 4 5 6w4 5 4 3 2 0 1 2 3 0 5w5 6 5 4 3 2 0 1 2 3 4w6 7 6 5 4 3 2 0 1 2 3w7 8 7 6 5 4 3 2 0 1 0w8 9 8 7 6 5 4 3 2 0 0w9 10 9 8 7 6 2 2 3 2 0w10 11 10 9 8 7 6 5 4 3 2

2 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

2

PSMT decoding overview

E' necessario incoraggiare tale mobilità garantendo la sicurezza dei percorsi professionali

3

Freedom of movement

must be encouraged

LM scores

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

3

PSMT decoding overview

E' necessario incoraggiare tale mobilità garantendo la sicurezza dei percorsi professionali

LM scores

TM

scores

TM scores

ReoM scores

ReoM scores

4

career paths …

while ensuring that

Freedom of movement

must be encouraged

LM scoresLM scoresLM scores

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

4

PSMT decoding overview

E' necessario incoraggiare tale mobilità garantendo la sicurezza dei percorsi professionali

LM scores

TM scoresTM

scores TM

scor

esTM

scores

ReoM scores

ReoM scores

ReoM scores

ReoM scores

5

LM scoresLM scoresLM scores

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

5

PSMT decoding overview

E' necessario incoraggiare tale mobilità garantendo la sicurezza dei percorsi professionali

Freedom of movement must be encouraged while ensuring that career paths …

LM scores

TM scoresTM

scores TM

scor

esTM

scores

ReoM scores

ReoM scores

ReoM scores

ReoM scores

6 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

6

Reordering Models

E' necessario incoraggiare tale mobilità garantendo la sicurezza dei percorsi professionali

ReoM scores

ReoM scores

ReoM scores

ReoM scores

Many solutions have been proposed

with different reo. classes, features,

train modes etc.

Tillman 04, Zens & Ney 06AlOnaizan & Papineni 06Galley & Manning 08Green & al.10, Feng & al.10…

7 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

7

Reordering Models

E' necessario incoraggiare tale mobilità garantendo la sicurezza dei percorsi professionali

ReoM scores

ReoM scores

ReoM scores

ReoM scores

No matter what reordering model is used, permutation search space must be limited!

The power of all reordering models is bound to the reordering constraints in use

Tillman04, Zens&Ney06AlOnaizan & Papineni06Galley & Manning08Green &al.10, Feng &al.10…

Many solutions have been proposed

with different reo. classes, features,

train modes etc.

Tillman 04, Zens & Ney 06AlOnaizan & Papineni 06Galley & Manning 08Green & al.10, Feng & al.10…

8 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

8

E' necessario incoraggiare tale mobilità garantendo la sicurezza dei percorsi professionali

ReoM scores

ReoM scores

ReoM scores

ReoM scores

9 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

9

Reordering Constraints

E' necessario incoraggiare tale mobilità garantendo la sicurezza dei percorsi professionali

#perm.=11!≈40,000,000

ReoM scores

ReoM scores

ReoM scores

ReoM scores

10 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

10

E' necessario incoraggiare tale mobilità garantendo la sicurezza dei percorsi professionali

Source-to-Source distortion

#perm.=11!≈40,000,000

D(x,y)=|y-x-1|

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9 w10

<s>

0 1 2 3 4 5 6 7 8 9 10

w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 5 6 7 8w2 3 2 0 1 2 3 4 5 6 7w3 4 3 2 0 1 2 3 4 5 6w4 5 4 3 2 0 1 2 3 4 5w5 6 5 4 3 2 0 1 2 3 4w6 7 6 5 4 3 2 0 1 2 3w7 8 7 6 5 4 3 2 0 1 2w8 9 8 7 6 5 4 3 2 0 1w9 10 9 8 7 6 5 4 3 2 0w10 11 10 9 8 7 6 5 4 3 2

Reordering Constraints

11 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

11

E' necessario incoraggiare tale mobilità garantendo la sicurezza dei percorsi professionali

Source-to-Source distortion

#perm.=11!≈40,000,000

D(x,y)=|y-x-1|

DL=3 #perm.≈7,000

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9 w10

<s>

0 1 2 3 4 5 6 7 8 9 10

w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 5 6 7 8w2 3 2 0 1 2 3 4 5 6 7w3 4 3 2 0 1 2 3 4 5 6w4 5 4 3 2 0 1 2 3 4 5w5 6 5 4 3 2 0 1 2 3 4w6 7 6 5 4 3 2 0 1 2 3w7 8 7 6 5 4 3 2 0 1 2w8 9 8 7 6 5 4 3 2 0 1w9 10 9 8 7 6 5 4 3 2 0w10 11 10 9 8 7 6 5 4 3 2

DL: distortion limit

Reordering Constraints

12 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

12

The problem with DL…

Arabic-English

AR

EN

AR

EN

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9 w1

0

<s>

0 1 2 3 4 5 6 7 8 9 10

w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 5 6 7 8w2 3 2 0 1 2 3 4 5 6 7w3 4 3 2 0 1 2 3 4 5 6w4 5 4 3 2 0 1 2 3 4 5w5 6 5 4 3 2 0 1 2 3 4w6 7 6 5 4 3 2 0 1 2 3w7 8 7 6 5 4 3 2 0 1 2w8 9 8 7 6 5 4 3 2 0 1w9 10 9 8 7 6 5 4 3 2 0w10 1110 9 8 7 6 5 4 3 2

13 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

13

German-English

DE

EN

DE

EN

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9 w1

0

<s>

0 1 2 3 4 5 6 7 8 9 10

w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 5 6 7 8w2 3 2 0 1 2 3 4 5 6 7w3 4 3 2 0 1 2 3 4 5 6w4 5 4 3 2 0 1 2 3 4 5w5 6 5 4 3 2 0 1 2 3 4w6 7 6 5 4 3 2 0 1 2 3w7 8 7 6 5 4 3 2 0 1 2w8 9 8 7 6 5 4 3 2 0 1w9 10 9 8 7 6 5 4 3 2 0w10 1110 9 8 7 6 5 4 3 2

The problem with DL…

14 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

14

Source-to-Source distortion

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9 w10

<s>

0 1 2 3 4 5 6 7 8 9 10

w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 5 6 7 8w2 3 2 0 1 2 3 4 5 6 7w3 4 3 2 0 1 2 3 4 5 6w4 5 4 3 2 0 1 2 3 4 5w5 6 5 4 3 2 0 1 2 3 4w6 7 6 5 4 3 2 0 1 2 3w7 8 7 6 5 4 3 2 0 1 2w8 9 8 7 6 5 4 3 2 0 1w9 10 9 8 7 6 5 4 3 2 0w10 11 10 9 8 7 6 5 4 3 2

Current solution:

increase the DLimit

#perm.=11! ≈40,000,000

D(x,y)=|y-x-1|

DL=3 #perm.≈7,000

15 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

15

Source-to-Source distortion

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9 w10

<s>

0 1 2 3 4 5 6 7 8 9 10

w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 5 6 7 8w2 3 2 0 1 2 3 4 5 6 7w3 4 3 2 0 1 2 3 4 5 6w4 5 4 3 2 0 1 2 3 4 5w5 6 5 4 3 2 0 1 2 3 4w6 7 6 5 4 3 2 0 1 2 3w7 8 7 6 5 4 3 2 0 1 2w8 9 8 7 6 5 4 3 2 0 1w9 10 9 8 7 6 5 4 3 2 0w10 11 10 9 8 7 6 5 4 3 2

Current solution:

increase the DLimit

Generally leads to worse translations!

#perm.=11! ≈40,000,000

D(x,y)=|y-x-1|

DL=3 #perm.≈7,000

DL=7 #perm.≈7,000,000

16 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

16

Source-to-Source distortion

#perm.=11! ≈40,000,000

D(x,y)=|y-x-1|

DL=3 #perm.≈7,000

DL=7 #perm.≈7,000,000

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9 w10

<s>

0 1 2 3 4 5 6 7 8 9 10

w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 5 6 7 8w2 3 2 0 1 2 3 4 5 6 7w3 4 3 2 0 1 2 3 4 5 6w4 5 4 3 2 0 1 2 3 4 5w5 6 5 4 3 2 0 1 2 3 4w6 7 6 5 4 3 2 0 1 2 3w7 8 7 6 5 4 3 2 0 1 2w8 9 8 7 6 5 4 3 2 0 1w9 10 9 8 7 6 5 4 3 2 0w10 11 10 9 8 7 6 5 4 3 2

Our solution:

17 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

17

Source-to-Source distortion

#perm.=11! ≈40,000,000

D(x,y)=|y-x-1|

DL=3 #perm.≈7,000

DL=7 #perm.≈7,000,000

DL=3 & modif(D)

#perm.≈20,000

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9 w10

<s>

0 1 2 3 4 5 6 7 8 9 10

w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 0 0 7 8w2 3 2 0 1 2 3 0 0 6 7w3 4 3 2 0 1 2 3 4 5 6w4 5 4 3 2 0 1 2 3 4 5w5 6 5 4 3 2 0 1 2 3 0w6 7 6 5 4 3 2 0 1 2 3w7 8 7 6 5 4 3 2 0 1 2w8 9 8 7 6 5 4 3 2 0 1w9 10 9 8 2 2 5 4 3 2 0w10 11 10 9 8 7 6 5 4 3 2

Our solution:

modify distortion for each test

sentence

Simplifies the task of reordering models!

18

Rest of the talk:

How to modify the distortion matrix?

What effect on translation quality?

What effect on baseline runtimes?

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

19 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

19

Chunk-basedfuzzy reordering

rulesShallow syntax chunking:

• cheaper and easier than deep parsing

• constrains reorderings in a softer way

Fuzzy (non-determinisic) reordering rules:

• generate N permutations for each matching sequence

• final reordering decision is taken during translation, guided by all SMT models (reoM, LM...)

Few rules for language pair, to only capture long reordering

20 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

20

Arabic-English

“Move verb chunk (and following chunk) to the right by 1 to N

chunks”

Chunk-basedfuzzy reordering

rules

CC1 VC2 PC3 NC4 PC5 Pct6

w- $Ark fy AltZAhrp E$rAt AlmslHyn mn AlktA}b . and took part in the march dozens of militants from the Brigades

21 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

21

Arabic-English

“Move verb chunk (and following chunk) to the right by 1 to N

chunks”

CC1 VC2 PC3 NC4 PC5 Pct6

CC1 VC2PC3 NC4 PC5

VC2PC3 NC4

VC2PC3 NC4 PC5

CC1

CC1

PC5

Pct6

Pct6

Pct6

w- $Ark fy AltZAhrp E$rAt AlmslHyn mn AlktA}b . and took part in the march dozens of militants from the Brigades

Chunk-basedfuzzy reordering

rules

22 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

22

Arabic-English

“Move verb chunk (and following chunk) to the right by 1 to N

chunks”

CC1 VC2 PC3 NC4 PC5 Pct6

CC1 VC2PC3 NC4 PC5

VC2 PC3NC4

VC2PC3 NC4

VC2 PC3NC4 PC5

VC2PC3 NC4 PC5

CC1

CC1

CC1

CC1

PC5

PC5

Pct6

Pct6

Pct6

Pct6

Pct6

w- $Ark fy AltZAhrp E$rAt AlmslHyn mn AlktA}b . and took part in the march dozens of militants from the Brigades

Chunk-basedfuzzy reordering

rules

23 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

23

CC1 VC2 PC3 NC4 PC5 Pct6

CC1 VC2PC3 NC4 PC5

VC2 PC3NC4

VC2PC3 NC4

VC2 PC3NC4 PC5

VC2PC3 NC4 PC5

CC1

CC1

CC1

CC1

PC5

PC5

Pct6

Pct6

Pct6

Pct6

Pct6

w- $Ark fy AltZAhrp E$rAt AlmslHyn mn AlktA}b . and took part in the march dozens of militants from the Brigades

Chunk-basedfuzzy reordering

rulesReordering selection

Reordered source LM

0.9

0.4

0.10.1

0.7

24 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

24

CC1 VC2 PC3 NC4 PC5 Pct6

CC1 VC2PC3 NC4 PC5

VC2 PC3

Pct6

Pct6

w- $Ark fy AltZAhrp E$rAt AlmslHyn mn AlktA}b . and took part in the march dozens of militants from the Brigades

Chunk-basedfuzzy reordering

rulesReordering selection

Reordered source LM

0.9

0.7

0.4

0.10.1

Reorderings to encode in the distortion matrix

NC4 PC5 CC1

25 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

25

Modifying the distortion

matrix

CC1 VC2 PC3 NC4 PC5 Pct6

w0 w1 w2 w3 w4 w5 w6 w7 w8

<s>

0 1 2 3 4 5 6 7 8

CC1 w0 0 1 2 3 4 5 6 7VC2 w1 2 0 1 2 3 4 5 6

PC3

w2 3 2 0 1 2 3 4 5w3 4 3 2 0 1 2 3 4

NC4

w4 5 4 3 2 0 1 2 3w5 6 5 4 3 2 0 1 2

PC5

w6 7 6 5 4 3 2 0 1w7 8 7 6 5 4 3 2 0

Pct6 w8 9 8 7 6 5 4 3 2

CC1 VC2PC3 NC4 PC5

VC2 PC3

Reorderings to encode in the distortion matrix

NC4 PC5 CC1

Pct6

Pct6

26

CC1 VC2PC3 NC4 PC5

VC2 PC3

Reorderings to encode in the distortion matrix

NC4 PC5 CC1

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

26

Modifying the distortion

matrix

CC1 VC2 PC3 NC4 PC5 Pct6

w0 w1 w2 w3 w4 w5 w6 w7 w8

<s>

0 1 2 3 4 5 6 7 8

CC1 w0 0 0 0 3 4 5 6 7VC2 w1 2 0 1 2 3 4 5 6

PC3

w2 3 2 0 1 2 3 4 5w3 4 3 2 0 1 2 3 4

NC4

w4 5 4 3 2 0 1 2 3w5 6 5 4 3 2 0 1 2

PC5

w6 7 6 5 4 3 2 0 1w7 8 7 6 5 4 3 2 0

Pct6 w8 9 8 7 6 5 4 3 2

Pct6

Pct6

27 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

27

Modifying the distortion

matrix

CC1 VC2 PC3 NC4 PC5 Pct6

w0 w1 w2 w3 w4 w5 w6 w7 w8

<s>

0 1 2 3 4 5 6 7 8

CC1 w0 0 0 0 3 4 5 6 7VC2 w1 2 0 1 2 3 4 5 6

PC3

w2 3 2 0 1 2 3 4 5w3 4 2 2 0 1 2 3 4

NC4

w4 5 4 3 2 0 1 2 3w5 6 5 4 3 2 0 1 2

PC5

w6 7 6 5 4 3 2 0 1w7 8 7 6 5 4 3 2 0

Pct6 w8 9 8 7 6 5 4 3 2

CC1 VC2PC3 NC4 PC5

VC2 PC3

Reorderings to encode in the distortion matrix

NC4 PC5 CC1

Pct6

Pct6

28 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

28

Modifying the distortion

matrix

CC1 VC2 PC3 NC4 PC5 Pct6

w0 w1 w2 w3 w4 w5 w6 w7 w8

<s>

0 1 2 3 4 5 6 7 8

CC1 w0 0 0 0 3 4 5 6 7VC2 w1 2 0 1 0 0 4 5 6

PC3

w2 3 2 0 1 2 3 4 5w3 4 2 2 0 1 2 3 4

NC4

w4 5 4 3 2 0 1 2 3w5 6 5 4 3 2 0 1 2

PC5

w6 7 6 5 4 3 2 0 1w7 8 7 6 5 4 3 2 0

Pct6 w8 9 8 7 6 5 4 3 2

CC1 VC2PC3 NC4 PC5

VC2 PC3

Reorderings to encode in the distortion matrix

NC4 PC5 CC1

Pct6

Pct6

29 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

29

Modifying the distortion

matrix

CC1 VC2 PC3 NC4 PC5 Pct6

w0 w1 w2 w3 w4 w5 w6 w7 w8

<s>

0 1 2 3 4 5 6 7 8

CC1 w0 0 0 0 0 0 5 6 7VC2 w1 2 0 1 0 0 4 5 6

PC3

w2 3 2 0 1 2 3 4 5w3 4 2 2 0 1 2 3 4

NC4

w4 5 4 3 2 0 1 2 3w5 6 5 4 3 2 0 1 2

PC5

w6 7 6 5 4 3 2 0 1w7 8 7 6 5 4 3 2 0

Pct6 w8 9 8 7 6 5 4 3 2

CC1 VC2PC3 NC4 PC5

VC2 PC3

Reorderings to encode in the distortion matrix

NC4 PC5 CC1

Pct6

Pct6

30 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

30

Modifying the distortion

matrix

CC1 VC2 PC3 NC4 PC5 Pct6

w0 w1 w2 w3 w4 w5 w6 w7 w8

<s>

0 1 2 3 4 5 6 7 8

CC1 w0 0 0 0 0 0 5 6 7VC2 w1 2 0 1 0 0 4 5 6

PC3

w2 3 2 0 1 2 3 4 5w3 4 2 2 0 1 2 3 4

NC4

w4 5 4 3 2 0 1 2 3w5 6 5 4 3 2 0 1 2

PC5

w6 7 2 5 4 3 2 0 1w7 8 2 6 5 4 3 2 0

Pct6 w8 9 8 7 6 5 4 3 2

CC1 VC2PC3 NC4 PC5

VC2 PC3

Reorderings to encode in the distortion matrix

NC4 PC5 CC1

Pct6

Pct6

31 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

31

Modifying the distortion

matrix

CC1 VC2 PC3 NC4 PC5 Pct6

w0 w1 w2 w3 w4 w5 w6 w7 w8

<s>

0 1 2 3 4 5 6 7 8

CC1 w0 0 0 0 0 0 5 6 7VC2 w1 2 0 1 0 0 4 5 6

PC3

w2 3 2 0 1 2 3 4 0w3 4 2 2 0 1 2 3 0

NC4

w4 5 4 3 2 0 1 2 3w5 6 5 4 3 2 0 1 2

PC5

w6 7 2 5 4 3 2 0 1w7 8 2 6 5 4 3 2 0

Pct6 w8 9 8 7 6 5 4 3 2

CC1 VC2PC3 NC4 PC5

VC2 PC3

Reorderings to encode in the distortion matrix

NC4 PC5 CC1

Pct6

Pct6

32 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

32

Modifying the distortion

matrix

CC1 VC2 PC3 NC4 PC5 Pct6

w0 w1 w2 w3 w4 w5 w6 w7 w8

<s>

0 1 2 3 4 5 6 7 8

CC1 w0 0 0 0 0 0 5 6 7VC2 w1 2 0 1 0 0 4 5 6

PC3

w2 3 2 0 1 2 3 4 0w3 4 2 2 0 1 2 3 0

NC4

w4 5 4 3 2 0 1 2 3w5 6 5 4 3 2 0 1 2

PC5

w6 7 2 5 4 3 2 0 1w7 8 2 6 5 4 3 2 0

Pct6 w8 9 8 7 6 5 4 3 2

CC1 VC2PC3 NC4 PC5

VC2 PC3

Reorderings to encode in the distortion matrix

NC4 PC5 CC1

Pct6

Pct6

33 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

33

Experiments

• Tasks: NIST-MT09 for Ar-En, WMT10 for De-En

• Systems based on Moses, include state-of-the-art hierarchical lexicalized reordering models [Tillmann 04; Koehn & al 05; Galley & Manning 08]

• Baseline Distortion Limits: 5 in Ar-En, 10 in De-En

• Evaluation by: - BLEU for lexical match & local order - KRS for global order

35 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

35

Arabic-English:

Test set: eval09-NW

Distortion modified with 3-best reorderings per rule-matching sequence

Translation QualityTranslation Time

+0.9 BLEU+0.6 KRS(signif.)

37 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

37

German-English:

Test set: newstest10

Distortion modified with 3-best reorderings per rule-matching sequence

Translation QualityTranslation Time

+0.4 BLEU+0.7 KRS(signif.)

38 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

38

Conclusions

• Modified distortion allows for finer & linguistically motivated definition of search space

• We achieve better translation & faster decoding in language pairs where long reordering concentrates on few patterns

• Our method is complementary to reordering modeling

• For now, few reordering rules are needed to modify distortion

• We are currently working on a fully data-driven approach to replace the rules

39 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

39

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9 w1

0

<s>

0 1 2 3 4 5 6 7 8 9 10

w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 5 6 7 8w2 3 T 0 1 2 3 4 5 6 7w3 4 H 2 0 1 2 3 Y 5 6w4 5 A T T E N T I O N !w5 6 N 4 3 2 0 1 U 3 4w6 7 K 5 4 3 2 F O R 2 3w7 8 S 6 5 4 3 2 0 1 2w8 9 8 7 6 5 4 3 2 0 1w9 10 9 8 7 6 5 4 3 2 0w10 11 10 9 8 7 6 5 4 3 2

40 A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

A. Bisazza & M. Federico – Modified Distortion Matrices for PSMT

40

w0 w1 w2 w3 w4 w5 w6 w7 w8 w9 w1

0

<s>

0 1 2 3 4 5 6 7 8 9 10

w0 0 1 2 3 4 5 6 7 8 9w1 2 0 1 2 3 4 5 6 7 8w2 3 T 0 1 2 3 4 5 6 7w3 4 H 2 0 1 2 3 Y 5 6w4 5 A T T E N T I O N !w5 6 N 4 3 2 0 1 U 3 4w6 7 K 5 4 3 2 F O R 2 3w7 8 S 6 5 4 3 2 0 1 2w8 9 8 7 6 5 4 3 2 0 1w9 10 9 8 7 6 5 4 3 2 0w10 11 10 9 8 7 6 5 4 3 2

top related