TAUS Open Source Machine Translation Showcase, Paris, Sándor Sojnóczky, Hunnect, 4 June 2012

TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE: The ups and downs of implementing an MT environment, 15:45-16:00, Monday 4 June, Sándor Sojnóczky, Hunnect

Upload: taus-enabling-better-translation

Post on 29-Nov-2014


DESCRIPTION

This presentation is part of the MosesCore project, which encourages the development and use of open source machine translation tools, notably the Moses statistical MT toolkit. MosesCore is supported by European Commission Grant Number 288487 under the 7th Framework Programme. For the latest updates, follow us on Twitter: #MosesCore

TRANSCRIPT

Page 1: TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Sándor Sojnóczky, Hunnect, 4 June 2012

TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE

The ups and downs of implementing an MT environment

15:45-16:00, Monday 4 June

Sándor Sojnóczky, Hunnect

Page 2:

The Ups and Downs of Implementing an MT Environment

English Hungarian

Page 3:

Introduction

Hunnect Limited (since 2003)

LSP based in Hungary

Sándor Sojnóczky, MITI (linguist, economist, certified translator and university lecturer)

Page 4:

Background

Experience – 10 years, over 120 million words

Service requirements – Translation-Editing-Proofreading

Languages – Mainly Eastern European (35+)

Volumes – the number of words is constantly growing

Business Model – Faster–Better–More Economical


Page 5:

What changed in the past 10 years?

Service requirements – mostly TEP, but…

Languages – no significant changes in composition

Volumes – growing

Business Model – same direction, but…


Page 6:

The focus is increasingly on…

Faster – Better – More Economical

Translation Automation


Page 7:

Challenge - Can we implement it as a business model?

Yes, if we consider 3 factors and calculate accordingly:

• Translation velocity

• Pricing

• Quality (TEP)

Page 8:

Business Model Analysis – Timeframe/velocity

(based on a 25,000 word project)

TEP workflow: Translation (10 days), Editing (3 days), Proofing (2 days)

MT workflow: MT (1 day), Post-editing (5 days), Proofing (2 days), Free (7 days)

46% saving in time

[Chart axes: Cost, Timeframe]
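The time saving quoted on this slide can be checked with a few lines of arithmetic. The sketch below is only an illustration: the day counts come from the slide, while the names `tep_workflow`, `mt_workflow`, `total_days` and `time_saving` are made up for this example and are not part of the presentation.

```python
# Durations in days, as shown on the slide for a 25,000 word project.
# The dictionary and function names are illustrative assumptions.
tep_workflow = {"Translation": 10, "Editing": 3, "Proofing": 2}
mt_workflow = {"MT": 1, "Post-editing": 5, "Proofing": 2}


def total_days(workflow):
    """Sum the duration of every step in a workflow."""
    return sum(workflow.values())


def time_saving(baseline, alternative):
    """Return (days saved, fraction of the baseline saved)."""
    base = total_days(baseline)
    alt = total_days(alternative)
    days_saved = base - alt
    return days_saved, days_saved / base


days_saved, fraction = time_saving(tep_workflow, mt_workflow)
print(f"TEP: {total_days(tep_workflow)} days, MT: {total_days(mt_workflow)} days")
print(f"Saved: {days_saved} days ({fraction:.1%} of the TEP timeframe)")
```

With the slide's figures this gives 15 days against 8 days, so 7 days are freed, which is 7/15 of the original timeframe; the slide's 46% appears to be that ratio rounded down.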

Page 9:

Business Model Analysis – Pricing

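[Chart: cost breakdown of the two workflows]

Segments, TEP workflow: Margin, Proofing, Editing, Translation

Segments, MT workflow: Margin, Ling. QA, Post-editing incl. fuzzies, Machine trans.

Percentages shown (in extraction order): 40-50%, 45-60%, 5%, 30-35%, 5-15%, 25-35%, 5%, 20%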

Page 10:

Business Model Analysis – Pricing (Rewarding the post-editor)

               TR       PE       TR/PE%

Daily output   2,500    5,000

Unit paid      1.00     0.60     -40%

Total paid     2,500    3,000    +20%
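The figures on this slide follow from multiplying daily output by the unit rate. The sketch below reproduces that calculation under two assumptions: "Unit paid" is read as a relative per-word rate (translation = 1.00), and daily output is counted in words. The function names `daily_pay` and `relative_change` are illustrative only.

```python
# Figures from the slide. Assumption: "Unit paid" is a relative per-word
# rate with the translation rate set to 1.00, and output is in words per day.
TR_DAILY_OUTPUT = 2500   # translator (TR)
PE_DAILY_OUTPUT = 5000   # post-editor (PE)
TR_UNIT_RATE = 1.00
PE_UNIT_RATE = 0.60


def daily_pay(words_per_day, unit_rate):
    """Total paid per day = daily output x unit rate."""
    return words_per_day * unit_rate


def relative_change(old, new):
    """Change from old to new, as a fraction of old."""
    return (new - old) / old


tr_total = daily_pay(TR_DAILY_OUTPUT, TR_UNIT_RATE)
pe_total = daily_pay(PE_DAILY_OUTPUT, PE_UNIT_RATE)

print(f"Unit rate change:  {relative_change(TR_UNIT_RATE, PE_UNIT_RATE):+.0%}")
print(f"Total paid per day: TR {tr_total:,.0f}, PE {pe_total:,.0f}")
print(f"Daily pay change:  {relative_change(tr_total, pe_total):+.0%}")
```

Under these assumptions the post-editor is paid 40% less per unit but, at double the daily output, earns 3,000 units a day against the translator's 2,500, which is the 20% increase shown on the slide.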

Page 11:

Successes

• Selection of the MT environment and collection of the domain-specific training data

• Training strategy for would-be post-editors

• Implementing MT as a business model

Page 12:

Challenges

• Retraining the MT engine
  – Collecting feedback from post-editors
  – Handling lower quality data for the retraining

• Engaging post-editors
  – Linguists vs. domain-specific experts
  – Cloud/crowd

• Business model
  – Pricing
  – Velocity

• New languages

Page 13:

Thank you for your attention!

For further information and questions please feel free to contact:

Mr. Sándor Sojnóczky ([email protected])

Web: www.hunnect.hu

Web: www.hunnectacademy.com