i used to be a translator, now i run machine translation - taus #locworldlondon 2013

15
PangeaMT Manuel Herranz I used to be a translator, Now I run Machine Translation #manuelhrrnz #pangeanic E: [email protected] pangeanic

Upload: manuel-herranz

Post on 24-Dec-2014

7.474 views

Category:

Technology


5 download

DESCRIPTION

In this presentation, I describe briefly how Pangea Technologies have enabled translation practitioners to empower themselves and be active in machine translation rather than just be passive users / post-editors. Pangea allows users to manage and utilize their TMs as an engine-training tool, customizing their translation engines even with small data sets. Translation practitioners include corporations, LSPs and even savvy translators. All of them have been able to create, customize, run and update their engines since 2011. This not only breaks down entry levels, but makes the technology a productivity enhancer, a "must have" just like terminology tools.

TRANSCRIPT

  • 1. PangeaMT Manuel Herranz I used to be a translator, Now I run Machine Translation #manuelhrrnz #pangeanic E: [email protected]

2. The Data Deluge As of May 2009: 487 Billion gigabytes or 1,000,000,000 * 487,000,000,000 = 4,87 x 1020 Estimates Up 50% a year (Oracle) Doubles every 11 hours (IBM) Language translation as a job becoming unmanageable. Increasing demands, increasing volumes, shorter deadlines. Human production is not sufficient (why is TR so labour intensive? Film production, publishing, XF are highly computarised!). As Content Volume Explodes, Machine Translation Becomes an Inevitable Part of Global Content Strategy http://ow.ly/jVuhZ 3. 2007 a EU post-editing contract won by Pangeanic triggers in-house translators thinking - Is this the best technology can do? - Why is the result so unpredictable? - Why is it so BAD? - Can we do something to improve? 4. 2007 a EU post-editing contract won by Pangeanic triggers in-house translators thinking - Is this the best technology can do? - Why is the result so unpredictable? - Why is it so BAD? - Can we do something to improve? Since 2011: Pangeanic DIY SMT http://ow.ly/lQXSv Test, feedback, install, iteration euromatrixplus. net/moses- decoder/ 5. Machine Translation application, usage and success depend on MT for assimilation: gisting or understanding Sports Politics Social etc Output format Practically unlimited demand; but free web-based services reduce incentive to improve technology Coverage + important. Instant quality MT for dissemination: publication MT for direct communication Output format Sports Politics Social etc Publishable quality that can only be achieved by humans. MT & tools a productivity booster Output format Output format Sports Politics Social etc Current R&D, Military uses systems for spoken MT, first applications for smartphones 6. - Translator resistance: depending on language: French, Japanese the most reluctant - Translation industry has lagged behind in many areas, even in ISO or technology adoption - Free web-based MT services double-edged sword: familiarize users, but unrealistic expectations Miguel ngel Daz, Chairman of the Spanish Association for Emotional Intelligence, June 2013: It makes a lot of sense to talk about emotional intelligence in education since the current system prepares the new generations for a world which is not going to exist. The job market has changed very much in the last 20 or 30 years however the educational model has not. 7. Unrest is continuing in Cairo as protesters set up their demand for Egypts military rulers to resign + specific language rules + job or client glossary + hybrid technologies 8. Translator resistance: - I DONT LIKE IT - IT TAKES ME MORE TIME THAN DOING IT THE NORMAL WAY - NO CONTROL - MANAGEMENT-IMPOSED - THREAT TO OWN JOB - DE-HUMANIZATION Management view: - COST-EFFICIENCY - TIGHTER BUDGETS, MORE MONEY FOR VALUE, LOWER COSTS - COMPETITION ALREADY AHEAD - TECHNOLOGY CAN BE APPLIED ELSEWHERE, NOT JUST DOCUMENTATION (REVENUE OPPORTUNITY) 9. Translator resistance: A MOANING PROFESSION? 10. Domain Selection 11. TM Upload 12. TM Cleaning 13. TM Upload 14. Myth: MT will never be as good as humans We cannot solve the problem using the same tools and the way of thinking that created it A. Einstein uhmmm, it is going to get really good... 2nd stage PE material and more data make engines even more predictable. More specialist engines 3rd stage Beyond 2030... no predictions 1st stage We are creating usable engines, first PE experiences 2009-2015 or 2020 15. My translation company.com Franoise Moreau [email protected] 87 Rue Legendre, 75017 Paris Tel +33 1 42 25 55 77 Translation Project ManagerMT Customization Specialist