how machine translation works

10
HOW MACHINE TRANSLATION WORKS? @learn.machinelearning hello Bonjour

Upload: others

Post on 23-Jun-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: How Machine Translation Works

[email protected]

01

hello

Bonjour

one language to another

Page 2: How Machine Translation Works

Machine translation (MT) is automated

translation. It is the process by which

computer software is used to translate a text

from one natural language (such as English) to

another (such as French).

WHAT IS MACHINE TRANSLATION?

learn.machinelearning02

Hello family, hope all are good. stay home and stay safe.

Bonjour la famille, j'espère quetout va bien. rester à la maison et

rester en sécurité

English

French

Page 3: How Machine Translation Works

RULE - BASED TRANSLATION?Classical machine translation methods often

involve rules. The rules are often developed

by linguists and may operate at the lexical,

syntactic, or semantic level.

The key limitations of the classical machine

translation approaches are both the

expertise required to develop the rules, and

the vast number of rules and exceptions

required.

learn.machinelearning03

source - javatpoint

Page 4: How Machine Translation Works

It uses of statistical models that learn to

translate text from a source language to

a target language given a large corpus

of examples.

Limitation of statistical approaches is, it

requires careful tuning of each module

in the translation pipeline.

STATISTICS - BASED TRANSLATION?

learn.machinelearning04

split thesentence

into chunksInput

Output

Find all possibletranslations for

each chunk

Generate all possiblesentences and findthe most likely one

based on the corpus.

Page 5: How Machine Translation Works

It uses of neural networks especially Recurrent

neural networks which are organized into an

encoder-decoder architecture that allow for

variable length input and output sequences.

The benefit is that a single system can be trained

directly on source and target text, no longer

requiring the pipeline of specialized systems

used in statistical machine learning.

NEURAL MACHINE TRANSLATION?

RecurrentNeural

Networks

(LSTM,GRU, .....)

learn.machinelearning05

Input Output

saves the current model'sstate and use that as oneinput of our next calculation

memory of previouswords influence nextwords

Page 6: How Machine Translation Works

Although effective, the Encoder-Decoder

architecture has problems with long sequences

of text to be translated.

The problem stems from the fixed-length internal

representation that must be used to decode each

word in the output sequence.

ATTENTION BASED TRANSLATION?

RNN+

Attention 

Transformer+

Attention

learn.machinelearning06

Input Outputor

Page 7: How Machine Translation Works

The solution is the use of an attention

mechanism that allows the model to learn where

to place attention on the input sequence as each

word of the output sequence is decoded.

The encoder-decoder recurrent neural network

architecture with attention is currently the state-

of-the-art on some benchmark problems for

machine translation.

ATTENTION BASED TRANSLATION?

learn.machinelearning07

Page 8: How Machine Translation Works

OPEN SOURCE IMPLEMENTATIONS

learn.machinelearning08

Google-GNMT (Tensorflow)

Facebook-fairseq (Torch)

Amazon-Sockeye (MXNet)

NEMATUS (Theano)

THUMT (Theano)

OpenNMT (PyTorch)

StanfordNMT (Matlab)

DyNet-lamtram(CMU)

EUREKA(MangoNMT)

Many more.....

Page 9: How Machine Translation Works

APPLICATIONS OF MACHINE TRANSLATION

learn.machinelearning09

Text-to-text

Text-to-speech

Speech-to-text

Speech-to-speech

Image (of words)-to-text

Major domains like Government, Software &

technology, Military & defence, Healthcare,

Finance, Legal, E-discovery amd Ecommerce