mul(lingualweb/lt$ execu(ve$summary$ - w3

17
The Mul(lingualWebLT Working Group receives funding by the European Commission (project name LTWeb) through the Seventh Framework Programme (FP7) in the area of Language Technologies. Grant Agreement No. 287815. Mul(lingualWebLT Execu(ve Summary Felix Sasaki DFKI / W3C Fellow

Upload: others

Post on 06-May-2022

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Mul(lingualWeb-­‐LT  Execu(ve  Summary  

Felix  Sasaki  DFKI  /  W3C  Fellow  

Page 2: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Project  goals  •  Provide  reference  implementa(ons  of  metadata  for  mul(lingual  processes  – Content  crea(on,  (human  or  machine)  transla(on,  localiza(on  workflows,  ...  

•  Define  a  metadata  standard  based  on  implementa(ons  and  exis(ng  work  – From  Interna(onaliza(on  Tag  Set  (ITS)  1.0  >  ITS  2.0  

•  Con(nue  and  enlarge  a  community  around  the  Mul(lingualWeb  

Page 3: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Groups  involved  

MLW-­‐LT  consor(um  (Reference  Implementa(ons)  

W3C  MLW-­‐LT  Working  Group  

Members  (Standardiza(on)  

MLW  PC  members  (Community  building)  

Page 4: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Requirements  Gathering  •  Workshop  June  2012,  Dublin  – 71  a^endees  –   New  stakeholders:  linked  open  data  community  –   New  implementers:  Adobe,  ]init[,  Logrus,  Tilde  

•  Requirements  gathering  document  – W3C  public  working  drab  – Wiki  version  21.000+  access  

Page 5: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Standardiza(on  Process  ...  •  ITS  2.0  drab  development  June  –  December  2012  – 40+  individuals  par(cipa(ng  – 2100+  emails,  aggressive  standardiza(on  progress  – Engaging  “invited  experts”  and  further  par(cipants,  including  higher-­‐level  decision  makers:  

 Adobe,  CNR,  DERI,  Ecole  Mohammadia  

d'Ingenieurs  Rabat,  ]init[,  Logrus,  NCSR,  Opera,  SAP,  Tilde  

Page 6: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

...  driven  by  implementa(ons  •  Test  suite  development  star(ng  August  2012,  driven  by  TCD  –  Input:  Files  with  ITS  2.0  metadata  – Output:  metadata  overview  –  Current  state:  223  input  files,  839  implementer  output  files,  80%  coverage  

<!DOCTYPE  html>  ...        <p>Everything  started  when  Zebulon  discovered  that  he  had  a  <span  translate="NO">doppelgänger</span>  ...  </html>  

...  /html/body[1]/p[1]  translate="yes"  /html/body[1]/p[1]/span[1]  translate="no"  ...  

Page 7: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

“Metadata  for  the  Mul(lingual  Web”  •  Summarizing  usage  scenarios  and  implementa(ons  

•  Aligned  with  implementa(on  development  

Page 8: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Usage  scenarios  and  implementa(on  highlights  

•  XLIFF  transla(on  package  crea(on  driven  by  ITS  2.0  metadata  

•  Quality  check  driven  by  metadata  constraints  •  Installa(on  of  workflow  from  CMS  to  TMS  system  •  CMS  implementa(on  of  metadata  authoring  support  

Page 9: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Usage  scenarios  and  implementa(on  highlights  

•  Text-­‐processing  component  interconnected  with  Drupal  

•  Cocomore  –  Linguaserve:  showcase  “localiza(on  workflow  with  VDMA”  

•  Linguaserve:  “real  (me  MT  with  Spanish  Tax  Agency”  

•  Volunteer  implementer  Shaun  McCance  –  ITS  Tool:  XML  to  PO  and  back  

Page 10: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Not  covered  during  this  review  •  Valida(on  of  HTML5+ITS  (UEP)  – Available  at  h^p://validator.nu/    – Staged  for  integra(on  in  W3C  validator  

•  ITS  Libre  Office  Writer  Extension  -­‐  ]init[  •  ITS  2.0  Enriched  Terminology  Annota(on  –  Tilde  •  Visual  designs  to  render  "ITS  for  HTML5”  –  Logrus  •  Localisa(on  Workflows  Using  ITS  2.0  with  Adobe  CQ  and  Apache  JackRabbit  –  Adobe    

Page 11: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Deliverables  for  year  one  •  D1.1  Detailed  Overall  Management  and  Bodies  Management,  including  the  Quality  Assurance  Plan  

•  D1.2.1  Report  on  Internal  and  External  Communica(on  Tools  

•  D1.2.2  LT-­‐Web  -­‐  W3C  Coordina(on  Yearly  Report  •  D1.2.3  Contact  Database  •  D2.1  Requirements  and  Use  Case  Document  •  D2.2  LT-­‐Web  Metadata  Drab  Documents  

Page 12: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Deliverables  for  year  one  •  D4.1.1  Lucy  Modifica(on  •  D4.1.2  MaTrEx  Modifica(on  •  D4.1.3  Linguaserve  Online  System  Modifica(on  •  D4.1.4  Report  on  Modifica(ons  in  MT  Systems  •  D6.1.1  Workshop  1  •  D6.1.2  Summary  Report  1  

Page 13: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

WP5  “Deep  Web  Informa(on  and    MT  Training”  

•  Deliverables  – D5.1.1  MT  Training  Module  – D5.1.2  XLIFF  Deep  Web  MT  Training  Exporter  – D5.2  Metadata-­‐Aware  MT  Training  

•  Delivery  date  will  be  delayed  to  be  able  to  benefit  from  Cocomore  training  data  

•  Overall  WP  will  be  in  (me  (conclusion  by  M21)  

Page 14: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Communica(on  •  W3C  infrastructure  +  telephone  conference  tool  – Mailing  lists  –  IRC  – Ac(on  /  issue  tracker  –  ...  see  D1.2.1  

•  Separate  channels  for  – Working  Group  (standardiza(on)  – Workshop  planning  (MLW  PC)  – Public  

Page 15: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Management  

MLW-­‐LT  consor(um  (Reference  Implementa(ons)  

W3C  MLW-­‐LT  Working  Group  

Members  (Standardiza(on)  

MLW  PC  members  (Community  building)  Communica(on  

infrastructure  

Page 16: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Conclusion  •  Community  building  via  suppor(ng  ...  – Reference  implementa(on  – Standardiza(on  – Outreach  

•  ...  pays  of!  •  Similar  projects  could  be  useful  in  the  future  

Page 17: Mul(lingualWeb/LT$ Execu(ve$Summary$ - W3

The  Mul(lingualWeb-­‐LT  Working  Group  receives  funding  by  the  European  Commission  (project  name  LT-­‐Web)  through  the  Seventh  Framework  Programme  (FP7)  in  the  area  of  Language  Technologies.  Grant  Agreement  No.  287815.  

Q/A