web engineering meets natural language processing: a vocal interface generation practice

Post on 13-Jan-2016

18 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

Web engineering meets natural language processing: a vocal interface generation practice. 2006/12/11 江文成. Reference. Hendrik Macedo, Jacques Robin, Roberto Barros Proceedings of the 11th Brazilian Symposium on Multimedia and the web WebMedia '05 ACM , December, 2005. Introduction. Overview - PowerPoint PPT Presentation

TRANSCRIPT

Web engineering meets natural language processing: a vocal interface generation practice

2006/12/11江文成

Reference

Hendrik Macedo, Jacques Robin, Roberto Barros Proceedings of the 11th Brazilian Symposium on Mult

imedia and the web WebMedia '05 ACM , December, 2005

Introduction

Overview VoiceXML Voice Portals

● TTS

● ASR Voice-Driven interface

● PBTG mediator

● MPC

● COLEC Related work Conclusions

Overview VoiceXML Voice Portals

● TTS

● ASR Voice-Driven interface

● PBTG mediator

● MPC

● COLEC Related work Conclusions

Overview

PSTN ︰公共交換電話網

Overview VoiceXML Voice Portals

● TTS

● ASR Voice-Driven interface

● PBTG mediator

● MPC

● COLEC Related work Conclusions

VoiceXML

1999 年 3 月,由 Motorola 、 Lucent 、 AT&T 和 IBM四家公司聯合發起成立了 VoiceXML論 (http://www.voicexml.org)

可擴展標記語言 (XML) 的一種擴展 為電話和移動設備提供一種便捷的訪問 Internet 網路,

獲取服務和資訊的手段。

VoiceXML(example)

Overview VoiceXML Voice Portals

● TTS

● ASR Voice-Driven interface

● PBTG mediator

● MPC

● COLEC Related work Conclusions

Voice Portals

TTS

■ Text-To-Speech: 文字轉語音 ASR

■ Automatic Speech Recognition: 自動語音識別

Voice Portal application architecture

Voice Portals

Overview VoiceXML Voice Portals

● TTS

● ASR Voice-Driven interface

● PBTG mediator

● MPC

● COLEC Related work Conclusions

Voice-Driven interface

PBTG mediator

■ a artificial intelligence system ■ To connect a Web application and a Voice Portal ■ consist of a MPC and a COLEC

MPC ■ Message Production Component

COLEC ■ Content Organization and Linquistic Expression Component ■ COLEC pipeline (10 stages)

Overview VoiceXML Voice Portals

● TTS

● ASR Voice-Driven interface

● PBTG mediator

● MPC

● COLEC Related work Conclusions

PBTG mediator

PBTG mediator

MPC

Recommendation Message

■ “The system recommends Matrix Reloaded to

you?” Feature Message

■ Matrix Reloaded stars Keanu Reeves?

■ Matrix Reloaded is a science-fiction moive? Feature Asking Message

■ Would you like more information about this movie?

COLEC ( pipeline )

COLEC ( pipeline ) - CLTT

CoganateLexicalizedThematicTree(CLTT) Example: Matrix Reloaded is a science-fiction movie?

COLEC ( pipeline ) - FLST

FullyLexicalizedSyntacticTree(FLST) Input source : form CLTT ( output of the last step)

Matrix Reloaded is a science-fiction movie?

Overview VoiceXML Voice Portals

● TTS

● ASR Voice-Driven interface

● PBTG mediator

● MPC

● COLEC Related work Conclusions

Related Work

DCIE

■ proxy-based interactive service

■ browse dynamically generated audio renditions of

both e-mail and WWW documents

■ In April,1997 WIRE voice browser

■ for car radio

■ to access e-mail and WWW

■ in Octobeer ,1998

Overview VoiceXML Voice Portals

● TTS

● ASR Voice-Driven interface

● PBTG mediator

● MPC

● COLEC Related work Conclusions

Conclusions

Thanks!!

top related