hak voice-browser
DESCRIPTION
this is verry uss fullTRANSCRIPT
![Page 1: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/1.jpg)
WELCOME
![Page 2: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/2.jpg)
Presented By
Abdul hakeem.mvCPS5NO:2
RegNo :12130530
Voice Browser
SSMPTC Tirur
![Page 3: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/3.jpg)
What is a Voice Browser?
A voice browser is a device :
that interprets voice input and interprets voice markup languages to generate voice output.
that interprets a script which specifies exactly what to verbally present to the user as well as when to present each piece of information.
![Page 4: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/4.jpg)
An advantage to people with visual impairment
Mobile Web
Naturalistic dialogs with Web-based services
![Page 5: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/5.jpg)
MotivationThere are 10 times as many telephones as
connected PCs.
Cell phones usage is growing dramatically.
Speaking and listening are the natural usage modes for modes. Easy to use - for people with no knowledge or fear of computers.
Voice interaction can escape the physical limitations on keypads and displays as mobile devices become ever smaller
![Page 6: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/6.jpg)
OverviewTime frame: 1998 to ??Hands-free accessing of web.Pragmatic interface for functionally blind
users.
![Page 7: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/7.jpg)
Key Technologies
Speech Recognition
Speech Synthesis
![Page 8: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/8.jpg)
Speech Recognition
Voice input VoXML file Text
![Page 9: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/9.jpg)
Speech Synthesis
Text VoXML file Output(Pre-recorded)
![Page 10: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/10.jpg)
Standardization World Wide Web Consortium(W3C)
Voice Browser Working GroupSpeech Interface Framework
![Page 11: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/11.jpg)
W3C Voice Browser Working Group
Established on 26 March 1999.Re-chartered through 31 January 2009.W3C Team Contacts are Kazuyuki Ashimura
and Matt Womer.Co-chaired by Jim Larson and Scott
McGlashan .
![Page 12: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/12.jpg)
Speech Interface FrameworkVoiceXML 1.0VoiceXML 2.0VoiceXML 2.1Voice XML 3.0Speech Recognition Grammar Specification (SRGS) 1.0Speech Synthesis Markup Language (SSML) 1.0Speech Synthesis Markup Language (SSML) 1.1Call Control XML (CCXML)State Chart XML (SCXML)Semantic Interpretation (SISR) 1.0Pronunciation Lexicon Specification (PLS) 1.0
![Page 13: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/13.jpg)
Voice XML(VoXML)Version 1.0 - designed for creating audio dialogs
.Version 2.0 - uses form interpretation
algorithm(FIA).Version 2.1 - 8 additional elements.Version 3.0 - relationship between
semantics and (31 August 2010) syntax.
![Page 14: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/14.jpg)
What about HTML ?HTML don’t have
Tampered promptsGrammar specifying alternative words that the
user can speak in response to the question.Instructions to the text-to-speech synthesizer
about how to say words and phrases.
Adding these capabilities would complicate HTML,a language developed just for visual UI.
![Page 15: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/15.jpg)
![Page 16: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/16.jpg)
Speech Recognition Grammar Specification(SRGS)Version 1.0 -for specifying grammars of each
user input to a speech application.
![Page 17: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/17.jpg)
Speech Synthesis Markup Language(SSML)Version 1.0 -for specifying the rendering of
synthesized speech to the user.Version 1.1 - enhancement of SSML 1.0 for
better support of the world's languages including Asian, Eastern European, and Middle Eastern languages.
![Page 18: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/18.jpg)
Call Control XML(CCXML) For specifying call control functions
State Chart XML(SCXML) Execution environment based on CCXML
and Harel State Tables.
![Page 19: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/19.jpg)
Semantic Interpretation Speech Recognizer(SISR) Version 1.0 - For specifying possible
translation of text from the output of a speech recognizer.
Pronunciation Lexicon Specification (PIS)
Version 1.0 - Syntax for specifying pronunciation lexicons to be used by Speech Recognition and Speech Synthesis.
![Page 20: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/20.jpg)
![Page 21: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/21.jpg)
Model Architecture
![Page 22: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/22.jpg)
ApplicationsIt can be divided into three categories :
Web BrowsingLimited information AccessSpoken Dialog Systems
![Page 23: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/23.jpg)
Web BrowsingBrowse any web pages using speech input.Parsing for the purpose of voice recognition
done when the page is accessed.May or may not produce a voice feed back.
![Page 24: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/24.jpg)
Limited Information AccessUseful information in limited domains like
weather in a city, checking stock updates etc.Audio feed back
![Page 25: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/25.jpg)
Spoken Dialog SystemsClient-server architecture is usedUsed for connecting to a remote server by a
Java applet(client).Examples are connecting to email servers
![Page 26: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/26.jpg)
BenefitsVoice is a very natural user interface which
speeds up browsing.Less space requirements.Portable voice browsers can also be
implemented.Practical interface for functionally blind
users.Users can browse web while keeping there
hands and eyes for other jobs
![Page 27: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/27.jpg)
FutureVoice browsing will become visual(Multi-
modal)Can be integrated to an OSIntegrated to every application.
![Page 28: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/28.jpg)
ConclusionsBrowser technology is changing very fast
these days and we are moving from the visual paradigm to the voice paradigm.
Voice browser is the technology to enter this paradigm.
Voice browser is a device which interpret voice input and generate voice output.
![Page 29: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/29.jpg)
Referenceshttp://www.w3.org/standards/webofdevices/
voicehttp://xml.coverpages.org/ccxml.htmlhttp://reactos.ccp14.ac.uk/Voice/http://www.w3.org/Voice/1998/Workshop/
PhilJenkins.html (for IBM)
![Page 30: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/30.jpg)
THANK YOU
![Page 31: Hak voice-browser](https://reader033.vdocument.in/reader033/viewer/2022061508/558654ded8b42a221b8b45eb/html5/thumbnails/31.jpg)
?