![Page 1: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/1.jpg)
speech in, speech out
![Page 2: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/2.jpg)
24 listopad 2006 WS0607 – elevator 2/15
Nuance server
compiled recognition grammar, master language package, licence manager
Nuance client
speech-in components
![Page 3: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/3.jpg)
24 listopad 2006 WS0607 – elevator 3/15
anticipate user’s responses
what pieces of information are needed to complete the dialog?
in what order will they be requested?
one piece of information at a time in particular order (directed dialog), several pieces at once, in any order, and prompt for missing items (mixedinitiative)?
recognition grammar
![Page 4: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/4.jpg)
24 listopad 2006 WS0607 – elevator 4/15
syntax
Nuance: Grammar Specification Language (GSL)
Diamant: Speech Recognition Grammar Format (SRGF)
recognition grammar
![Page 5: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/5.jpg)
24 listopad 2006 WS0607 – elevator 5/15
GSL grammar: doc in a file with .grammar extension; e.g. mygram.grammar (mygram will be the resulting package name)
contents: GrammarRuleName GrammarDescription
GrammarRuleName: at least one uppercase character
GrammarDescription: sequence of words, grammar names, and operators that define a set
of recognizable word sequences words (terminals) in lower-case operators:
recognition grammar
() concat (A B C ... Y) A and B and ...
[ ] disjunction [A B C ... Y ] either A or B or...
? optional ?Y Y is optional
+ positive closure +Y at least one Y
* Kleene star *Y zero or more Y
![Page 6: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/6.jpg)
24 listopad 2006 WS0607 – elevator 6/15
GSL grammar: example expressions
[morning afternoon evening]
“morning”, “afternoon”, “evening”
(good [morning afternoon evening])
“good morning”, “good afternoon”, “good evening”
(?good [morning afternoon evening])
“good morning”, “good afternoon”, “good evening”, “morning”, “afternoon”, “evening”
(thanks +very much)
“thanks very much”, “thanks very very much”, ...
(thanks *very much)
“thanks much”, “thanks very much”, “thanks very very much”, ...
recognition grammar
![Page 7: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/7.jpg)
24 listopad 2006 WS0607 – elevator 7/15
example GSL grammar
.grammar file
.slot_definitions file
.GO_FLOOR [ FLOOR:f (?the FLOOR:f floor) (?the FLOOR:f please) (?Filler ?the FLOOR:f floor ?please)] {<floor $f>}
Filler [ (i would like to go to) (i want to go to) (uh)]
FLOOR [ first {return("1")} second {return("2")} third {return("3")} fourth {return("4")}]
recognition grammar
floor
![Page 8: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/8.jpg)
24 listopad 2006 WS0607 – elevator 8/15
another option: SRGF and export as Nuance GSL
GrammarTest.bat
recognition grammar
![Page 9: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/9.jpg)
24 listopad 2006 WS0607 – elevator 9/15
compiling the package (compile-package.bat)
set PKGHOME = path to your gsl file (w/o extension)
nuance-compile %PKGHOME% English.America.1.3.0
recognition grammar
master recognition package
![Page 10: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/10.jpg)
24 listopad 2006 WS0607 – elevator 10/15
testing the grammar (text)
parse-tool -package path_to_your_model
nl-tool –package path_to_your_model –grammar grammar_in_your_model
recognition grammar
![Page 11: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/11.jpg)
24 listopad 2006 WS0607 – elevator 11/15
running Nuance:
licence manager: lm.bat
recognition server: rs.bat
set PKGHOME = path to your compiled model
recserver -package %PKGHOME% lm.Addresses=localhost config. ...
testing the grammar (speech)
xapp -package path to your compiled model lm.Addresses=localhost
speech recognition
![Page 12: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/12.jpg)
24 listopad 2006 WS0607 – elevator 12/15
running nuance client
edit Diamant config file: Clients.ini
NuanceClient.bat
(btw, have the licence manager and the server running too... duh!...)
Diamant with speech-in
![Page 13: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/13.jpg)
24 listopad 2006 WS0607 – elevator 13/15
adding speech-in
add device as usual
activate recognition: output <string> „start” (start command) to nuance client
read (speech) input from nuance client into variable as usual
access recognition confidence (of type Real) like this: var#confidence
Diamant with speech-in
![Page 14: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/14.jpg)
24 listopad 2006 WS0607 – elevator 14/15
Mary server
online at DFKI...
Mary client
MaryClient.bat
speech-out components
![Page 15: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/15.jpg)
24 listopad 2006 WS0607 – elevator 15/15
Diamant with speech-out
adding speech-out
add device as usual
optionally, set format: {format = <string>} (default plain text) and voice{voice = <string>}
in output node, output <string> to Mary client as usual
![Page 16: Speech in, speech out. 24 listopad 2006WS0607 – elevator2/15 Nuance server compiled recognition grammar, master language package, licence manager Nuance](https://reader036.vdocument.in/reader036/viewer/2022082713/5697bf9f1a28abf838c94e61/html5/thumbnails/16.jpg)
24 listopad 2006 WS0607 – elevator 16/15
speech-enabled dialogs
recognition tends to be imperfect...
if recognition confidence low, then, for example (btw, think: grounding):
repeat question
ask for confirmation („did you say blah?”)
inform user what they can say („you can say blah, bloo, and blee, please tryagain”)
but... don’t let user get stuck in endless clarification dialog either!