speech in, speech out

24 November 2006 WS0607 – elevator 2/15

Nuance server

compiled recognition grammar, master language package, licence manager

Nuance client

speech-in components

anticipate user’s responses

what pieces of information are needed to complete the dialog?

in what order will they be requested?

one piece of information at a time, in a fixed order (directed dialog), or several pieces at once, in any order, with prompts for missing items (mixed initiative)?
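The two dialog styles above can be contrasted in a minimal Python sketch. The slot names (`origin`, `destination`, `day`) and both function names are hypothetical and chosen only for illustration; they come from neither Nuance nor Diamant.

```python
from typing import Dict, List

# Hypothetical pieces of information the dialog needs, in request order.
SLOTS: List[str] = ["origin", "destination", "day"]


def next_prompts_directed(filled: Dict[str, str]) -> List[str]:
    """Directed dialog: ask for exactly one missing slot, in fixed order."""
    for slot in SLOTS:
        if slot not in filled:
            return [slot]
    return []  # nothing left to ask


def next_prompts_mixed(filled: Dict[str, str]) -> List[str]:
    """Mixed initiative: the user may fill any slots in any order;
    prompt for every slot that is still missing."""
    return [slot for slot in SLOTS if slot not in filled]
```

In both cases the dialog is complete when the returned list is empty; the difference is only in how many items the system is prepared to accept in a single turn.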

recognition grammar

syntax

Nuance: Grammar Specification Language (GSL)

Diamant: Speech Recognition Grammar Format (SRGF)

recognition grammar

GSL grammar: doc in a file with .grammar extension; e.g. mygram.grammar (mygram will be the resulting package name)

contents: GrammarRuleName GrammarDescription

GrammarRuleName: at least one uppercase character

GrammarDescription: sequence of words, grammar names, and operators that define a set of recognizable word sequences

words (terminals) in lower-case

operators:

recognition grammar

operator   name               usage            meaning
()         concatenation      (A B C ... Y)    A and B and ... Y, in that order
[ ]        disjunction        [A B C ... Y]    either A or B or ... Y
?          optional           ?Y               Y is optional
+          positive closure   +Y               at least one Y
*          Kleene star        *Y               zero or more Y

GSL grammar: example expressions

[morning afternoon evening]

“morning”, “afternoon”, “evening”

(good [morning afternoon evening])

“good morning”, “good afternoon”, “good evening”

(?good [morning afternoon evening])

“good morning”, “good afternoon”, “good evening”, “morning”, “afternoon”, “evening”

(thanks +very much)

“thanks very much”, “thanks very very much”, ...

(thanks *very much)

“thanks much”, “thanks very much”, “thanks very very much”, ...
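To check what an expression accepts, the five operators can be emulated with a small expander. This is a Python sketch, not part of the Nuance toolkit: `expand` and `max_repeat` are hypothetical names, it handles only words and the operators listed above (no grammar references or slot commands), and it cuts `+` and `*` off at `max_repeat` repetitions so the output stays finite.

```python
import re
from itertools import product
from typing import List, Tuple

Phrase = Tuple[str, ...]
# One token is either a single operator/bracket character or a word.
TOKEN = re.compile(r"[()\[\]?+*]|[^\s()\[\]?+*]+")


def expand(expr: str, max_repeat: int = 2) -> List[str]:
    """List the word sequences accepted by a GSL-like expression."""
    tokens = TOKEN.findall(expr)
    pos = 0

    def concat(parts: List[List[Phrase]]) -> List[Phrase]:
        # Pick one phrase from each part and join them in order.
        return [sum(combo, ()) for combo in product(*parts)]

    def parse() -> List[Phrase]:
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        if tok == "(":  # concatenation
            parts = []
            while tokens[pos] != ")":
                parts.append(parse())
            pos += 1  # skip ")"
            return concat(parts)
        if tok == "[":  # disjunction
            alternatives: List[Phrase] = []
            while tokens[pos] != "]":
                alternatives.extend(parse())
            pos += 1  # skip "]"
            return alternatives
        if tok == "?":  # optional: empty phrase or the operand
            return [()] + parse()
        if tok in ("+", "*"):  # bounded repetition of the operand
            inner = parse()
            low = 1 if tok == "+" else 0
            out: List[Phrase] = []
            for n in range(low, max_repeat + 1):
                out.extend(concat([inner] * n))
            return out
        return [(tok,)]  # a plain word (terminal)

    return [" ".join(phrase) for phrase in parse()]


print(expand("(?good [morning afternoon evening])"))
print(expand("(thanks *very much)"))
```

With the default `max_repeat` of 2, the second call lists "thanks much", "thanks very much" and "thanks very very much"; the real grammar accepts any number of repetitions.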

recognition grammar

example GSL grammar

.grammar file

.slot_definitions file

.GO_FLOOR [
    FLOOR:f
    (?the FLOOR:f floor)
    (?the FLOOR:f please)
    (?Filler ?the FLOOR:f floor ?please)
] {<floor $f>}

Filler [
    (i would like to go to)
    (i want to go to)
    (uh)
]

FLOOR [
    first {return("1")}
    second {return("2")}
    third {return("3")}
    fourth {return("4")}
]
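To make the effect of the grammar above concrete, here is a rough Python emulation of what it accepts and what it puts in the `floor` slot. It is only a sketch built on regular expressions: the function name `interpret` is hypothetical, and a real recognizer works on audio and returns its result through the Nuance API rather than as a Python dict.

```python
import re
from typing import Dict, Optional

# FLOOR rule: spoken word -> returned value
FLOOR = {"first": "1", "second": "2", "third": "3", "fourth": "4"}
# Filler rule
FILLER = r"(?:i would like to go to|i want to go to|uh)"

W = "|".join(FLOOR)  # first|second|third|fourth

# One pattern per alternative of .GO_FLOOR, in the same order.
PATTERNS = [
    re.compile(rf"(?P<f>{W})"),
    re.compile(rf"(?:the )?(?P<f>{W}) floor"),
    re.compile(rf"(?:the )?(?P<f>{W}) please"),
    re.compile(rf"(?:{FILLER} )?(?:the )?(?P<f>{W}) floor(?: please)?"),
]


def interpret(utterance: str) -> Optional[Dict[str, str]]:
    """Return {'floor': value} if the utterance is in the grammar, else None."""
    text = " ".join(utterance.lower().split())  # normalize case and spacing
    for pattern in PATTERNS:
        match = pattern.fullmatch(text)
        if match:
            return {"floor": FLOOR[match.group("f")]}
    return None


print(interpret("i want to go to the third floor please"))
print(interpret("fifth floor"))
```

Note that the filler phrases are only allowed in the last alternative, which requires the word "floor", so "i want to go to third" is outside the grammar.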

recognition grammar

another option: write the grammar in SRGF and export it as Nuance GSL

GrammarTest.bat

recognition grammar

compiling the package (compile-package.bat)

set PKGHOME=path_to_your_gsl_file (without the extension; no spaces around "=")

nuance-compile %PKGHOME% English.America.1.3.0

recognition grammar

master recognition package

testing the grammar (text)

parse-tool -package path_to_your_model

nl-tool -package path_to_your_model -grammar grammar_in_your_model

recognition grammar

running Nuance:

licence manager: lm.bat

recognition server: rs.bat

set PKGHOME=path_to_your_compiled_model (no spaces around "=")

recserver -package %PKGHOME% lm.Addresses=localhost config. ...

testing the grammar (speech)

xapp -package path_to_your_compiled_model lm.Addresses=localhost

speech recognition

running nuance client

edit Diamant config file: Clients.ini

NuanceClient.bat

(btw, have the licence manager and the server running too... duh!...)

Diamant with speech-in

adding speech-in

add device as usual

activate recognition: output the <string> "start" (the start command) to the Nuance client

read (speech) input from nuance client into variable as usual

access recognition confidence (of type Real) like this: var#confidence

Diamant with speech-in

Mary server

online at DFKI...

Mary client

MaryClient.bat

speech-out components

Diamant with speech-out

adding speech-out

add device as usual

optionally, set format: {format = <string>} (default: plain text) and voice: {voice = <string>}

in output node, output <string> to Mary client as usual

speech-enabled dialogs

recognition tends to be imperfect...

if recognition confidence low, then, for example (btw, think: grounding):

repeat question

ask for confirmation (“did you say blah?”)

inform user what they can say (“you can say blah, bloo, and blee, please try again”)

but... don’t let user get stuck in endless clarification dialog either!
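The strategy above could be sketched as a small decision function. Everything here is an assumption for illustration: the thresholds 0.7 and 0.4, the cap of three attempts, the action names and the function name `choose_action` are all hypothetical. In Diamant the confidence value would come from var#confidence.

```python
def choose_action(confidence: float, failed_attempts: int,
                  accept_at: float = 0.7, confirm_at: float = 0.4,
                  max_attempts: int = 3) -> str:
    """Pick the next dialog move from the recognition confidence and
    the number of failed attempts so far (thresholds are assumptions)."""
    if confidence >= accept_at:
        return "accept"           # confident enough: use the result
    if failed_attempts >= max_attempts:
        return "give_up"          # avoid an endless clarification loop
    if confidence >= confirm_at:
        return "confirm"          # "did you say ...?"
    if failed_attempts == 0:
        return "repeat_question"  # first failure: just ask again
    return "list_options"         # later failures: say what can be said


print(choose_action(0.5, 1))
```

The cap on attempts is checked before any clarification move, so a user who is repeatedly misrecognized is handed to a fallback instead of being asked the same thing again.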
