physically situated language interaction: an integrative ... · attention models engagement models...

Post on 08-Aug-2020

5 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Dan Bohus

Physically Situated Language Interaction:

an integrative-AI challenge

channel

signal

intention

conversation

understanding

planning

turn taking

engagement

communicative

competencies

channel

signal

intention

conversation

understanding

planning

turn taking

engagement

communicative

competenciessituational context

who: physical awarenessidentify, track, and characterize relevant

actors, objects, states and relationships

what: situation and activitysense and reason about relevant events

and activities of self and others

why: goals and intentionssense and reason about beliefs,

intentions, goals and long-term plans

situational context

who: physical awarenessidentify, track, and characterize relevant

actors, objects, states and relationships

what: situation and activitysense and reason about relevant events

and activities of self and others

why: goals and intentionssense and reason about beliefs,

intentions, goals and long-term plans

intention

conversation

understanding

planning

signalturn taking

channelengagement

communicative

competencies

situational context

attentionintention

conversation

understanding

planning

signalturn taking

channelengagement

communicative

competencies

Speaker:

Listener:

Speaker:

Listener:

Speaker:

Listener:

from Conducting Interaction: Achieving Mutual Orientation at Turn Beginning

Speaker:

Listener:

Listener:

Speaker:

from Conducting Interaction: Achieving Mutual Orientation at Turn Beginning

Infers attention on various targets

Relies on ML model for geometric visual attention

Leverage features from visual subsystems

Defined at phrase level

Specified at onset and production

Define expected targets

… … … Excuse me! … … … To get … … … To get to 3800

Phrase 1

Wait Interject Wait Speak 2w Wait Speak phrase

Phrase 2

go to […]

sensing and computation details

system incorrectly infers user is not attending and inappropriately triggers pauses, interjections and restarts

situational context

intention

conversation

understanding

planning

signalturn taking

channelengagement

communicative

competencies

attention

situational context

who: physical awarenessidentify, track, and characterize relevant

actors, objects, states and relationships

what: situation and activitysense and reason about relevant events

and activities of self and others

why: goals and intentionssense and reason about beliefs,

intentions, goals and long-term plans

intention

conversation

understanding

planning

signalturn taking

channelengagement

communicative

competencies

Microphone array capture

Sound source localization

Speech recognition

Language understanding

Infrared proximity sensors

Badge sensors

Face detection and tracking

Head-pose tracking

Facial feature tracking

Face identity recognition

Gender detection

Attention models

Engagement models

Turn-taking models

Behavioral control

Dialog management

Natural language generation

Speech synthesis

Avatar synthesis

Robot motion control

Floor-plan models

User models

complex (many components)programming models for coordinated computation; tools

complex (many components)programming models for coordinated computation; tools

Time and streams as 1st order citizens

double f; stream double f;

f=3; f=x*f-y;

Automatic persistence, historical access, sampling, transforms

act in real-time, under uncertaintyevolve programming languages? e.g. time & uncertainty

complex (many components)programming models for coordinated computation; tools

Uncertainty as 1st order citizen

double f; uncertain double f;

Representation, sampling, inference, belief updates

Microphone array capture

Sound source localization

Speech recognition

Language understanding

Infrared proximity sensors

Badge sensors

Face detection and tracking

Head-pose tracking

Facial feature tracking

Face identity recognition

Gender detection

Attention models

Engagement models

Turn-taking models

Behavioral control

Dialog management

Natural language generation

Speech synthesis

Avatar synthesis

Robot motion control

Floor-plan models

User models

act in real-time, under uncertaintyevolve programming languages? e.g. time & uncertainty

complex (many components)programming models for coordinated computation; tools

integration of human- and machine-authored components

Microphone array capture

Sound source localization

Speech recognition

Language understanding

Infrared proximity sensors

Badge sensors

Face detection and tracking

Head-pose tracking

Facial feature tracking

Face identity recognition

Gender detection

Attention models

Engagement models

Turn-taking models

Behavioral control

Dialog management

Natural language generation

Speech synthesis

Avatar synthesis

Robot motion control

Floor-plan models

User models

act in real-time, under uncertaintyevolve programming languages? e.g. time & uncertainty

complex (many components)programming models for coordinated computation; tools

integration of human- and machine-authored components

act in real-time, under uncertaintyevolve programming languages? e.g. time & uncertainty

complex (many components)programming models for coordinated computation; tools

integration of human- and machine-authored componentsengineering of integrated learning systems

Engineering of integrated learning systems

learning in connected systems (new frontiers for ML & software engineering?)

learning in interactive settings: online, lifelong vs. batch

act in real-time, under uncertaintyevolve programming languages? e.g. time & uncertainty

complex (many components)programming models for coordinated computation; tools

integration of human- and machine-authored componentsengineering of integrated learning systems

meta-reasoning & system-level (self)-optimization

Engineering of integrated learning systems

learning in connected systems (new frontiers for ML & software engineering?)

learning in interactive settings: online, lifelong vs. batch

Meta-reasoning and system-level (self)-optimization

self-monitoring and diagnosis / blame assignment

self-optimization

act in real-time, under uncertaintyevolve programming languages? e.g. time & uncertainty

complex (many components)programming models for coordinated computation; tools

integration of human- and machine-authored componentsengineering of integrated learning systems

meta-reasoning & system-level (self)-optimization

machinelearning

softwareengineering

programminglanguages

distributedsystems

?

top related