wh2014 session: vocal-diary a voice command based ground truth collection system for activity...

23
WLSA CONVERGENCE SUMMIT VOCAL-DIARY : A VOICE COMMAND BASED GROUND TRUTH COLLECTION SYSTEM FOR ACTIVITY RECOGNITION ENAMUL HOQUE

Upload: wireless-life-science-alliance

Post on 19-Jun-2015

86 views

Category:

Healthcare


1 download

DESCRIPTION

Wireless Health 2014 Conference Technical Session 4 featuring speaker Enamul Hoque.

TRANSCRIPT

Page 1: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

WLSACONVERGENCE SUMMIT

VOCAL-DIARY : A VOICE COMMAND BASED GROUND TRUTH COLLECTION SYSTEM FOR ACTIVITY RECOGNITION

ENAMUL HOQUE

Page 2: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

UVA Center for Wireless Health 2

Vocal-Diary : A Voice Command based Ground Truth Collection System for Activity Recognition

Enamul HoqueRobert Dickerson

John Stankovic

Page 3: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

3

Motivation• Learning regular

behavior very important for most home healthcare applications

• The underlying activity recognition system requires ground truth for training

Page 4: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

4

Motivation

• Each research group develops their own ground truth collection system

• To facilitate future home healthcare research, we need a ground truth collection system that is:– Easy to install and use– Accurate– Reusable by other research groups for new studies

Page 5: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

5

Existing Systems

• Not easy to use

Camera Real-time User Logging Daily Journal

Page 6: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

6

Motivation

• Interaction with devices by voice becoming common

Page 7: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

7

Challenges

• Residents may forget to log activities• Residents may forget to turn on microphone• Muti-resident homes• Ambient noise in homes• Privacy

Page 8: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

8

Contributions

• Design, implementation and evaluation of Vocal-Diary, a privacy-aware, robust voice command based ground truth collection system for in-home activities

Easy-to-Use

Robust

Privacy-aware

Two-way acknowledgement

Speaker Recognition

Querying Residents(based on sensors)

Feat

ures

Nov

eltie

s

Publicly Available

Page 9: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

9

System Description

Listen Voice Command

System ‘A’ Start / End

• Only listens to voice commands of specific format

Page 10: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

10

System DescriptionListen Voice Command

Command DetectedSystem ‘A’ Start / End

Recognize Speaker

• Filters out noise and other speakers

Page 11: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

11

System DescriptionListen Voice Command

Command DetectedSystem ‘A’ Start / End

Recognize Speaker

Playback Command

Are You Starting / Ending ‘A’?

Speaker Matched

Wait for Acknowledgment

• Corrects confusion among commands & filters out other conversations recognized as commands

Page 12: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

12

System DescriptionListen Voice Command

Command DetectedSystem ‘A’ Start / End

Recognize Speaker

Playback Command

Are You Starting / Ending ‘A’?

Speaker Matched

Wait for Acknowledgment

Recognize Speaker

Ack. Received

System Yes / No

Page 13: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

13

System DescriptionListen Voice Command

Command DetectedSystem ‘A’ Start / End

Recognize Speaker

Playback Command

Are You Starting / Ending ‘A’?

Speaker Matched

Wait for Acknowledgment

Recognize Speaker

Ack. Received

System Yes / No

Log Activity DetailsSpeakerMatched

Page 14: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

14

System DescriptionListen Voice Command

Playback Command

Wait for Acknowledgment

Recognize SpeakerLog Activity Details

Command Detected

SpeakerMatched

Ack. Received

System ‘A’ Start / End

System Yes / No

Are You Starting / Ending ‘A’?

Recognize Speaker

Speaker Matched

Page 15: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

15

System Description

• Robust– Two-way acknowledgement and speaker recognition make Vocal-Diary robust

• Privacy aware– Only listens to voice commands in a specific format– No raw audio file containing residents’ voice is saved– Only start and end times of each activity are saved

• Ease of Use– Residents do not need to carry any microphone– No need to start the microphone before talking

Page 16: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

16

Evaluation

• Data collected for 1 month each from 3 homes– Two single-resident homes– One double-resident home

• Evaluation Metrics

Page 17: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

17

Evaluation (Single Resident Home 1)

• Speaker Recognition & Two-way acknowledgement are necessary for robustness

Slee

p

Brea

kfas

t

Din

ner

Lunc

h

Prep

are.

..

Cook

Snac

k

Dish

was

h

Toile

t

Show

er TV

Lapt

op Exit

0102030405060708090

100

Without Two-way Ack. & Speaker Recognition (SAPI)With Speaker Recognition OnlyWith Two-Way Ack. & Speaker Recognition (Vocal-Diary)

Activity

Prec

isio

n (%

)

Page 18: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

18

Evaluation (Summary of Precisions)

1 2 30

10

20

30

40

50

60

70

80

90

100

Without Two-Way Ack. & Speaker Recognition (SAPI)

With Speaker Recognition Only

With Two-Way Ack. & Speaker Recognition (Vocal-Diary)

Home ID

Prec

isio

n (%

)

• Ambient noise introduces significant number of false positives

Page 19: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

19

Evaluation (Summary of Recalls)

• If the resident gives a voice command, Vocal-Diary always detects it

1 2 380828486889092949698

100

Without Two-Way Ack. & Speaker Recognition (SAPI)

With Speaker Recognition Only

With Two-Way Ack. & Speaker Recogni-tion (Vocal-Diary)

Home ID

Reca

ll (%

)

Page 20: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

20

Evaluation (Feasibility of Voice Commands)

• Vocal-Diary was deployed for 3 months in a home instrumented with different sensors

• The goal was to evaluate how many times the resident forgot to log activities using voice commands

• Ground truth for ground truth was collected by offline inference based on sensor firings

• 992 total activity instances, 59 not logged (6%)

Page 21: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

21

Evaluation (Effectiveness of Querying Residents)

• Vocal-Diary was deployed for 15 days in a home instrumented with different sensors

• Controlled experiments• Number of times the resident did not use voice

commands: 25• Vocal-Diary queried in all 25 instances• Number of false queries: 14• Number of false queries if motion sensors are

ignored: 6

Page 22: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

22

Conclusion

• To use Vocal-Diary, contact Professor John Stankovic ([email protected])

Easy-to-Use

Robust

Privacy-aware

Two-way acknowledgement

Speaker Recognition

Querying Residents(based on sensors)

Feat

ures

Nov

eltie

s

Publicly Available

Page 23: WH2014 Session: Vocal-diary  a voice command based ground truth collection system for activity recognition

WLSACONVERGENCE SUMMIT

www.wirelesshealth2014.org