academic excellence for business and the professions€¦ · 03/05/2013 27 [] m.asad, w.ikram...

33
Academic excellence for business and the professions Hand gesture recognition using Kinect Muhammad Asad

Upload: others

Post on 15-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Academic excellence for business and the professions

Hand gesture recognition using Kinect

Muhammad Asad

Page 2: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Table of Contents

1. Introduction to Kinect

I. What is Kinect?

II. How it works?

III. Why Kinect?

IV. Does Kinect has any limitations?

2. Hand Gesture recognition

I. Types of Hand gestures

II. Distance-Invariant Segmentation of Hand

III. Feature Extraction

IV. Neural Network and HMM training

3. Guidance system for visually impaired

I. Feature extraction

II. Feature Selection

III. Guidance decision

IV. Test sequences

03/05/2013 2

Page 3: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

What is Kinect?

• A sensor which is capable of providing

– Depth Image

– RGB(intensity) Image

– Audio from Multi-array microphone

• Real-time provision

• Low cost

• Operating range: 0.5m to 8m

• Originally developed for Microsoft Xbox 360 gaming console

• Now used in different computer vision research areas [1]

3 03/05/2013

[1] Z. Zhang, “Microsoft kinect sensor and its effect,” IEEE Multimedia, vol. 19, no. 2, pp. 4–10, 2012. Image taken from: http://en.wikipedia.org/wiki/Kinect

Figure 1. Kinect Sensor

Page 4: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

How it works? • Projection of pattern of infra red points

• Image from infra red camera

• Correlated to pattern for known distance

• Real-time depth image (30 fps)

4 03/05/2013

Image taken from: [1] Z. Zhang, “Microsoft kinect sensor and its effect,” IEEE Multimedia, vol. 19, no. 2, pp. 4–10, 2012.

Figure 2. Inside Kinect Sensor

Page 5: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

How it works?

5 03/05/2013

Figure 3. How Kinect Sensor Works?

Page 6: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

How it works?

• Depth Image

– Grayscale image

– Normalized Range: 0 – 255

– Invalid Depth: 0

– Darker pixels = Less distance

– Bright pixels = More distance

– Black pixels = no depth

6 03/05/2013 Figure 4. Kinect Depth Map Visualization

Page 7: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Why Kinect?

• Depth Image

– 3D Shape Information

– Invariant to illumination/lighting changes

– Segmentation

• RGB (Intensity) Image

– Can be aligned to depth

– Details about texture/colour

• Audio from Multi-array microphone

– Source localization

– Ambient noise suppression

• Real-time

• Low cost

7 03/05/2013

Page 8: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Why Kinect?

8 03/05/2013

Taken from: [1] Z. Zhang, “Microsoft kinect sensor and its effect,” IEEE Multimedia, vol. 19, no. 2, pp. 4–10, 2012. [2] Weise, Thibaut, Sofien Bouaziz, Hao Li, and Mark Pauly. "Realtime performance-based facial animation." ACM Trans. Graph 30,no. 4 (2011): 77.

Page 9: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Does Kinect has any limitations?

• Occlusion of projected pattern

• Small object

• Light absorbing surfaces

• Scattering of infra-red pattern

• Noise with increased distance [3]

9 03/05/2013

[3] K. Khoshelham and S.O. Elberink, “Accuracy and resolution of kinect depth data for indoor mapping applications,” Sensors, vol. 12, no. 2, pp. 1437–1454, 2012.

Page 10: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Does Kinect has any limitations?

10 03/05/2013

Figure 5. Limitations of Kinect Sensor

Page 11: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Types of Hand gestures

• Static hand gestures

– Defined by hand shape, position and orientation only

– Example: symbolic gestures in sign language

• Dynamic hand gestures

– Temporal integration of static hand gestures

– Defined by hand shape, positiong, orientation, motion,

acceleration and displacement.

– More natural gestures can be modelled

– Examples: telepresence robotics, mapped gestures on mobile

devices

11 03/05/2013

Page 12: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

OpenNI hand tracker

12 03/05/2013

Page 13: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Distance-invariant segmentation

• OpenNI hand tracker used

13 03/05/2013

Figure 6. Motivation behind Distance Invariant Segmentation

Page 14: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Distance-invariant segmentation

• Inverse relation between:

– distance Pz of hand

– side length S of segmented hand region

• Dataset collection:

– Varying distance Pz of hand

– Ground truth segmentation size S

14 03/05/2013

Page 15: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Results: Distance-invariant segmentation

15 03/05/2013

Figure 7. Distance-Invariant Segmentation vs Fixed size Segmentation

Page 16: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Feature Extraction

• Projection extraction:

where Cz= Pz – S/2

16 03/05/2013

Page 17: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Feature Extraction

• Projection extraction

• Based on projection based action recognition [4]

17 03/05/2013

[4] W. Li, Z. Zhang, and Z. Liu, “Action recognition based on a bag of 3d points,” in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2010. IEEE, 2010, pp. 9–14.

Figure 8. Projection Mask extraction; (a) XY Projection (b) ZX Projection (c) ZY Projection

Page 18: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Quantization Error in Projections

18 03/05/2013

Figure 9. Quantization and Random Error noise in Projection Masks at varying distance from Kinect sensor; (a) 700mm (b) 950mm (c) 1200mm (d) 1450mm (e) 1700mm

Page 19: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Quantization Error Reduction [5] • Morphological Operation

• Averaging based interpolation

19 03/05/2013

[5] M.Asad, C.Abhayaratne “Kinect Depth Stream Pre Processing for Hand Gesture Recognition",IEEE International Conference on Image Processing (ICIP'13), September 15-18, 2013 (Accepted)

Page 20: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Feature Extraction

20 03/05/2013

Figure 10. Contour feature extraction

Page 21: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Gesture Stages

• Swipe right and swipe left gestures

• Divided into four gesture stages

• Swipe left: 1->2->3->4

• Swipe right: 4->3->2->1

21 03/05/2013

Figure 11. Gesture stages for swipe gestures; (a) Stage 1 (b) Stage 2 (c) Stage 3 (d) Stage 4

Page 22: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Neural Network Training

22 03/05/2013

• Number of Neurons • 64x64x3 • 300 • 15 • 1

Figure 12. Neural Network Structure

Page 23: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Neural Network Training

03/05/2013 23

Page 24: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Neural Network Output with Varying Distance

24 03/05/2013 Figure 13. Neural Network Response with varying distance from the sensor

Page 25: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Neural Network and HMM

25 03/05/2013

Figure 14. Neural Network and HMM Response against time

Page 26: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Demo

26 03/05/2013

Page 27: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Guidance system for Visually Impaired

Person[6]

27 03/05/2013

[6] M.Asad, W.Ikram “Smartphone based Guidance System for Visually Impaired Person”, IEEE International Conference on Image Processing Theory, Tools and Applications (IPTA'12), October 15-18, 2012, Turkey

Figure 15. Flowchart of the guidance system

Page 28: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Feature Extraction: Edge Detection

28 03/05/2013

Figure 16. Edge Detection using (b) Canny (c) Method in [7]

[7] Y. Zhao, W. Gui, and Z. Chen, “Edge detection based on multi-structure elements morphology,” in Intelligent Control and Automation, 2006. WCICA 2006. The Sixth World Congress on, vol. 2. IEEE, 2006, pp. 9795–9798.

(a) (b) (c)

Page 29: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Feature Selection: Hough Transform

29 03/05/2013

Figure 17. Hough Transform of Fig. 16 with Hough Peaks

Page 30: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Feature Selection: Vanishing Point

30 03/05/2013 Figure 18. Vanishing Point Extraction

Page 31: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Guidance Decision

31 03/05/2013

Figure 19. Guidance Decision

Page 32: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Test Sequences

32 03/05/2013

Page 33: Academic excellence for business and the professions€¦ · 03/05/2013 27 [] M.Asad, W.Ikram ^Smartphone based Guidance System for Visually Impaired Person, IEEE International Conference

Thank you!!

33 03/05/2013