h uman a ction r ecognition using l ocal s patio -t emporal d iscriminant e mbedding kui jia and...

12
HUMAN ACTION RECOGNITION USING LOCAL SPATIO- TEMPORAL DISCRIMINANT EMBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and Pattern Recognition Instructor: Jenn-Jier Lien Reporter: Mei-Hsuan Chao

Upload: britton-wood

Post on 13-Dec-2015

218 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

HUMAN ACTION RECOGNITION USING LOCAL SPATIO-TEMPORAL DISCRIMINANTEMBEDDINGKui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and Pattern Recognition

Instructor: Jenn-Jier LienReporter: Mei-Hsuan Chao

Page 2: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

2

OUTLINE

Introduction Related work Local spatio-temporal discriminant embedding Experiments Conclusion

Page 3: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

3

INTRODUCTION

Recognizing human activities in videos has many important computer vision applications.

A human silhouette contains both instant spatial information about the body pose and dynamic temporal motion information of the global body and local body parts.

Human silhouettes can be considered as data points on nonlinear dynamic shape manifolds.

The aim in this paper is to find a manifold embedding method which can optimally make use of the discriminative temporal shape variation information between different types of actions.

Page 4: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

4

RELATED WORK

LPP(Locality preserving projections) LPP constructs a nearest neighbor graph. By using the Laplacian of the graph, LPP can find a mapping

which optimally preserves the local neighborhood information.

LSDA(locality sensitive discriminant analysis) LSDA first constructs one nearest neighbor graph, and then

splits it into the within-class graph and the between-class graph.

Page 5: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

5

LOCAL SPATIO-TEMPORAL DISCRIMINANT EMBEDDING

Neighbor graph G

Short video segment Si

Page 6: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

6

LOCAL SPATIO-TEMPORAL DISCRIMINANT EMBEDDING

Objective functions

Page 7: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

7

LOCAL SPATIO-TEMPORAL DISCRIMINANT EMBEDDING

Principal angles between Si and Sj

Page 8: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

8

LOCAL SPATIO-TEMPORAL DISCRIMINANT EMBEDDING

Optimal embedding

The columns of an optimal A can be obtained as the generalized eigenvectors corresponding to the l largest eigenvalues.

Page 9: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

9

LOCAL SPATIO-TEMPORAL DISCRIMINANT EMBEDDING

Iterative learning

Page 10: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

10

EXPERIMENTS

Data setting

Design of two-stage recognition scheme

Frame by frame basis

Short segment

basis

Test silhouette

frame

Recognitionresult

Page 11: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

11

EXPERIMENTS

Page 12: H UMAN A CTION R ECOGNITION USING L OCAL S PATIO -T EMPORAL D ISCRIMINANT E MBEDDING Kui Jia and Dit-Yan Yeung, IEEE Conference on Computer Vision and

12

CONCLUSION

Propose a novel local spatio-temporal discriminant embedding (LSTDE) method.

Perform recognition on a frame-by-frame or short video segment basis.

Experimental results demonstrate that the proposed method can accurately recognize human actions, and outperforms some representative manifold embedding methods.