carl vondrick, antonio torralba adria recasens*, aditya...

32
Where are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick, Antonio Torralba Presented by: Surbhi Goel

Upload: others

Post on 27-Sep-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Where are they looking?Adria Recasens*, Aditya Khosla*,Carl Vondrick, Antonio Torralba

Presented by: Surbhi Goel

Page 2: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Where are they looking?

Follow the gaze of the person and identify the object being looked at

Page 3: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Demo: http://gazefollow.csail.mit.edu/demo.html

Page 4: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Experiments

● Dataset Visualizations○ Images in the Dataset○ Head Locations○ Gaze Locations/Length

● Model Experiments○ Qualitative Evaluation○ Visualizing Gaze Mask and Saliency Map○ Animal Gaze Following○ Extending to Short Video

Page 5: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Dataset Visualizations

Page 6: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Training Set Images

Page 7: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Training Set Images

Page 8: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Training Set Images

Page 9: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Heatmaps for Head Location

Train Test

Page 10: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Heatmaps for Gaze Location

Train Test

Page 11: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Heatmaps for Relative Gaze Location

Train Test

Page 12: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Histogram for Length of Gaze

Train Test

Page 13: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Observations

● Head/Gaze are concentrated for train and scattered for test

● Relative gaze is concentrated for both

● Gaze length relatively short (0.2 peak)

Page 14: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Model Evaluation

Page 15: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Good Cases

Page 16: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Good Cases

Page 17: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Bad Cases

Head fully tilted but missed

Page 18: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Bad Cases

Face forward but eyes tiltedNo object of attention

Page 19: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Bad Cases

Back facing

Page 20: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Observations

● Handle groups well

● Gaze location is very accurate, head location often not

● Unable to capture eye movement independent of face orientation

● Fails at a lot of back facing cases

Page 21: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Gaze Mask and Saliency Map

Page 22: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Gaze Mask and Saliency Map

● Gaze Mask incorporates the general direction of gaze

● Saliency Map incorporates the salient objects in image

● Element-wise product captures locations that satisfy both

Page 23: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Gaze Mask and Saliency Map

Image with Gaze Gaze Mask Saliency Map

Page 24: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Animal Gaze Follow

Page 25: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Animal Gaze Follow

Page 26: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Animal Gaze Follow

Works (almost) for even birds

Page 27: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Animal Gaze Follow

Works even when more than one salient object

Page 28: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Animal Gaze Follow

● Model generalizes to animals○ Initialized with ImageNet which has animal data

● Able to learn properties based on orientation of head

● Point of gaze is not always correct

Page 29: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Extension to a Short Video

Apply model per frame of video

Page 30: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Extension to a Short Video

Head detector often fails, could use temporal context to improve

Page 31: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Conclusions

● Can be confused with mixed orientations and back-facing

● Model generalizes well to animals

● Could be potentially extended to videos

● Could be applied to other domains?

Page 32: Carl Vondrick, Antonio Torralba Adria Recasens*, Aditya ...vision.cs.utexas.edu/381V-spring2016/slides/goel-expt.pdfWhere are they looking? Adria Recasens*, Aditya Khosla*, Carl Vondrick,

Thank You!