we are humor beings: understanding and predicting visual...

31
We Are Humor Beings: Understanding and Predicting Visual Humor Shuai Wang University of Toronto March 29, 2016 1 / 31

Upload: vuongkiet

Post on 05-Mar-2018

413 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

We Are Humor Beings: Understanding andPredicting Visual Humor

Shuai Wang

University of Toronto

March 29, 2016

1 / 31

Page 2: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Intro

I An integral part but not understood in detail

I An adult laughs 18 times a dayI A good sense humor

I is related to communication competenceI helps raise an individual’s social status & popularityI even helps attract compatible matesI makes yourself happier :)

2 / 31

Page 3: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Intro

I An integral part but not understood in detail

I An adult laughs 18 times a day

I A good sense humorI is related to communication competenceI helps raise an individual’s social status & popularityI even helps attract compatible matesI makes yourself happier :)

3 / 31

Page 4: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Intro

I An integral part but not understood in detail

I An adult laughs 18 times a dayI A good sense humor

I is related to communication competenceI helps raise an individual’s social status & popularityI even helps attract compatible matesI makes yourself happier :)

4 / 31

Page 5: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

What makes an image funny?

5 / 31

Page 6: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Humor Techniques

I Animal doing something unusual

I Person doing something unusual

I Somebody getting hurt

I Somebody getting scared

6 / 31

Page 7: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Animal doing something unusual

7 / 31

Page 8: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Person doing something unusual

8 / 31

Page 9: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Somebody getting hurt

9 / 31

Page 10: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Somebody getting scared

10 / 31

Page 11: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Changing objects can alter the funniness of a scene

11 / 31

Page 12: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Removing Incongruities

An elderly person kicking afootball while skateboarding isincongruous, but a young girldoing so is not

12 / 31

Page 13: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Adding Incongruities

Add incongruities (and humor)by replacing the expected withthe unexpected

13 / 31

Page 14: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Two Tasks to Understand Visual Humor

I Predicting how funny a given scene is (scene-level)

I Changing the funniness of a scene (object-level)

14 / 31

Page 15: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Object-level Features

I Object embedding (150-d): captures the context in whichan object usually occurs

I Local embedding (150-d): weighted sum of objectembeddings of all other instances

15 / 31

Page 16: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Scene-level Features

I Cardinality (150-d): bag-of-words representation of howmany instances of each object are in the scene

I Location (300-d): horizontal and vertical coordinates of everyobject (closest to the center if multiple instance)

I Scene embedding (150-d): sum of object embeddings of allobjects in the scene

16 / 31

Page 17: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Predicting Funniness Score

I Dataset: 6,400 scenes, with funny score from 1-5 labelled byworkers from Amazon Mechanical Turk

I Support Vector Regressor (SVR) on scene-level features

I Metric: average relative error

17 / 31

Page 18: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Predicting Funniness Score

I Dataset: 6,400 scenes, with funny score from 1-5 labelled byworkers from Amazon Mechanical Turk

I Support Vector Regressor (SVR) on scene-level features

I Metric: average relative error

18 / 31

Page 19: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Predicting Funniness Score

I Dataset: 6,400 scenes, with funny score from 1-5 labelled byworkers from Amazon Mechanical Turk

I Support Vector Regressor (SVR) on scene-level features

I Metric: average relative error

19 / 31

Page 20: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Predicting Funniness Score: Ablation Analysis

Different feature subsets perform about the same: slightly betterthan baseline (average score of the training scenes)

20 / 31

Page 21: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Alter Funniness of a Scene

I Detect the objects that do (or do not) contribute to humor

I Identify which objects should replace the objects from step 1

21 / 31

Page 22: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Predicting Objects to be Replaced

I On average, the model replaces 3.67 objects (2.54 groundtruth) → this bias towards replace ensures a large ‘margin’

I Animate objects like humans and animals are more likelysources of humor → tends to replace these objects

22 / 31

Page 23: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Predicting Objects to be Replaced

I On average, the model replaces 3.67 objects (2.54 groundtruth) → this bias towards replace ensures a large ‘margin’

I Animate objects like humans and animals are more likelysources of humor → tends to replace these objects

23 / 31

Page 24: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Funny → Unfunny

Old man dancing → young boy dancingHawk stealing meat → baseball

24 / 31

Page 25: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Funny → Unfunny

Cute puppy → InsectWatermelon → Ax

25 / 31

Page 26: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Unfunny → Funny

Couple having dinner at the table → Puppies having dinner at thetable

26 / 31

Page 27: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Unfunny → Funny

Cating playing around → Racoon driving motorcycle

27 / 31

Page 28: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Discussion

I Style/genre of an image or painting can make a difference

I Dataset is small: 6,400 images

I Feature representation can be improved

28 / 31

Page 29: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Discussion

I Style/genre of an image or painting can make a difference

I Dataset is small: 6,400 images

I Feature representation can be improved

29 / 31

Page 30: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

Discussion

I Style/genre of an image or painting can make a difference

I Dataset is small: 6,400 images

I Feature representation can be improved

30 / 31

Page 31: We Are Humor Beings: Understanding and Predicting Visual …fidler/teaching/2015/slides/CSC2523/shuai... · We Are Humor Beings: Understanding and Predicting Visual Humor ... with

31 / 31