coached active learning for interactive video search xiao-yong wei, zhen-qun yang machine...

76
Coached Active Learning for Interactive Video Search Xiao-Yong Wei , Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University, China

Upload: amice-boyd

Post on 01-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Coached Active Learning for Interactive Video Search

Xiao-Yong Wei, Zhen-Qun Yang

Machine Intelligence LaboratoryCollege of Computer ScienceSichuan University, China

Page 2: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Coached Active Learning for Interactive Video Search

Xiao-Yong Wei, Zhen-Qun Yang

Machine Intelligence LaboratoryCollege of Computer ScienceSichuan University, China

Page 3: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Xiao-Yong Wei, Zhen-Qun Yang

Machine Intelligence LaboratoryCollege of Computer ScienceSichuan University, China

Coached Active Learning for Interactive Video Search

Page 4: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Coached Active Learning for Interactive Video Search

Xiao-Yong Wei, Zhen-Qun Yang

Machine Intelligence LaboratoryCollege of Computer ScienceSichuan University, China

Page 5: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,
Page 6: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Search “car on road” with Google

Page 7: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Search “car on road” with Google

Page 8: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,
Page 9: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Similar to the Automatic Search task in TRECVID

Page 10: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Relevance Feedback

Interactive Search

Page 11: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Relevance Feedback

Interactive Search

Query Modeling, i.e., to figure out what the

searcher wants?

Page 12: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Relevance Feedback

Interactive Search

Query Modeling, i.e., what the

searcher wants?

Querying Strategy, i.e., how to keep the searcher’s patience on labeling?

Page 13: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Query Modeling is indeed a task of Space Exploration

Page 14: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Search & Space Exploration

Page 15: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Search & Space Exploration

Page 16: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Feature Space with Unlabeled

Instances

Page 17: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Feature Space with Unlabeled

Instances

Page 18: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,
Page 19: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,
Page 20: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,
Page 21: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Space Exploration for Query Modeling

How to explore effectively?

How to explore effectively?

Querying Strategy: which instances to query next (i.e., being presented to the

searcher for labeling)?

Querying Strategy: which instances to query next (i.e., being presented to the

searcher for labeling)?

Query Distribution

Page 22: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

The Goal of Query Modeling in Terms of Space Exploration

• A reasonable querying strategy– To find the Query Distribution (i.e., the distribution

of the relevant instances) ASAP– To satisfy Searchers with Relevant Samples ASAP

Page 23: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling – A Popular and

Effective Strategy

Page 24: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling

The 1st Round – Training Classifier

Page 25: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling

The 1st Round – Query the Searcher

Page 26: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling

The 2nd Round – Training Classifier

Page 27: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling

The 2nd Round – Query the Searcher

Page 28: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling

The 3rd Round

Page 29: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling

The 4th Round

Page 30: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling

The 5th Round

Page 31: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling

Page 32: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling may not be a “kindred

soul” with Video Search

Page 33: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Active Learning with Uncertainty Sampling may not be a “kindred soul” with Video Search

Page 34: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

The spirit of Active Learning(to reduce labeling efforts)

The goal of Query Modeling

(to harvest relevant instances)Also found by A.G. Hauptmann and et al. [MM’06]

Active Learning with Uncertainty Sampling may not be a “kindred soul” with Video Search

Page 35: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

The Dilemma of Exploration and Exploitation

• Exploration: to explore more unknown area and get more about the Query Distribution and to boost future gains

• Exploitation: to harvest more relevant instances and to obtain immediate rewards

Page 36: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Our Idea – The Query Distribution, even Unknown, is Predictable

Page 37: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

The Idea of Coached Active Learning

• Estimate the Query Distribution after each round to avoid the risk of learning on a completely unknown distribution

Page 38: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

The Idea of Coached Active Learning

• Estimate the Query Distribution

pdf of the underlying Query Distribution

Page 39: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

The Idea of Coached Active Learning

• Estimate the Query Distribution

• Query users with instances picked from both dense and uncertain areas of the distribution

• Balance the proportion of the two types of instances

Page 40: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

The Idea of Coached Active Learning

Modeling the Query Distribution

Balancing the Exploration and

Exploitation

Page 41: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Modeling the Query Distribution

Page 42: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Modeling the Query Distribution

L+: An incomplete sampling of the Query Distribution

?

?

?

??

?

?

?

?

Page 43: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Modeling the Query Distribution

• An indirect way of estimating the pdf of the Query Distribution– Find training examples

which are statistically from the same distribution as L+

– Use the pdf(s) of those training examples to estimate that of the query

?

?

?

??

?

?

?

?

Page 44: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Modeling the Query Distribution• A practical implementation

– Training: organize training examples into nonexclusive semantic groups

– Testing: check L+ with each group, see whether they are statistically from the same distribution using two-sample Hotelling T-Square Test

– Testing: estimate Query Distribution with GMM

?

?

?

??

?

?

?

?

Page 45: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Modeling the Query Distribution

• Rationale of the implementation– Samples from the same distribution may carry

similar semantics– Semantic groups (which passed the test) then reflect

the searcher’s query intention

Page 46: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Modeling the Query Distribution

Page 47: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Modeling the Query DistributionDistribution of the semantic group “road”

Page 48: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Modeling the Query DistributionDistribution of the semantic group “road”

Distribution of the semantic group “car”

Distribution of the query “car on road”

Page 49: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Balancing the Exploration and Exploitation

Page 50: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Balancing the Exploration and Exploitation

• The priority of selecting an instance x to query next (i.e., present to the searcher for labeling)

Exploitative Priority Explorative Priority

Balancing Factor

Page 51: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Harvest(x): Exploitative Priority

• How likely the exploitation will be boosted when x is selected to query next

Current Decision Boundary

Page 52: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

• How likely the exploitation will be boosted when x is selected to query next

Current Decision Boundary

Harvest(x): Exploitative Priority

Page 53: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

• How likely the exploitation will be boosted when x is selected to query next

Harvest(x): Exploitative Priority

Page 54: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

• How likely the exploitation will be boosted when x is selected to query next

Harvest(x): Exploitative Priority

Page 55: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Explore(x): Explorative Priority

• How likely the exploration will be improved when x is selected to query next

Entropy of the prior distribution

Entropy of the posterior distribution

Page 56: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

• How likely the exploration will be improved when x is selected to query next

Explore(x): Explorative Priority

Page 57: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

λ: Balancing Factor

• Updated by investigating the Harvest History, i.e., numbers of relevant instances found in previous rounds

The expected harvest

Page 58: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experimental Results

Page 59: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Learning pdf(s) of the semantic groups– Large Scale Concept Ontology for Multimedia

(LSCOM), 449 concept labeled on TRECVID 2005 development dataset

– Using concept combination to create groups– Results: 23,064 groups and their pdfs

(http://www.cs.cityu.edu.hk/~xiaoyong/CAL/)

Page 60: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Comparison with Uncertainty Sampling (US)– 12 volunteers (age 19~26, 3 girls and 9 boys)– TREVID 2005 test dataset (45,765 video shots in

English/Chinese/Arabic)– 24 TRECVID queries and ground truth– Mean Average Precision (MAP)– Concept-based search for the initial round– Interface

Page 61: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Comparison with Uncertainty Sampling (US)– 12 volunteers (age 19~26, 3 girls and 9 boys)– TREVID test dataset (45,765 video shots in

English/Chinese/Arabic)– 24 TRECVID queries and ground truth– Mean Average Precision (MAP)– Interface

Page 62: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Comparison with Uncertainty Sampling (US)

Page 63: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Comparison with Uncertainty Sampling (US)

Harvest the shots by initial search

Page 64: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Comparison with Uncertainty Sampling (US)

Harvest the shots in the most dense area

Page 65: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Comparison with Uncertainty Sampling (US)

Drop down little a bit to encourage exploration and then remains stable to harvest the newly found relevant shots

Page 66: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Comparison with Uncertainty Sampling (US)

Most relevant shots have been harvested, more and more emphasis on exploration

Page 67: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments• Comparison with Uncertainty Sampling (US)

– Purely explorative US causes early give-up

Most shots have been harvested, more and more emphasis on exploration

“difficult” queries

Page 68: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Study of user patience on labeling

Page 69: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Can GMM stimulate the query distribution well?

Query “the road with one more cars”

Page 70: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Experiments

• Comparison with the state-of-the-art– CAL (with virtual searchers) vs. The best results

reported in TRECVID 06-09

Page 71: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Conclusions

Page 72: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Conclusions

• Advantages of CAL– Predictable Query Distribution– Fast Convergence– Balance between Exploitation and Exploration in a principled way

• Toys and demos available at http://www.cs.cityu.edu.hk/~xiaoyong/CAL/

Page 73: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Conclusions

• Future work– Replace the Harvest() and Explore() with more

advanced ones– Try it on larger datasets or web videos

Page 74: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Advantages of CAL

• Predictable Query Distribution

• Fast Convergence

Uncertainty Sampling CAL, the 1st round

Page 75: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Thanks!

Page 76: Coached Active Learning for Interactive Video Search Xiao-Yong Wei, Zhen-Qun Yang Machine Intelligence Laboratory College of Computer Science Sichuan University,

Thanks!