The AdaBoost Algorithm


TRANSCRIPT

Page 1: The AdaBoost Algorithm

Page 2: A typical learning curve

Page 3: …and a boosting one

Page 4: this lecture

Page 5: supervised learning

Page 6: boosting: introduction

Page 7: boosting example

•  Start with uniform distribution on data
•  Weak learners = halfplanes (a minimal code sketch of this setup follows below)
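
The transcript keeps only the two bullets above, so here is a minimal, self-contained sketch of that setup: AdaBoost over a toy 2-D dataset, starting from the uniform distribution and using axis-aligned halfplanes (decision stumps) as weak learners. The dataset, the exhaustive stump search, and every name in the code are illustrative assumptions, not the example actually shown on the slides.

    import numpy as np

    def best_stump(X, y, D):
        """Pick the axis-aligned halfplane (decision stump) with the lowest
        weighted error under the current distribution D."""
        best = (0, 0.0, 1, np.inf)               # (feature, threshold, sign, weighted error)
        for j in range(X.shape[1]):
            for thr in np.unique(X[:, j]):
                for sign in (+1, -1):
                    pred = np.where(X[:, j] >= thr, sign, -sign)
                    err = D[pred != y].sum()
                    if err < best[3]:
                        best = (j, thr, sign, err)
        return best

    def adaboost(X, y, T=20):
        m = len(y)
        D = np.full(m, 1.0 / m)                  # round 1: uniform distribution on the data
        stumps, alphas = [], []
        for t in range(T):
            j, thr, sign, eps = best_stump(X, y, D)
            alpha = 0.5 * np.log((1.0 - eps) / max(eps, 1e-12))
            pred = np.where(X[:, j] >= thr, sign, -sign)
            D = D * np.exp(-alpha * y * pred)    # up-weight mistakes, down-weight correct points
            D = D / D.sum()                      # renormalize so D stays a distribution
            stumps.append((j, thr, sign))
            alphas.append(alpha)

        def H(Xnew):                             # final hypothesis: weighted majority vote
            votes = sum(a * np.where(Xnew[:, j] >= thr, s, -s)
                        for a, (j, thr, s) in zip(alphas, stumps))
            return np.sign(votes)
        return H

    # toy usage: an OR-like labeling that no single halfplane can separate
    rng = np.random.default_rng(0)
    X = rng.uniform(-1.0, 1.0, size=(40, 2))
    y = np.where((X[:, 0] > 0) | (X[:, 1] > 0), 1, -1)
    H = adaboost(X, y, T=20)
    print("training error:", np.mean(H(X) != y))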

Page 8: round 1

Page 9: round 2

Page 10: round 3

Page 11: final hypothesis

Page 12: the hypothesis points space

Page 13: separation

Page 14: AdaBoost - technical

Page 15: a Bayesian interpretation

Page 16: AdaBoost – update rule
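
The body of this slide is not reproduced in the transcript. For readability, here is the standard choice of round weight and combined hypothesis from Freund & Schapire ’97, which is presumably what the slide states (ε_t denotes the weighted error of the weak hypothesis h_t):

    \alpha_t = \frac{1}{2}\,\ln\frac{1-\epsilon_t}{\epsilon_t},
    \qquad
    H(x) = \operatorname{sign}\Big(\sum_{t=1}^{T} \alpha_t\, h_t(x)\Big)

A weak hypothesis with error below ½ gets positive weight, and the weight grows as its error approaches 0.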

Page 17: online allocation - Hedge algorithm
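
This slide's content is also missing from the transcript. As background (and as an assumption about what the slide covers), Freund & Schapire derive AdaBoost from their Hedge(β) algorithm for online allocation over N actions with per-round losses ℓ_{t,i} ∈ [0, 1]; its multiplicative weight update is:

    p_{t,i} = \frac{w_{t,i}}{\sum_{j=1}^{N} w_{t,j}},
    \qquad
    w_{t+1,i} = w_{t,i}\,\beta^{\ell_{t,i}},
    \qquad \beta \in (0,1)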

Page 18: AdaBoost – distribution update
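
The slide itself is not reproduced here; the standard AdaBoost distribution update (with Z_t the normalization constant that keeps D_{t+1} a probability distribution) is:

    D_{t+1}(i) = \frac{D_t(i)\,\exp\big(-\alpha_t\, y_i\, h_t(x_i)\big)}{Z_t}

Examples that h_t misclassifies (y_i h_t(x_i) = −1) have their weight multiplied by e^{α_t}, while correctly classified examples are down-weighted, so the next weak learner concentrates on the current mistakes.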

Page 19: training error

•  Theorem [Freund & Schapire ’97] (standard form restated below)
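
The statement of the theorem is not preserved in the transcript. Its standard form bounds the training error of the combined hypothesis H in terms of the weighted errors ε_t = ½ − γ_t of the weak hypotheses:

    \widehat{\mathrm{err}}_S(H)
    \;\le\; \prod_{t=1}^{T} 2\sqrt{\epsilon_t(1-\epsilon_t)}
    \;=\; \prod_{t=1}^{T} \sqrt{1-4\gamma_t^2}
    \;\le\; \exp\Big(-2\sum_{t=1}^{T}\gamma_t^2\Big)

So if every weak hypothesis has edge γ_t ≥ γ > 0, the training error drops exponentially fast in the number of rounds T.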

Page 20: Boosting and margin distribution

[figure: margin distribution plot; axis variable is the margin threshold θ]
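
Only the placeholder above survives of this slide's plot. The quantity whose distribution such a plot shows is, in the standard formulation, the normalized margin of a training example (x, y):

    \mathrm{margin}_f(x, y) \;=\; \frac{y \sum_{t} \alpha_t\, h_t(x)}{\sum_{t} \alpha_t} \;\in\; [-1, 1]

A positive margin means the weighted vote classifies the example correctly, and a larger margin means a more confident vote; the curve presumably plots the fraction of training examples with margin at most θ, for each threshold θ.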

Page 21: Why margins are important

•  Generalization error

Page 22: generalization error - based on margins
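
The repeated page headers of this transcript preserve the opening of the theorem on this slide ("Let D be a distribution over X × {−1, 1}, and let S be a sample of m examples chosen independently at random according to D. Assume that..."). This is the margin-based bound of Schapire, Freund, Bartlett & Lee ’98; its usual conclusion, stated here in its common form rather than copied from the slide, is that with probability at least 1 − δ over the draw of S, for every θ > 0:

    \Pr_{D}\big[y f(x) \le 0\big]
    \;\le\; \Pr_{S}\big[y f(x) \le \theta\big]
    \;+\; O\!\left(\sqrt{\frac{d \log^2(m/d)}{m\,\theta^2} + \frac{\log(1/\delta)}{m}}\right)

where f is the normalized vote and d is the VC dimension of the base hypothesis class. Notably, the bound does not depend on the number of boosting rounds T, which is the usual explanation for why AdaBoost can keep improving test error after the training error reaches zero.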

Page 23: AdaBoost - remarks

•  AdaBoost is adaptive
   •  does not need to know γ or T a priori
   •  can exploit εt ≪ ½ − γ

•  GOOD: does not overfit
•  BAD: susceptible to noise
•  ...but a nice property: can identify outliers

Page 24: Recent advances

Page 25:

Page 26: boosting vs bagging

Page 27: Boosting vs SVMs

Page 28: …hope you are still with me

Page 29: boosting using confidence-rated predictions

Page 30: multiclass: AdaBoost.M1
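
The slide body is not in the transcript. As a reminder of the algorithm being named (the standard AdaBoost.M1 of Freund & Schapire ’97, not copied from the slide): the weak hypotheses now predict a class label directly, each round requires weighted error ε_t < ½, and the final hypothesis is a weighted plurality vote:

    H(x) \;=\; \arg\max_{y \in Y} \;\sum_{t \,:\, h_t(x) = y} \log\frac{1-\epsilon_t}{\epsilon_t}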

Page 31: multiclass, multilabel: AdaBoost.MH

Page 32: output coding for multiclass problems

Page 33: InfoBoost – [Aslam ’00]

Page 34: applications

Page 35: so