3/29/2016 · • models of human decision making 8 focus of course • rigorous algorithm design...

70
Administrivia, Introduc1on to Online Learning CS 159: Advanced Topics in Machine Learning 1 3/29/2016

Upload: others

Post on 18-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Administrivia,Introduc1ontoOnlineLearning

CS159:AdvancedTopicsinMachineLearning

1

3/29/2016

Page 2: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

ClassDetails

•  Instructor:YisongYue•  TAs:

•  CourseWebsite:hBp://www.yisongyue.com/courses/cs159/

2

HoangLe StephanZheng

Page 3: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

StyleofCourse

•  Graduatelevelcourse

•  Givestudentsanoverviewoftopics

•  Digdeepintoonetopicforfinalproject

•  Assumestudentsaremathema1callymature– Goalistounderstandbasicconcepts– Understandspecificmathema1caldetailsdependingonyourinterest

3

Page 4: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

GradingBreakdown

•  Par1cipa1on(20%)

•  Mini-quizzes(10%)

•  FinalProject(70%)

4

Page 5: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

PaperReading&Discussion

•  PaperReadingCourse– Readingassignmentsforeachlecture– Lecturesmorelikediscussion

•  StudentPresenta1ons– Presenta1onschedulesignupsoon– Presentingroups– Canchoosewhichpaper(s)topresent

5

Page 6: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Mini-quizzes

•  Eveningabereverylecture– Veryshort– Easyifyoureadmaterial&aBendedlecture

•  ReleasedviaPiazza– AlsousePiazzaforQ&A

6

Page 7: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

FinalProject

•  Canbeonanytopicrelatedtothecourse

•  Workingroups

•  Willrelease1melineofprogressreportssoon

•  Peerreview(?)

7

Page 8: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Topics

•  OnlineLearning•  Mul1-armedBandits•  Ac1veLearning•  Crowdsourcing•  ReinforcementLearning•  ModelsofHumanDecisionmaking

8

Page 9: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

FocusofCourse

•  Rigorousalgorithmdesign– Mathintensive,butnothingtoohard– Willwalkthroughrelevantmathinclass

•  Applytointeres1ngapplica1ons– Whataretherightwaystomodelaproblem?

9

Page 10: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

WhatDoesRigorousMean?

•  Formalmodel– Explicitlystateyourassump1ons

•  Rigorouslyreasonabouthowyouralgorithmsolvesthemodel– Some1meswithprovableguarantees

•  Arguethatyourmodelisareasonableone

10

Page 11: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

WhatMakesaGoodFinalProject?

•  PureTheory–  Studyprooftechniques,trytoextendproof,orapplytonewsejng

•  Algorithms–  Extendalgorithms,designnewones,fornewsejngs

•  Modeling– Modelnewsejng,whataretherightassump1ons?

11

Page 12: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Outline

•  First3-5lectures– Reviewbasicalgorithms– Somewhatdry,butnecessary

•  Topics/readingschosenbystudents– Withcura1ngfromInstructor&Tas– Listofpapersalreadyonwebsite

•  Butisnego1able

12

Page 13: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

RestofToday

•  Introduc1ontoOnlineLearning– FollowtheLeader– Perceptron

•  BriefOverviewofOtherTopicsinCourse

13

Page 14: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Introduc1ontoOnlineLearning

14

Page 15: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

(MostBasic)OnlineLearning

•  Fort=1….T(some1mesTisunknown)

– Algorithmchoosespt– Worldrevealslossfunc1onLt– AlgorithmsufferslossLt(pt)

•  Goal:minimizetotalloss

15

Lt (pt )t=1

T

WhatarethesemanDcsofpt?

Whatistheloss?

Howisthelosschosen?

Page 16: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Recall:SupervisedLearning

•  Op1mizeviaStochas1cGradientDescent– Maintainawt

– Eachitera1onreceive:

– AssumesampledrandomlyfromS

– Choosewt+1basedonwtandLt

16

argminw

L yi, f (xi |w)( )i=1

N

∑ S = (xi, yi ){ }i=1N

Lt (wt ) = L yi, f (xi |wt )( )

Page 17: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

(MostBasic)OnlineLearning

•  Fort=1….T(some1mesTisunknown)

– Algorithmchoosespt– Worldrevealslossfunc1onLt– AlgorithmsufferslossLt(pt)

•  Goal:minimizetotalloss

17

Lt (pt )t=1

T

pt=wt

Lt(wt)=L(yt,f(xt|wt))

Ltchosenrandomly

Page 18: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Whatif…

•  Wereceiveaconstantstreamofdata?– Don’tknowTapriori

•  Wereceivedatainsomearbitraryway?– Notsampledindependentlyfromsomedistribu1on

•  CanwesDll(provably)achievegoodperformance?

18

Page 19: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Quan1fyingPerformance

•  Insupervisedlearningwecareabout:

•  Inonlinelearning,wecareabout:

19

L yi, f (xi |w)( )i=1

N

∑ = Li (w)i=1

N

L yt, f (xt |wt )( )t=1

T

∑ = Lt (wt )t=1

T

asinglew

asequenceofwt

Page 20: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Quan1fyingPerformance

•  Competeagainstsinglebestwinhindsight:

20

Lt (w*)

t=1

T

∑ =minw

Lt (w)t=1

T

R(T ) = Lt (wt )t=1

T

∑ − Lt (w*)

t=1

T

∑ “Regret”

InterpretaDon:bestpossiblelossw.r.t.supervisedlearning

Page 21: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Interpre1ngRegret

•  ExpectedTrainingErroris:

•  Wantexpectedtrainingerrorto(quickly)convergetoop1mal–  Equivalenttoaverageregret(quickly)convergingto0:

•  SaDsfiedwhenregretgrowssublinearlyw.r.t.T!

21

1TR(T ) = 1

TLt (wt )

t=1

T

∑ − Lt (w*)

t=1

T

∑#

$%

&

'(→ 0

1T

Lt (wt )t=1

T

Page 22: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

SummaryofRegret

•  Genericwaytoquan1fyperformance– CharacterizesspeedofconvergenceforSGD

•  Appliestomanyonlinelearningsejngs

•  We’llseeotherwaystoquan1fyperformancelaterincourse

22

Page 23: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

FollowtheLeader

23

Page 24: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

BasicOnlineConvexOp1miza1on

•  Fort=1….T(Tunknown)– AlgorithmchoosesptinRD– Worldrevealslossfunc1onLt(pt)=|yt-pt|2

– AlgorithmsufferslossLt(pt)

•  Goal:minimizetotalloss

24

Lt (pt )t=1

T

SquaredDistancetoytIngeneral,convexloss

Page 25: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

FollowtheLeaderAlgorithm

•  The“leader”isthebestpointgivenwhatweknowsofar:

25

pt = argminp

Lt ' p( )t '=1

t−1

∑ = argminp

yt ' − p2

t '=1

t−1

∑ =1t −1

yt 't '=1

t−1

ThisistheenDrealgorithm!

Page 26: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

BenefitsandDrawbacks

•  Benefits:– Efficientregretbounds(willseenextslide)– Conceptuallyverysimple

•  Canbeappliedtomanysejngs

•  Drawbacks:– Canbecomputa1onallyveryexpensive

•  Forarbitrarylossfunc1ons–  (can’tuseaverageallthe1me)

26

Page 27: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Defini1ons

•  Besthindsightchoiceoffirstt1mesteps:

•  FollowtheLeaderplays:

27

pt* = argmin

pLt ' p( )

t '=1

t

∑ = argminp

yt ' − p2

t '=1

t

∑ =1t

yt 't '=1

t

pt = pt−1*

pt = argminp

Lt ' p( )t '=1

t−1

∑ = argminp

yt ' − p2

t '=1

t−1

∑ =1t −1

yt 't '=1

t−1

Page 28: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Goal

•  MinimizeRegret:

28

R(T ) = Lt (pt )t=1

T

∑ − Lt (pT* )

t=1

T

pT* = argmin

pLt p( )

t=1

T

∑ = argminp

yt − p2

t=1

T

∑ =1T

ytt=1

T

Page 29: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Lemma1

•  InterpretaDon:–  themovingbesthindsightisatleastasgoodasthefinalbesthindsight

•  ProofbyInduc1on– Basecase(T=1):

29

L1(p1*) = L1(p1

*)

Lt (pt*)

t=1

T

∑ ≤ Lt (pT* )

t=1

T

Page 30: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

ProofCon1nued

•  Induc1veCase(T>1):–  Removelasttermbecauseit’sequivalent

–  Observe:

30

Lt (pt*)

t=1

T

∑ ≤ Lt (pT* )

t=1

T

∑ ⇒ Lt (pt*)

t=1

T−1

∑ ≤ Lt (pT* )

t=1

T−1

Lt (pt*)

t=1

T−1

∑ ≤ Lt (pT−1* )

t=1

T−1

∑ ≤ Lt (pT* )

t=1

T−1

Induc1veHypothesis

Defini1onofp*

Page 31: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

RegretBound

31

R(T ) = Lt (pt )t=1

T

∑ − Lt (pT* )

t=1

T

= Lt (pt−1* )

t=1

T

∑ − Lt (pT* )

t=1

T

≤ Lt (pt−1* )

t=1

T

∑ − Lt (pt*)

t=1

T

DefiniDonofFollowtheLeader

Lemma1

Page 32: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

RegretBound(con1nued)

32

Lt (pt−1* )

t=1

T

∑ − Lt (pt*)

t=1

T

∑ = pt−1* − yt

2

t=1

T

∑ − pt* − yt

2

t=1

T

= pt−1* − pt

*, pt−1* + pt

* − 2ytt=1

T

≤ pt−1* − pt

* ⋅t=1

T

∑ pt−1* + pt

* − 2yt

≤ pt−1* − pt

* ⋅t=1

T

∑ pt−1* + pt

*t + 2yt( )

Cauchy-Schwarz

TriangleInequality

Page 33: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

RegretBound(con1nued)

33

pt−1* − pt

* ⋅t=1

T

∑ pt−1* + pt

*t + 2yt( ) ≤ 4B pt−1

* − pt*

t=1

T

AssumeeachythasnormboundedbyB:

Notethateachp*alsohasnormboundedbyB

Page 34: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

RegretBound(con1nued)

34

pt−1* − pt

* = pt−1* −

(t −1)pt−1* + ytt

= 1tpt−1

* − yt

≤ 1t

pt−1* + yt( )

≤ 2Bt

Usethefactthat:

pt* =(t −1)pt−1

* + ytt

TriangleInequality

EachhasnormB

Page 35: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

RegretBound(complete)

35

R(T ) = Lt (pt )t=1

T

∑ − Lt (pT* )

t=1

T

≤ Lt (pt−1* )

t=1

T

∑ − Lt (pt*)

t=1

T

≤ 4B pt−1* − pt

*

t=1

T

≤ 8B2 1tt=1

T

∑ =O B2 lnT( ) LogarithmicRegret!

Independentofhoweachytischosen!

Page 36: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Recall:Interpre1ngRegret

•  ExpectedTrainingErroris:

•  Wantexpectedtrainingerrorto(quickly)convergetoop1mal–  Equivalenttoaverageregret(quickly)convergingto0:

•  SaDsfiedwhenregretgrowssublinearlyw.r.t.T!

36

1TR(T ) = 1

TLt (wt )

t=1

T

∑ − Lt (w*)

t=1

T

∑#

$%

&

'(→ 0

1T

Lt (wt )t=1

T

Page 37: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

WhenShouldYouUseFTLinPrac1ce?

•  Whensolvingeachop1miza1onproblemisnottheboBleneck– Forsimplesquareddistance,itistrivial– Formorecomplexlossfunc1ons,mightrequireexpensiveop1miza1on

•  WewillseeananalysisofSGD-stylealgorithmsnextTuesday– MakesmallupdatestoptusingonlyLt

37

Page 38: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Perceptron

38

Page 39: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

BinaryClassifica1onOnlineLearning

•  Fort=1….T(some1mesTisunknown)

– AlgorithmchooseswtinRD– Worldrevealslossfunc1on:

– AlgorithmsufferslossLt(wt)

•  Goal:minimizetotalloss

39

Lt (pt )t=1

T

Lt (wt ) =1 yt≠sign wt ,xt( )"# $%0/1Loss

Page 40: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

PerceptronLearningAlgorithm

40

IfLt(wt)=1: wt+1 = wt + ytxt

Else: wt+1 = wt

y ∈ −1,+1{ }x ∈ RD

Page 41: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

41

PerceptronLearningAssumeLinearlySeparable

Page 42: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

42

Misclassified!

PerceptronLearningAssumeLinearlySeparable

Page 43: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

43

Update!

PerceptronLearningAssumeLinearlySeparable

Page 44: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

44

Correct!

PerceptronLearningAssumeLinearlySeparable

Page 45: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

45

Misclassified!

PerceptronLearningAssumeLinearlySeparable

Page 46: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

46

Update!

PerceptronLearningAssumeLinearlySeparable

Page 47: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

47

Update!

PerceptronLearningAssumeLinearlySeparable

Page 48: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

48

Correct!

PerceptronLearningAssumeLinearlySeparable

Page 49: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

49

Correct!

PerceptronLearningAssumeLinearlySeparable

Page 50: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

50

Misclassified!

PerceptronLearningAssumeLinearlySeparable

Page 51: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

51

Update!

PerceptronLearningAssumeLinearlySeparable

Page 52: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

52

Update!

PerceptronLearningAssumeLinearlySeparable

Page 53: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

53

AllTrainingExamplesCorrectlyClassified!

PerceptronLearningAssumeLinearlySeparable

Page 54: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

RegretBound=MistakeBound(forSeparableCase)

•  Forseparablecase:

•  Regret=#MistakesPerceptronmakes

54

R(T ) = Lt (wt )t=1

T

∑ − Lt (w*)

t=1

T

Lt (w*)

t=1

T

∑ = 0

Page 55: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Lemma2

55

ytxtt∈I∑ = (wt+1 −wt )

t∈I∑ = wT+1

= wt+12− wt

2( )t∈I∑

= wt + ytxt2− wt

2( )t∈I∑

= 2yt wt, xt + xt2( )

t∈I∑

≤ xt2

t∈I∑

ytxtt∈I∑ ≤ xt

2

t∈I∑

Proof:

MistakeItera1ons

TelescopingSum

UpdateDefiniDon

≤0

Page 56: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

PerceptronMistakeBound

56

#MistakesBoundedBy: B2

γ 2

Margin

B =maxx

x

**IfLinearlySeparable

Holdsforanyorderingoftrainingexamples!

“Radius”ofFeatureSpace

Page 57: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Proof

•  Margin:

57

γ =maxwmin(xt ,yt )

yt w, xtw

!"#

$#

%&#

'#MustbeposiDveduetolinearseparability

I γ ≤w, ytxt

t∈I∑w

≤ ytxtt∈I∑ ≤ xt

2

t∈I∑ ≤ I B2

I γ ≤ I B2 ⇒ I ≤ B2

γ 2

Page 58: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Interpreta1on

•  Ifthedataislinearlyseparable

•  ThenANYorderingof(x,y)willcauseperceptrontoconvergewithfinitemistakes

•  NodependenceonIIDsamplingfromtruedistribu1on

58

Page 59: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

BriefOverviewofOtherTopics

59

Page 60: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

ContextualOnlineLearning(akaOnlineLearningwithExperts)

•  Given:Setofexperts{fk}•  Fort=1….T(some1mesTisunknown)

– Eachexpertpredictsfk,t– Algorithmchoosespt– Worldrevealslossfunc1onLt– AlgorithmsufferslossLt(pt)

•  Goal:minimizetotalloss

60

Lt (pt )t=1

T

GeneralizesBoosDng

Page 61: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Par1alInforma1onOnlineLearning

•  Fort=1….T(some1mesTisunknown)

– Algorithmchoosespt– WorldrevealslossLt(pt)– AlgorithmsufferslossLt(pt)

•  Goal:minimizetotalloss

61

Lt (pt )t=1

T

Wedon’tknowlossofotherchoices

Needto“explore”tomeasurelossofalternaDves

Page 62: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

BasicAc1veLearning(forsupervisedlearning)

•  Fort=1….– Algorithmchoosesx– Worldrevealsassociatedlabely– Add(x,y)totrainingset

•  Terminatewhensufficientlyconfidentofbestmodel

62

Page 63: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

SimpleExample

•  1feature•  Learnthresholdfunc1on

63

TrueModelPassiveLearningSamplefromdistribu1on

LearnedModel

Page 64: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

SimpleExample

•  1feature•  Learnthresholdfunc1on

64

TrueModelAcDveLearningBinarySearch

Page 65: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

ComparisonwithPassiveLearning

•  #samplestobewithinεoftruemodel

•  PassiveLearning:

•  Ac1veLearning:

65

O 1ε

!

"#$

%&

O log 1ε

!

"#

$

%&

Simple'Example'

•  1'feature'•  Learn'threshold'func7on'

39'

True'Model'Passive'Learning'Sample'from'distribu7on'

Learned'Model'Simple'Example'

•  1'feature'•  Learn'threshold'func7on'

40'

True'Model'Ac#ve&Learning&Binary'Search'

Page 66: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Crowdsourcing

66

Y LeCunMA Ranzato

Object Recognition [Krizhevsky, Sutskever, Hinton 2012]

“Mushroom”

Labeled and Unlabeled data

Human expert/Special equipment/

Experiment

“Crystal” “Needle” “Empty”

Cheap and abundant ! Expensive and scarce !

“0” “1” “2” …

“Sports”“News”“Science”

Unlabeled

LabeledIni1allyEmpty

Repeat

Page 67: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

HowReliableareAnnotators?

•  Ifweknewwhatthelabelswere– Canjudgeworkersonlabelquality

•  Ifweknewwhothegoodworkerswere– Cancreatelabelsfromtheirannota1ons

•  Chickenandeggproblem!

67

Page 68: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

ReinforcementLearning

68

•  Inprevioussejngs:– Ac1onsdonotimpactstate– “Stateless”

•  ReinforcementLearning– Ac1onseffectstateyou’rein– Rewardfunc1ondependsonstate– Example:PlayingGo

Page 69: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

Off-PolicyEvalua1on

•  Example:Wehavehospitallogsofpneumoniadeathsundervariouscondi1ons.

– Wanttotrainmodelpredictwhoismostatrisk

– Modelpredictsthatasthmapa1entshaveLOWERriskforpneumoniadeath….

– BecausedoctorspaycloseraBen1ontoasthmapa1ents!

69

Page 70: 3/29/2016 · • Models of Human Decision making 8 Focus of Course • Rigorous algorithm design – Math intensive, but nothing too hard – Will walk through relevant math in class

ModelingHumanDecisionMaking

•  Howdohumansreactinsequen1aldecisionmakingprocesses?

– Dotheybehavelikefollowtheleader?

– Dotheybehavelikeaperceptron?

70