introduction to bag of little bootstrap

44
ML-IR Discussion: Bag of Little Bootstrap (BLB)

Upload: wayne-lee

Post on 06-May-2015

700 views

Category:

Education


2 download

DESCRIPTION

Reading group presentation on Bag of Little Bootstrap (BLB)

TRANSCRIPT

Page 1: Introduction to Bag of Little Bootstrap

ML-IR Discussion:Bag of Little Bootstrap (BLB)

Page 2: Introduction to Bag of Little Bootstrap

Recap:

- Recap- Why bootstrap- What is bootstrap

- Bag of Little Bootstrap (BLB)- Guarantees- Examples

Page 3: Introduction to Bag of Little Bootstrap

Recap:

Population

Our Sample

Page 4: Introduction to Bag of Little Bootstrap

Estimate the median!

Page 5: Introduction to Bag of Little Bootstrap

Estimate the median!

Page 6: Introduction to Bag of Little Bootstrap

Asymptotic Approach

Theory has it:

Page 7: Introduction to Bag of Little Bootstrap

Asymptotic Approach

Theory has it:

?

Page 8: Introduction to Bag of Little Bootstrap

Asymptotic Approach

95% Confidence Interval

Page 9: Introduction to Bag of Little Bootstrap

Problems with the asymptotic Approach:

- Density “f” is hard to estimate- Sample size demand is much larger than the mean for

Central Limit theorem to kick in- True median unknown

Page 10: Introduction to Bag of Little Bootstrap

Solution:When theory is too hard…Let’s empirically estimate theoretical truth!

Page 11: Introduction to Bag of Little Bootstrap

Empirical Approach: Ideal

Population

Sample Over and Over again!

Page 12: Introduction to Bag of Little Bootstrap

Empirical Approach: Ideal

Population

Median Est 1 Median Est 2

Sample Over and Over again!

Page 13: Introduction to Bag of Little Bootstrap

Empirical Approach: Ideal

Page 14: Introduction to Bag of Little Bootstrap

Empirical Approach: Ideal

95% of sample medians

Page 15: Introduction to Bag of Little Bootstrap

SimilarEnough?

Population

Our Sample

Page 16: Introduction to Bag of Little Bootstrap

Empirical Approach: BootstrapEfron Tibshirani (1993)

Our Sample

Draw with replacement n samples

Median Est* 1 Median Est* 2

Page 17: Introduction to Bag of Little Bootstrap

Empirical Approach: Bootstrap

Page 18: Introduction to Bag of Little Bootstrap

Empirical Approach: Bootstrap95% of sample medians

Page 19: Introduction to Bag of Little Bootstrap

Empirical Approach: Bootstrap

Used for:- Bias estimation- Variance- Confidence intervals

Main benefits:- Automatic- Flexible- Fast convergence (Hall, 1992)

Page 20: Introduction to Bag of Little Bootstrap

Key: There are 3 distributions

Population

Page 21: Introduction to Bag of Little Bootstrap

Key: There are 3 distributions

Approximatedistribution

Population

Actual Sample

Page 22: Introduction to Bag of Little Bootstrap

Key: There are 3 distributions

Approximatedistribution

Approximatedistribution

Population

Actual Sample

Bootstrap Samples

Page 23: Introduction to Bag of Little Bootstrap

Key: There are 3 distributions

Approximatedistribution

Approximatedistribution

Approximatethe approximation- Is there bias?- What’s the variance?- etc.

Population

Actual Sample

Bootstrap Samples

Page 24: Introduction to Bag of Little Bootstrap

No free meals:

- Bootstrapping requires re-sampling the entire population B times

- Each sample is size n- Sampling m < n will violate the sample size

properties- Original sample size cannot be too small

- “Pre-asymptopia” cases

Page 25: Introduction to Bag of Little Bootstrap

Hope- Resample expects .632n unique samples- Sample less – m out of n bootstrap is possible with

analytical adjustments. (Bickel 1997)

Page 26: Introduction to Bag of Little Bootstrap

Hope- Resample expects .632n unique samples- Sample less – m out of n bootstrap is possible with

analytical adjustments. (Bickel 1997)

Intuition: Need less than all n values for each bootstrap.

Page 27: Introduction to Bag of Little Bootstrap

Hope- Resample expects .632n unique samples- Sample less – m out of n bootstrap is possible with

analytical adjustments. (Bickel 1997)

Intuition: Need less than all n values for each bootstrap.

Problem:- Analytical adjustment is not as automatic as desirable- m out of n bootstrap is sensitive to choices of m

Page 28: Introduction to Bag of Little Bootstrap

Bag of Little Bootstrap- Sample without

replacement the sample s times into sizes of b

Page 29: Introduction to Bag of Little Bootstrap

Bag of Little Bootstrap- Sample without

replacement the sample s times into sizes of b- Resample each

until sample size is n, r times.

Page 30: Introduction to Bag of Little Bootstrap

Bag of Little Bootstrap- Sample without

replacement the sample s times into sizes of b- Resample each

until sample size is n, r times.

- Compute the median for each

Med 1 Med r

Page 31: Introduction to Bag of Little Bootstrap

Bag of Little Bootstrap- Sample without

replacement the sample s times into sizes of b- Resample each

until sample size is n, r times.

- Compute the median for each

- Compute the confidence interval for each

Med 1 Med r

Page 32: Introduction to Bag of Little Bootstrap

Bag of Little Bootstrap- Sample without

replacement the sample s times into sizes of b- Resample each

until sample size is n, r times.

- Compute the median for each

- Compute the confidence interval for each

Med 1 Med r

Page 33: Introduction to Bag of Little Bootstrap

Bag of Little Bootstrap- Sample without

replacement the sample s times into sizes of b- Resample each

until sample size is n, r times.

- Compute the median for each

- Compute the confidence interval for each

- Take average of each upper and lower point for the confidence interval

Med 1 Med r

Page 34: Introduction to Bag of Little Bootstrap

Bag of Little BootstrapKlein et al. 2012

Computational Gains:- Each sample only has b unique values!

- Can sample a b-dimensional multinomial with n trials.

- Scales in b instead of n- Easily parallelizable

Page 35: Introduction to Bag of Little Bootstrap

Bag of Little BootstrapKlein et al. 2012

Computational Gains:- Each sample only has b unique values!

- Can sample a b-dimensional multinomial with n trials.

- Scales in b instead of n- Easily parallelizable

If b=n^(0.6), a dataset of size 1TB:- Bootstrap storage demands ~ 632GB- BLB storage demands ~ 4GB

Page 36: Introduction to Bag of Little Bootstrap

Bag of Little Bootstrap

Theoretical guarantees:- Consistency- Higher order correctness- Fast convergence rate (same as bootstrap)

Page 37: Introduction to Bag of Little Bootstrap

Performanceb = n^(gamma), 0.5<= gamma <=1These choices of gamma ensures bootstrap convergence rates.

Page 38: Introduction to Bag of Little Bootstrap

Performanceb = n^(gamma), 0.5<= gamma <=1These choices of gamma ensures bootstrap convergence rates.

Relative error of confidence interval width of logistic regressioncoefficients(Klein et al. 2012)

Page 39: Introduction to Bag of Little Bootstrap

Performanceb = n^(gamma), 0.5<= gamma <=1These choices of gamma ensures bootstrap convergence rates.

Relative error of confidence interval width of logistic regressioncoefficients(Klein et al. 2012)

Gamma residuals t-distr residuals

Page 40: Introduction to Bag of Little Bootstrap

Performance vs Time

Page 41: Introduction to Bag of Little Bootstrap

Selecting Hyperparameters• b, the number of unique samples for each little bootstrap• s, the number of size b samples w/o replacement• r, the number of multinomials to draw

Page 42: Introduction to Bag of Little Bootstrap

Selecting Hyperparameters• b, the number of unique samples for each little bootstrap• s, the number of size b samples w/o replacement• r, the number of multinomials to draw

b: the larger the betters, r: adaptively increase this until a convergence has been reached. (Median doesn’t change)

Page 43: Introduction to Bag of Little Bootstrap

Bag of Little Bootstrap

Main benefits:- Computationally friendly- Maintains most statistical properties of bootstrap- Flexibility- More robust to choice of b than older methods

Page 44: Introduction to Bag of Little Bootstrap

Reference• Efron, Tibshirani (1993) An Introduction to the Bootstrap• Kleiner et al. (2012) A Scalable Bootstrap for Massive Data

Thanks!