intensive surrogate model exploitation in self-adaptive surrogate-assisted cma-es (saacm-es) ilya

19
State-of-the-art Intensive surrogate model exploitation Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya Loshchilov 1 , Marc Schoenauer 2 and Michèle Sebag 2 1 LIS, École Polytechnique Fédérale de Lausanne 2 TAO, INRIA - CNRS - Université Paris-Sud July 9th, 2013 Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 1/ 19

Upload: ilya-loshchilov

Post on 29-Jun-2015

112 views

Category:

Business


0 download

TRANSCRIPT

Page 1: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive Surrogate Model Exploitation inSelf-adaptive Surrogate-assisted CMA-ES

(saACM-ES)

Ilya Loshchilov1, Marc Schoenauer2 and Michèle Sebag 2

1LIS, École Polytechnique Fédérale de Lausanne

2TAO, INRIA − CNRS − Université Paris-Sud

July 9th, 2013

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 1/ 19

Page 2: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Historical overview: PPSN’2010, GECCO’2012

PPSN’2010

CMA-ES assisted with comparison-based surrogate models(ACM algorithm) 1.Good performance but exploitation is independent on surrogatemodel quality, surrogate hyper-parameters are fixed.

GECCO’2012

Self-adaptive surrogate-assisted CMA-ES (IPOP-saACM-ES andBIPOP-saACM-ES) on noiseless2 and noisy testbeds3.BIPOP-saACM-ES demonstrates good performance w.r.t. allalgorithms tested during the BBOB-2009, 2010 and 2012.

1[Loshchilov, Schoenauer and Sebag; PPSN 2010] "Comparison-based optimizers needcomparison-based surrogates"

2[Loshchilov, Schoenauer and Sebag; GECCO-BBOB 2012] "Black-box optimizationbenchmarking of IPOP-saACM-ES and BIPOP-saACM-ES on the BBOB-2012 noiseless testbed"

3[Loshchilov, Schoenauer and Sebag; GECCO-BBOB 2012] "Black-box optimizationbenchmarking of IPOP-saACM-ES on the BBOB-2012 noisy testbed"

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 2/ 19

Page 3: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Content

1 State-of-the-artCovariance Matrix Adaptation Evolution Strategy (CMA-ES)s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

2 Intensive surrogate model exploitationIntensive surrogate model exploitation

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 3/ 19

Page 4: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Covariance Matrix Adaptation Evolution Strategy (CMA-ES)s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

(µ, λ)-Covariance Matrix Adaptation Evolution Strategy

Rank-µ Update 4 5

yi ∼ N (0,C) , C = I

xi = m + σ yi, σ = 1

sampling of λsolutions

Cµ = 1µ

∑yi:λy

Ti:λ

C← (1− 1)×C + 1×Cµ

calculating C frombest µ out of λ

mnew ←m + 1µ

∑yi:λ

new distribution

The adaptation increases the probability of successful steps to appear again.

Other components of CMA-ES: step-size adaptation, evolution path.

4 [Hansen et al., ECJ 2003] "Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation(CMA-ES)"

5 the slide adopted by courtesy of Nikolaus Hansen

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 4/ 19

Page 5: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Covariance Matrix Adaptation Evolution Strategy (CMA-ES)s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

Invariance: Guarantee for Generalization

Invariance properties of CMA-ES

Invariance to order-preservingtransformations in function space

true for all comparison-based algorithms

Translation and rotation invariancethanks to C

−3 −2 −1 0 1 2 3−3

−2

−1

0

1

2

3

−3 −2 −1 0 1 2 3−3

−2

−1

0

1

2

3

CMA-ES is almost parameterless (as a consequence of invariances)

Tuning on a small set of functions Hansen & Ostermeier 2001

Default values generalize to whole classes

Exception: population size for multi-modal functions a b

a[Auger & Hansen, CEC 2005] "A restart CMA evolution strategy with increasing population size"b[Loshchilov et al., PPSN 2012] "Alternative Restart Strategies for CMA-ES"

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 5/ 19

Page 6: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Covariance Matrix Adaptation Evolution Strategy (CMA-ES)s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

Contents

1 State-of-the-artCovariance Matrix Adaptation Evolution Strategy (CMA-ES)s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

2 Intensive surrogate model exploitationIntensive surrogate model exploitation

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 6/ 19

Page 7: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Covariance Matrix Adaptation Evolution Strategy (CMA-ES)s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

Using Ranking SVM as the surrogate model

Build a global model using Ranking SVM 6

xi ≻ xj iff F̂(xi) < F̂(xj)

Comparison-based surrogate models → invariance torank-preserving transformations of F(x)

How to choose an appropriate Kernel?

Use covariance matrix C adapted by CMA-ES in Gaussiankernel7

K(xi, xj) = e−(xi−xj)

T (xi−xj)

2σ2 ; KC(xi, xj) = e−(xi−xj)

T C−1(xi−xj)

2σ2

Invariance to rotation of the search space thanks to C6[Runarsson et al., PPSN 2006] "Ordinal Regression in Evolutionary Computation"7[Loshchilov et al., PPSN 2010] "Comparison-based optimizers need comparison-based

surrogates"

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 7/ 19

Page 8: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Covariance Matrix Adaptation Evolution Strategy (CMA-ES)s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

Surrogate-assistedCMA-ES with onlineadaptation of modelhyper-parameters a

a[Loshchilov et al., GECCO 2012]"Self-Adaptive Surrogate-Assisted CovarianceMatrix Adaptation Evolution Strategy"

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 8/ 19

Page 9: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Covariance Matrix Adaptation Evolution Strategy (CMA-ES)s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

Results on Black-Box Optimization CompetitionBIPOP-s∗aACM and IPOP-s∗aACM (with restarts) on 24 noiseless 20 dimensional functions

0 1 2 3 4 5 6 7 8 9log10 of (ERT / dimension)

0.0

0.5

1.0

Proportion of functions

RANDOMSEARCHSPSABAYEDADIRECTDE-PSOGALSfminbndLSstepRCGARosenbrockMCSABCPSOPOEMSEDA-PSONELDERDOERRNELDERoPOEMSFULLNEWUOAALPSGLOBALPSO_BoundsBFGSONEFIFTHCauchy-EDANBC-CMACMA-ESPLUSSELNEWUOAAVGNEWUOAG3PCX1komma4mirser1plus1CMAEGSDEuniformDE-F-AUCMA-LS-CHAINVNSiAMALGAMIPOP-CMA-ESAMALGAMIPOP-ACTCMA-ESIPOP-saACM-ESMOSBIPOP-CMA-ESBIPOP-saACM-ESbest 2009f1-24

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 9/ 19

Page 10: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive surrogate model exploitation

Contents

1 State-of-the-artCovariance Matrix Adaptation Evolution Strategy (CMA-ES)s∗ACM-ES: Self-Adaptive Surrogate-Assisted CMA-ES

2 Intensive surrogate model exploitationIntensive surrogate model exploitation

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 10/ 19

Page 11: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive surrogate model exploitation

Intensive surrogate model exploitation

When optimizing F̂ , λ = kλλdef , µ = µdef

Parameter settings: kλ = 1 for D<10 and kλ = 10, 100, 1000 for10, 20, 40-D.

An interpretation: trust-region search.

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 11/ 19

Page 12: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive surrogate model exploitation

Divergence prevention

Surrogate model error estimation can be impreciseThe proposed simple fix:

Exploit surrogate only if the model is "reasonably" precise:kλ > 1 is used only of n̂ ≥ n̂kλ

Parameter settings: n̂kλ= 4.

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 12/ 19

Page 13: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive surrogate model exploitation

Surrogate model hyper-parameters adaptation: theoriginal saACM

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 13/ 19

Page 14: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive surrogate model exploitation

Surrogate model hyper-parameters adaptation: theproposed saACM-k

CPU time per function evaluation: 1.22 sec. (kλ = 1), 0.313 sec.(kλ = 10), 0.311 sec. (kλ = 100), 0.367 sec. (kλ = 1000), 1.52sec. (kλ = 10000).

Here, for kλ = 100 it is even computationally cheaper!

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 14/ 19

Page 15: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive surrogate model exploitation

Ill-conditioned problems

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 15/ 19

Page 16: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive surrogate model exploitation

All BBOB problems

* smaller budget for surrogate-assisted search: 104D forBIPOP-saACM-k versus 106D for BIPOP-saACM.

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 16/ 19

Page 17: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive surrogate model exploitation

Hybridization

HCMA = NEWUOA (first 10n evaluations) + STEP (to exploitseparability if any) + BIPOP-saACM-k.

0 1 2 3 4 5 6 7 8log10 of (# f-evals / dimension)

0.0

0.2

0.4

0.6

0.8

1.0

Proportion of function+target pairs

fmincon

lmm-CMA-ES

IPOP-texp

MOS

BIPOP-saACM-k

HCMA

best 2009f1-24,20-D

* smaller budget for surrogate-assisted search: 104D forIlya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 17/ 19

Page 18: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive surrogate model exploitation

Conclusion

Pros:

Faster convergence, especially on ill-condition problems.

Potentially smaller CPU per function evaluation.

Cons:

Decrease of performance if surrogate error estimation isimprecise.

Perspective

Kullback-Leibler divergence measure for surrogate modelcontrol.8

8[Loshchilov, Schoenauer and Sebag; CAP 2013] "KL-based Control of the Learning Schedulefor Surrogate Black-Box Optimization"

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 18/ 19

Page 19: Intensive Surrogate Model Exploitation in Self-adaptive Surrogate-assisted CMA-ES (saACM-ES) Ilya

State-of-the-artIntensive surrogate model exploitation

Intensive surrogate model exploitation

Thank you for your attention!

Questions?

Ilya Loshchilov, Marc Schoenauer and Michèle Sebag Intensive Surrogate Model Exploitation in saACM-ES 19/ 19