implementation of multi-model approach for estimation of ... · problem size #models rtr? final...

31
By Mohammad Reza Karimi Dastjerdi Supervisor: Dr. Amin Nikanjam Implementation of Multi - model Approach for Estimation of Distribution Algorithms (EDA) to Solve Multi - Structural Problems 1 Summer 2016

Upload: others

Post on 11-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

By Mohammad Reza Karimi Dastjerdi

Supervisor: Dr. Amin Nikanjam

Implementation of Multi-model Approach

for Estimation of Distribution Algorithms (EDA)

to Solve Multi-Structural Problems

1

Summer 2016

Page 2: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Contents

■ Introduction

■ Heuristic strategies

■ Stochastic Heuristic Searches

■ Genetic Algorithm

■ Estimation of Distribution Algorithms

■ EDAs Pseudo code

■ Bayesian Optimization Algorithm

■ BOAs Pseudo code

■ Bayesian Networks

■ Bayesian Dirichlet Metric

■ Find Best Network

■ Generate New Solutions2

■ Selection Operators

■ Parent Selection

■ Truncation Selection

■ Tournament Selection

■ Survivor Selection

■ Worst Replacement

■ Restricted Tournament Replacement

■ Multi-model Approach

■ Multi-structural Problems

■ Conclusion & Suggestions

■ References

Page 3: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Introduction

■ Optimization is a mathematical discipline that concerns the finding of

the extreme of numbers, functions, or systems.

■ All the search strategy types can be classified as:

• Complete strategies : All possible solutions of the search space

• Heuristic strategies : Part of them following a known algorithm

3

Page 4: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Heuristic strategies

■ Deterministic

• Same Conditions → Same Solution

• Main drawback → Risk of getting stuck in local optimum values

■ Non-deterministic

• Even in Same Conditions → Different Solutions

• Escape from these local maxima by means of the randomness

4

Page 5: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Stochastic Heuristic Searches

■ Store one solution: Simulated Annealing

■ Population-based: Evolutionary Computation.

5

Page 6: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Genetic Algorithms

■ One of the examples of evolutionary computation is Genetic Algorithms (GAs).

■ Inspired by the process of natural selection

■ Relied on bio-inspired operators such as mutation, crossover and selection.

■ The behavior of GAs depends on too many parameters like:

• Operators and probabilities of crossing and mutation

• Size of the population

• Rate of generational reproduction

• Number of generations

• and so on.

6

Page 7: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Estimation of Distribution Algorithms

■ All these reasons have motivated the creation of a new type of algorithms classified

under the name of Estimation of Distribution Algorithms (EDAs).

■ EDAs are:

• Population-based

• Probabilistic modelling of promising solutions

• Simulation of the induced models to guide their search

7

Page 8: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

EDAs Pseudo code

8

Page 9: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Bayesian Optimization Algorithm

■ By Pelikan, Goldberg, & Cantu-paz [1998]

9

Page 10: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

BOAs Pseudo code

10

Page 11: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Bayesian Networks■ A Bayesian network encodes the relationships between the variables contained in

the modeled data.

■ Bayesian networks can be used to:

• Describe the data

• Generate new instances of the variables with similar properties as those of given data

■ Each node → one variable.

𝑝 𝑋 =

𝑖=0

𝑛−1

𝑝 𝑋𝑖 𝑋𝑖

11

Page 12: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Bayesian Dirichlet Metric

■ By Heckerman et al [1994]

■ Prior knowledge about the problem + Statistical data from a given data set.

■ The BD metric for a network B given a data set D and the background information 𝜉denoted by 𝑝 𝐷, 𝐵 𝜉 is defined as:

𝒑 𝑫,𝑩 𝝃 = 𝒑 𝑩 𝝃

𝒊=𝟎

𝒏−𝟏

𝝅𝑿𝒊

𝒎′ 𝝅𝑿𝒊 !

𝒎′ 𝝅𝑿𝒊 +𝒎 𝝅𝑿𝒊 !

𝒙𝒊

𝒎′ 𝒙𝒊,𝝅𝑿𝒊 +𝒎 𝒙𝒊,𝝅𝑿𝒊 !

𝒎′ 𝒙𝒊, 𝝅𝑿𝒊 !

𝑚 𝜋𝑋𝑖 =

𝑥𝑖

𝑚 𝑥𝑖 , 𝜋𝑋𝑖

12

Page 13: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Find Best Network

■ K = 0

• An empty network is the best one (and the only one possible).

■ K = 1

• special case of the so-called maximal branching problem

■ K > 1

• NP-complete

• A simple greedy search

• Only operations are allowed that:

o keep the network acyclic

o at most k incoming edges into each node

• The algorithms can start with:

o An empty network

o Best network with one incoming edge for each node

o Randomly generated network. 13

Page 14: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Generate New Solutions

■ Time complexity of generating the value for each variable: 𝑂 𝑘 .

■ Time complexity of generating an instance of all variables: 𝑂 𝑛𝑘 .

14

Page 15: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Multi-model Approach

22

■ Many real-world problems present several global optima.

■ Most EDAs typically bypass the issue of global multi-modality.

■ It is often preferable or even necessary to obtain as many global optima as possible.

■ Used in ECGA by Chuang and Hsu [2010].

Page 16: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Implementation

23

Page 17: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Results

Average

fitness calls

Average model-

building time

Average

algorithm

time

Final Population sizeRTR?#ModelsProblem Size

68220.0190.02813120130

164160.0210.01518120330

159950.0220.0287341330

24

0200

400600800

1000

12001400

160018002000

Traditional Multi-Models Multi-Models with

RTR

Po

pu

lati

on

Siz

e

Different Approaches

0

2000

4000

6000

8000

10000

12000

14000

16000

18000

Traditional Multi-Models Multi-Models with

RTR

Ave

rage

Fit

ne

ss C

alls

Different Approaches

Page 18: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Implementation

25

𝑁𝑚𝑜𝑑𝑒𝑙𝑖 = 𝑓𝑖

𝑗=1𝑀𝑜𝑑𝑒𝑙𝑠 𝑓𝑗

𝑁𝑚𝑜𝑑𝑒𝑙𝑖 =𝑒𝑓𝑖

𝑗=1𝑀𝑜𝑑𝑒𝑙𝑠 𝑒𝑓𝑗

Page 19: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Results

Average

fitness calls

Average

model-

building time

Average

algorithm

time

Final Population

sizeSoftMax?RTR?#ModelsProblem Size

68220.0190.028131200130

1436890.1930.1261525010330

265800.0390.03957811330

26

0

2000

4000

6000

8000

10000

12000

14000

16000

18000

Traditional Multi-Models with

Softmax

Multi-Models with

Softmax and RTR

Po

pu

lati

on

Siz

e

Different Approaches

0

20000

40000

60000

80000

100000

120000

140000

160000

Traditional Multi-Models with

Softmax

Multi-Models with

Softmax and RTR

Ave

rage

Fit

ne

ss C

alls

Different Approaches

Page 20: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Implementation

27

Page 21: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

ResultsAverage

fitness calls

Average

model-

building

time

Average

algorithm

time

Final Population sizeParent

Displacement?SoftMax?RTR?#Models

Problem

Size

68220.0190.0281312000130

77310.0090.0381562100330

72040.0120.071921101330

28

0

200

400

600

800

1000

1200

1400

1600

1800

Traditional Multi-Models with

Parent Displacement

Multi-Models with

Parent Displacement

and RTR

Po

pu

lati

on

Siz

e

Different Approaches

0

1000

2000

3000

4000

5000

6000

7000

8000

9000

Traditional Multi-Models with

Parent Displacement

Multi-Models with

Parent Displacement

and RTR

Ave

rage

Fit

ne

ss C

alls

Different Approaches

Page 22: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Implementation

29

Page 23: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

ResultsAverage fitness calls

Average model-

building time

Average

algorithm

time

Final

Population

size

Pop.

Type

Parent

Displacement?SoftMax?RTR?#Models

Problem

Size

68220.0190.0281312----000130

77310.0090.0381562Single100330

72040.0120.071921Single101330

148710.0150.0583125Multi100330

30

0

500

1000

1500

2000

2500

3000

3500

Traditional SinglePop. SinglePop. and

RTR

Multi Pop.

Po

pu

lati

on

Siz

e

Different Approaches

0

2000

4000

6000

8000

10000

12000

14000

16000

Traditional Single Pop. Single Pop. and

RTR

Multi Pop.

Ave

rage

Fit

ne

ss C

alls

Different Approaches

Page 24: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Multi-structural Problems

■ Deceptive Behavior!

𝑀𝑆𝑃1 𝑥 = 𝛼 + 𝑓𝑡𝑟𝑎𝑝 𝑚,𝑘 𝑥2, … , 𝑥𝑛 𝑥1 = 1

𝑛 − 1 − 𝑓𝑂𝑛𝑒𝑀𝑎𝑥 𝑥2, … , 𝑥𝑛 𝑥1 = 0

𝑀𝑆𝑃2 𝑥 = 𝑚𝑎𝑥 𝛼 + 𝑓𝑡𝑟𝑎𝑝 𝑚,𝑘 𝑥 , 𝑛 − 1 − 𝑓𝑂𝑛𝑒𝑀𝑎𝑥 𝑥

31

Page 25: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

MSP1 Test Results

32

Average fitness

calls

Average model-

building time

Average

algorithm time

Final Population

sizeParent Replacement?RTR?#ModelsProblem Size

248970.0660.0935750-----0131

278400.0220.1687510331

233470.0220.163487511331

339070.0420.311825010531

318120.0190.218625011531

0

1000

2000

3000

4000

5000

6000

7000

8000

9000

Traditional Three

Models

Three

Models and

RTR

Five Models Five Models

and RTR

Po

pu

lati

on

Siz

e

Different Approaches

0

5000

10000

15000

20000

25000

30000

35000

40000

Traditional Three

Models

Three

Models and

RTR

Five Models Five Models

and RTR

Ave

rage

Fit

ne

ss C

alls

Different Approaches

Page 26: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

MSP2 Test Results

33

Average fitness

calls

Average

model-building

time

Average

algorithm

time

Final Population sizeParent Replacement?RTR?#ModelsProblem Size

5969602.9494.231164000-----0130

2386800.3792.3277800010330

2017500.0890.6972500011330

2566600.2372.7978200010530

1859250.0670.871850011530

0

20000

40000

60000

80000

100000

120000

140000

160000

180000

Traditional Three

Models

Three

Models and

RTR

Five Models Five Models

and RTR

Po

pu

lati

on

Siz

e

Different Approaches

0

100000

200000

300000

400000

500000

600000

700000

Traditional Three

Models

Three

Models and

RTR

Five Models Five Models

and RTR

Ave

rage

Fit

ne

ss C

alls

Different Approaches

Page 27: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Conclusions and Suggestions

■ The proposed method seems Nice!

■ Method of learning: Sporadic or Incremental

■ Number of Models

34

Page 28: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

References

■ E. Bengoetxea, "Inexact Graph Matching Using Estimation of Distribution Algorithms," Paris, 2002

■ M. Pelikan, D. E. Goldberg and E. E. Cantú-paz, "Linkage Problem, Distribution Estimation, and Bayesian Networks," Evolutionary Computation, pp. 311-340, 2000

■ T. Blickle and L. Thiele, "A comparison of selection schemes used in evolutionary algorithms," Evolutionary Computation, vol. 4, no. 4, pp. 361-394, 1996

■ A. E. Eiben and E. J. Smith, Introduction to Evolutionary Computing, Heidelberg: springer, 2003

■ C. F. Lima, C. Fernandes and F. G. Lobo, "Investigating restricted tournament replacement in ECGA for non-stationary environments," in Genetic and evolutionary computation, New York, 2008

■ C.-Y. Chuang and W.-L. Hsu, "Multivariate multi-model approach for globally multimodal problems," in Genetic and evolutionary computation, New York, 2010.

■ A. Nikanjam and H. Karshenas, "Multi-Structure Problems: Difficult Model Learning in Discrete EDAs," in IEEE Congress on Evolutionary Computation (CEC2016), Vancouver, Canada, 2016

■ M. Pelikan, K. Sastry and D. E. Goldberg, "Sporadic model building for efficiency enhancement of the hierarchical BOA," Genetic Programming and Evolvable Machines, vol. 9, no. 1, p. 53–84, 2008

■ Martin Pelikan’s personal website

■ Wikipedia35

Page 29: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Special Thanks to…

36

Dr. Nikanjam

Dr. Naser Sharif

Saeed Ghadiri

Page 30: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

Special Thanks to…

37

Dr. Taghirad

KN2C Robotic Team

Page 31: Implementation of Multi-model Approach for Estimation of ... · Problem Size #Models RTR? Final Population size 30 1 0 1312 0.028 0.019 6822 30 3 0 1812 0.015 0.021 16416 30 3 1 734

38