swarm intelligence in dynamic environments yang.pdf · aco mimics the behaviour of ants searching...

41
Shengxiang Yang Centre for Computational Intelligence (CCI) De Montfort University, Leicester LE1 9BH, UK http://www.tech.dmu.ac.uk/~syang Email: [email protected] Talk presented at the 2018 International Conference on Swarm Intelligence (ICSI 2018), Shanghai, China, 17-22 June, 2018 Swarm Intelligence in Dynamic Environments

Upload: others

Post on 01-Oct-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Shengxiang Yang

Centre for Computational Intelligence (CCI)

De Montfort University, Leicester LE1 9BH, UK

http://www.tech.dmu.ac.uk/~syang

Email: [email protected]

Talk presented at the 2018 International Conference on Swarm Intelligence

(ICSI 2018), Shanghai, China, 17-22 June, 2018

Swarm Intelligence in Dynamic Environments

Page 2: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Wish You All

Happy Dragon Boat Festival!

端午节快乐!

2

Page 3: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Centre for Computational Intelligence, DMU

• CCI (www.cci.dmu.ac.uk): Mission: Developing fundamental theoretical and practical solutions to

real-world problems using a variety of CI paradigms

Members: 18 staff, research fellows, ~30 PhDs, visiting researchers

Themes: EC, fuzzy logic, neural networks, data mining, robotics, game …

• Funding: Research Councils/Charities: EPSRC, ESRC, EU FP7 & Horizon 2020,

Royal Society, Royal Academy of Eng., Innovate UK, KTP, Nuffield Trust …

Government: Leicester City Council, DTI

Industries: Lachesis, EMDA, RSSB, Network Rail, etc.

• Collaborations: Universities: UK, USA, Spain, and China

Industries and local governments

• Teaching/Training: DTP-IS: University Doctor Training Programme in Intelligent Systems

MSc: Intelligent Systems (& Robotics); Data Analytics; BI & Data Mining

BSc: AI with Robotics; Computer Game Programming; Math; Comp Sci

• YouTube page: http://www.youtube.com/thecci3

Page 4: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Outline of the Talk

• Introduction to swarm intelligence (SI)

• SI for dynamic optimisation problems (DOPs)

• Case studies

• Future directions

• Summary

4

Page 5: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Swarm Intelligence (SI) in Nature/Biology

• Swarm Intelligence (SI): Intelligence achieved by the

collective behavior of a swarm of agents (individuals) A system consists of a population of agents that interact locally with

one another and with their environment

Each agent follows very simple rules (may involve self-learning and

social-learning)

Interactions between such agents lead to the emergence of

"intelligent" global behavior, unknown to the individual agents

5

Page 6: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

A Simple Example

6

Ants leave pheromone trails to point the way to food (E. O. Wilson 1962)

Page 7: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Swarm Intelligence in Optimization

• Encapsulates a class of optimisation algorithms inspired

by the SI phenomena in nature/biology

Particle Swarm Optimization (PSO): Bird flocking

Ant Colony Optimization (ACO): Ant foraging

Artificial Bee Colony (ABC) algorithms: Bee colony

Bacterial Foraging Optimization (BFO): Bacterial growth

Firefly Algorithm (FA): Firefly flashing

……

7

Page 8: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Ant Colony Optimization (ACO)

• Proposed by Dorigo et al. (1996)

• ACO mimics the behaviour of ants searching for food

• The idea: ants “walk” on the arcs of graph while “reading”

and “writing” pheromones until they converge into a path

• The shorter the path the more pheromone deposited

• Standard ACO consists of two phases: Forward mode: Construct solutions

Backward mode: Pheromone update

8

Page 9: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Forward Mode: Construct Solutions

9

• Ant k constructs a tour probabilistically

is the existing pheromone between cities i and j

is the heuristic information between cities i and j

is the list of nearest unvisited cities of city i

α and β are constant parameters that determine the influence of τ

and η, respectively

Page 10: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Backward Mode: Pheromone Update

10

• Ant k updates its pheromone trails

Deposit pheromone

• is the tour constructed by ant k

• is the amount of pheromone to be deposited

Evaporate pheromone

• where ρ is the evaporation rate

• Helps ants to “forget” bad decisions (poor solutions) made in the past

(previous iteration): If an arc is not chosen by ants for a number of

iterations, its associated pheromone value decreases exponentially

Page 11: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

SI Applications

• Advantages of SI: Multiple solutions in a single run

No strict requirements to problems

Easy to use

• Widely used for optimisation and search problems Financial and economical systems

Transportation and logistics systems

Industry engineering

Automatic programming, art and music design

......

11

Page 12: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

SI for Dynamic Optimization Problems: Motivation

• Traditionally research on SI has focused on static problems Aims to find optimum quickly and precisely in the search space

12

• But, many real-world problems are dynamic optimisation

problems (DOPs), where changes occur over time Transport systems: travel time between nodes may change

Logistics: customer demands may change

Page 13: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

What Are DOPs?

• In general terms, “optimisation problems that change over

time” are called dynamic problems or time-dependent

problems:

F = f(X, S, t)

where X: decision variable(s); S: parameters; t: time

• DOPs: a special class of dynamic problems that are

solved online by an algorithm as time goes by

13

Page 14: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Why DOPs Challenging SI?

14

• For DOPs, optima may move over time in search space We need to track the moving optima over time

• DOPs challenge traditional SI algorithms Once converged, hard for an SI to escape from the old optimum

Page 15: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Why SI for DOPs?

• Many real-life problems are DOPs

Desirable to present solutions to decision makers over time

• SI methods, once properly enhanced, are good choice

Inspired by biological behaviour, always in dynamic environments

Intrinsically, should be fine to deal with DOPs

• Research on SI for DOPs rises recently

15

Page 16: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

SI for DOPs: Things to Do

• To detect potential environmental changes

Success rate of detection

Cost of detection

• To track the changing optima

To expect a steady and fast change response

To reduce the cost of tracking (given the budget limit, i.e., time,

memory)

16

Page 17: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Change Detection Approaches

• Why is detection important ? When a change occurs, archived solutions may become outdated

SI would get misled if archived solutions are not re-evaluated in

time

• Two ways of detecting changes: Individual-level detection: fast but not robust

Population-level detection: slow but robust

Both methods could fail to detect changes (not 100% guaranteed)

17

Page 18: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Individual-level Change Detection

• Re-evaluate some individuals’ fitness every generation

Check the discrepancy between their current and previous fitness

• Success rate of detection depends on

Detectability of environmental changes

Location of detectors placed

18

able to detect unable to detect

detectors before a change

re-evaluated detectors due

to a change

time-independent

fitness segments

Page 19: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Population-level Change Detection

• Population-related statistical information, i.e., distribution,

is assessed in every generation Check the significance of variation in statistical information

• Less sensitive to noise but possibly higher computational

cost

19

population distribution

before a change

population distribution

after a change

Page 20: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Response Approaches

• How about restarting an SI algorithm after a change?

Natural and easy choice

But, not good choice due to:

• It may be inefficient, wasting computational resources

• It may lead to very different solutions before and after a change. For

real-world problems, we may expect solutions to remain similar

• Extra approaches are needed to enhance SI for DOPs

• Typical approaches: Memory: store and reuse useful information

Diversity: handle convergence directly

Multi-population: co-operate sub-populations

Hybridization: hybridize SI with local search or other metaheuristics

20

M. Mavrovouniotis, C. Li, and S. Yang. A survey of swarm intelligence for dynamic optimization: Algorithms and

applications. Swarm and Evolutionary Computation, 33: 1-17, April 2017

Page 21: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Memory-based Approaches

• Cyclic DOPs: optimal solutions repeatedly return locations

• Memory: to store history information for future use

• Challenges: What information to store?

When and how to retrieve memory?

How to update memory?

21

Page 22: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Diversity-based Approaches

• Diversity increase: introduce diversity after a change

Partially random restart, hyper-mutation, variable local search

22mutate/recombine

select

increase

diversity

Change

detected?

YES

NO

Page 23: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Diversity-based Approaches

• Diversity maintenance: maintain diversity throughout the

run (even if no change occurs) Random immigrants

23

mutate/recombine

select maintain diversity

Page 24: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Multi-population Approaches

• Idea: Use several cooperative populations Populations evolve independently in different areas of search space

Populations exclude each other to avoid overlap

When optimum moves, nearby population will take action

24

Page 25: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Hybridization Approaches

• Idea: Using hybridization technique to improve the perfor-

mance of SI for DOPs

• Hybridize SI with local search + diversity schemes, e.g.: P-ACO: Hybridize ACO with local search and random immigrants

Multi-strategy ensemble PSO (MEPSO): Hybridize PSO with

Gaussian local search + differential mutation

• Hybridize SI with other meta-heuristic methods PSO + Cellular Aotomata

PSO + Fuzzy C-means

25

• M. Mavrovouniotis, S. Yang, A memetic ant colony optimization algorithm for the dynamic travelling

salesman problem, Soft Comput. 15 (7) (2011) 1405–1425

• W. Du, B. Li, Multi-strategy ensemble particle swarm optimization for dynamic optimization, Inf. Sci. 178

(15) (2008) 3096–3109

• A. Hashemi, M. Meybodi, A multi-role cellular PSO for dynamic environments, in: 14th International

Computer Conference (CSICC 2009), 2009, pp. 412–417

• M. Kamosi, A. Hashemi, M. Meybodi, A hibernating multi-swarm optimization algorithm for dynamic

environments, in: 2010 2nd World Congress on Nature and Biologically Inspired Computing, 2010, 363–369

Page 26: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Remarks on Enhancing Approaches

• No clear winner among the approaches

• Memory is efficient for cyclic environments

• Multi-population is good for multimodal problems

Able to maintain diversity

The search ability will decrease if too many sub-populations

• Diversity schemes are usually useful

Guided immigrants may be more efficient

• Thumb of rule: balancing exploration & exploitation over

time

26

Page 27: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Case Study: Multi-swarm PSO for Continuous DOPs

• Recently, a framework of multi-population approaches Use single linkage hierarchical clustering to create populations

Each population will search one peak in the fitness landscape

An overcrowding scheme to remove unnecessary populations

A special rule to decide proper moments to increase diversity

without change detection

An adaptive method to create a proper number of populations

needed

27

• C. Li and S. Yang. A general framework of multi-population methods with clustering in undetectable dynamic

environments. IEEE Transactions on Evolutionary Computation, 16(4): 556-577, August 2012

• C. Li, S. Yang, and M. Yang. An adaptive multi-swarm optimizer for dynamic optimization problems.

Evolutionary Computation, 22(4): 559-594, Winter 2014

• C. Li, T. T. Nguyen, M. Yang, M. Mavrovouniotis, and S. Yang. An adaptive multi-population framework for

locating and tracking multiple optima. IEEE Transactions on Evolutionary Computation, 20(4):590-605, 2016

Page 28: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Demo: Multi-swarm PSO Using Clustering

28

Page 29: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Case Study: ACO for Combinatorial DOPs

29

Image source: https://en.wikipedia.org/wiki/Leicester_railway_station

• A train that arrives late at a station will miss its scheduled time slot and may have to be reallocated to a new platform

• Multiple trains may be delayed in suc-cession, each new delay changes the problem

• Dynamic Railway Platform Reallocation Problem (DRPRP) reallocates multiple successive delayed trains to new timeslots on railway platforms to mini-mise the ongoing delay in the system • We considered Leicester station

• A busy UK railway station with 4

bi-directional platforms and trains

arriving from 4 different directions

• We consider the effect of the

reallocation decisions not only at

the station but also on the

remainder of these trains’ journey

Leicester Station Track Layout

Page 30: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Modelling the Problem

30

• The model was created from Network

Rail’s train schedule data from Integrated

Train Planning System (ITPS)

• From this we extract details of the

movement of trains through the station

and the movement of all trains at each

timing point on each train’s route

• We consider timing points within 50 miles

of Leicester station (225 timing points)

Some

timing

points

in the

problem

An example of the schedule feed data

Page 31: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Leicester Station Simulation

31

Page 32: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Max-Min Ant System (MMAS)

32

Each node in the

graph represents a

train and the platform

to assign the train to

An ant starts on node 0

The ant chooses next

node probabilistically

Ant Solution:

<Train A on Platform 2>

The ant now chooses

the next train & platform

Ant Solution:

<Train A on Platform 2,

Train B on Platform 3>

• After all ants have

made a tour, all

pheromone trails

are evaporated

• Pheromone is laid

down on the best

ant’s tour

• In ACO ants communicate indirectly via pheromone trails

• We model the problem with a directed edge graph

• Ants choose next node based on pheromone trails and

problem-specific heuristics

Page 33: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Algorithm Design

33

After a Dynamic Change:

• More trains have arrived but some trains have passed through the station

• The graph is updated but pheromones are kept between changes to retain useful information from before change

Unnecessary Platform Reallocation:• MMAS has no mechanism to persuade it

against unnecessarily reallocating trains to

platforms. To resolve this we:

1. Add a heuristic based on the physical

distance between platforms

2. Introduce a best-so-far ant replacement

scheme that discourages unnecessary

reallocations of trains to new platforms

Image source: http://www.adelaidenow.com.au//

Page 34: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Comparison Algorithm

34

First Free Platform (FFP)• Discussions with a Network Rail Station Master established that a

technique often used to reallocate delayed trains to platforms is to

find the first free platform as close as possible to the original platform

• We compared our MMAS algorithm to a heuristic using this principle

Modelling Dynamism

The frequency of change f is the time interval between delayed trains.

The magnitude of change m is how much the train is delayed by

In this investigation trains were delayed by 10, 20 and 30 min with gaps

of 10, 20 and 30 min to give 9 different dynamic scenarios

J. Eaton and S. Yang. Railway platform reallocation after dynamic perturbations using ant colony optimisation.

Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence, pp. 1-8, 2016

Page 35: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Experimental Results

35

Low frequency, high magnitude changes High frequency, high magnitude changes

Page 36: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

SI for Dynamic Multi-objective Optimization

• So far, mainly dynamic single-objective optimization

• Dynamic multi-objective optimization problems (DMOPs)

Even more challenging

• Recently, rising interest in studying SI for DMOPs Eaton et al. (2017) applied ACO for the dynamic multi-objective

railway junction rescheduling problem

36

J. Eaton, S. Yang, and M. Gongora. Ant colony optimization for simulated dynamic multi-objective railway

junction rescheduling. IEEE Transactions on Intelligent Transportation Systems, 18(11): 2980-2992, 2017

Page 37: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

SI for DOPs: Challenging Issues

• Detecting changes: Most studies assume that changes are easy to detect or visible to

an algorithm whenever occurred

In fact, changes are difficult to detect for many DOPs

• Understanding the characteristics of DOPs:

What characteristics make DOPs easy or difficult?

Little work, needs much more effort

• Analysing the behaviour of SI methods for DOPs:

Requiring more theoretical analysis tools

Big question: Which SI methods for what DOPs?

• Real world applications:

How to model real-world DOPs?

37

Page 38: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Future Work

• The domain has attracted a growing interest recently But, far from well-studied

• New approaches needed: esp. hybrid approaches

• Theoretical analysis: greatly needed

• SI for DMOPs: deserves much more effort

• Real world applications: also greatly needed Fields: logistics, transport, MANETs, data streams, social networks,

...

38

Page 39: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Summary

• SI for DOPs: important area The domain is still young and active

Many challenges to be taken

• More young researchers are greatly welcome!

39

Thanks!

Page 40: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Acknowledgements

• Two EPSRC funded projects on EC for DOPs “EAs for DOPs: Design, Analysis and Applications”

• Funding/Duration: over £600K/3.5 years (1/2008–7/2011)

• http://gtr.rcuk.ac.uk/project/B807434B-E9CA-41C7-B3AF-

567C38589BAC

“EC for Dynamic Optimisation in Network Environments”• Funding/Duration: ∼£1M/4.5 years (2/2013–8/2017)

• http://gtr.rcuk.ac.uk/project/C43F34D3-16F1-430B-9E1F-

483BBADCD8FA

• Research team members: Research Fellows: Dr. Hui Cheng, Dr. Crina Grosan, Dr. Changhe

Li, Dr. Michalis Mavrovouniotis, Dr. Yong Wang, etc.

PhD students: Changhe Li, Michalis Mavrovouniotis, Shouyong

Jiang, Jayne Eaton, etc.

• Research co-operators: Prof. Xin Yao, Prof. Juergen Branke, Dr. Renato Tinos, Dr. Hendrik

Richter, Dr. Trung Thanh Nguyen, Dr. Juan Zou, etc.

40

Page 41: Swarm Intelligence in Dynamic Environments Yang.pdf · ACO mimics the behaviour of ants searching for food • The idea: ants “walk” on the arcs of graph while “reading” and

Relevant Information

• IEEE CIS Task Force on EC in Dynamic and Uncertain

Environments http://www.tech.dmu.ac.uk/~syang/IEEE_ECIDUE.html

• Source codes: http://www.tech.dmu.ac.uk/~syang/publications.html

• Books and survey papers: Y. Jin, J. Branke. Evolutionary optimization in uncertain

environments–A survey. IEEE Trans Evol Comput, 9(3): 303–317,

2005

T. T. Nguyen, S. Yang, J. Branke. Evolutionary dynamic

optimization: A survey of the state of the art. Swarm and

Evolutionary Computation, 6: 1-24, 2012

S. Yang, X. Yao. Evolutionary Computation for Dynamic

Optimization Problems. Springer, 2013

41