biology inspired techniques for self organization in ... · project funded by the future and...

71
Project funded by the Future and Emerging Technologies arm of the IST Programme Biology Inspired techniques for Self Organization in dynamic Networks IST-2001-38923 Ozalp Babaoglu Dipartimento di Scienze dell’Informazione Università di Bologna

Upload: vankiet

Post on 27-Aug-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

Project funded by the Future and Emerging Technologies arm of the IST Programme

Biology Inspired techniques for Self Organization in

dynamic NetworksIST-2001-38923

Ozalp Babaoglu

Dipartimento di Scienze dell’InformazioneUniversità di Bologna

2© Babaoglu 2004

Motivation

Current networked information systems are fragile (not robust), rigid (not adaptive) and are notoriously difficult to configure and maintain

Many natural (biological, social) systems are exactly the opposite — they are robust, adaptive and self-organizing despite being highly decentralized

Can we build information systems that are more “organic” or “life-like”?

Do this by drawing inspiration from biology

3© Babaoglu 2004

Dynamic networks

The problem is further aggravated in modern network structures Mobile ad-hoc networks (MANET) Overlay Networks

• Peer-to-Peer systems• Grid computing

Due to their extreme size and extreme dynamism

4© Babaoglu 2004

Project BISON

Funded by IST-FET under FP5 Partners

University of Bologna, Italy (Coordinator) Telenor Communication AS, Norway Technical University of Dresden, Germany IDSIA, Lugano, Switzerland

1 January 2003 start date, duration 36 months Total cost €2,251,594 EU funding €1,128,000 URL: http://www.cs.unibo.it/bison

5© Babaoglu 2004

BISON objectives

Complex adaptive system CAS are collections (swarm) of “agents”, acting in a decentralized and distributed fashion found in nature and biological processes social structures economies, financial markets

Behavior of CAS is often self-organizing, adaptive and robust (“nice properties”)

We want to implement a number of functions on a variety of network structures using ideas from CAS

Note that we are not interested in modelling or developing theories for explaining particular CAS

6© Babaoglu 2004

BISON expected results

Decentralized, self-organizing, adaptive and robust solutions to important technological problems that arise in dynamic networks

Systematic framework and a coherent set of heuristics to guide the synthesis of complex systems that solve interesting technological problems

7© Babaoglu 2004

BISON biological inspirations

Social insects, ants Amoebae Chemotaxis Immune system Epidemics (gossip) Aggregation Neurons Regeneration

8© Babaoglu 2004

BISON biological inspirations

Social insects, ants Amoebae Chemotaxis Immune system Epidemics (gossip) Aggregation Neurons Regeneration

9© Babaoglu 2004

BISON functions, services

Routing (MANET) Power management (MANET) Load balancing Searching Collective computation Monitoring Topology management

10© Babaoglu 2004

BISON functions, services

Routing (MANET) Power management (MANET) Load balancing Searching Collective computation Monitoring Topology management

11© Babaoglu 2004

Swarm intelligence

The set of local agents that are equals (peers) forms the “swarm”

The agents interact (locally) Each individual agent has very limited intelligence (ie,

simple rules) But the swarm has a collective intelligence that can handle

difficult challenges The intelligent behavior that the swarm exhibits (built from

simple agents following simple rules) is called “emergence”

12© Babaoglu 2004

Emergence

Emergence is all around us a city car traffic the brain the immune system an ant colony

Emergent behavior is collective behavior arising from the interaction of many autonomous units, where the units obey simple rules, and yet it is:

complex and interesting (maybe even adaptive) difficult to predict from knowledge of the agents’ rules

13© Babaoglu 2004

Back to BISON

BISON applies these ideas to large-scale, dynamic networks of computers, PDAs, phones, etc. to solve important problems such as efficient routing of traffic; load balancing; search over distributed content; distributed computation

14© Babaoglu 2004

What is a MANET

A network in which all communication is wireless: One shared channel Unreliable transmissions Low bandwidth

All nodes are mobile: Nodes can enter and leave the network at any time The topology changes constantly

There is no fixed infrastructure: All nodes act as terminals and routers There is no central control or overview

15© Babaoglu 2004

MANET example

16© Babaoglu 2004

MANET routing

17© Babaoglu 2004

MANET routing

18© Babaoglu 2004

MANET routing

How to route traffic between nodes so that all those that can communicate do so and with (almost) minimal latency

Despite the various sources of dynamism Changing traffic patterns Unreliable communication Arrival-departure of nodes Mobility of nodes Changing radio coverage (battery drain)

19© Babaoglu 2004

Biological inspiration: Ant Colonies

Foraging ant colonies can find shortest paths in a synergistic way in distributed and dynamic environments: While moving back and forth between nest and food, ants

mark their path by secreting pheromone Step-by-step routing decisions are biased by the local

intensity of pheromone field Pheromone is the colony’s collective and distributed

memory — it encodes the collectively learned quality of local routing choices toward destination target

20© Babaoglu 2004

Swarm routing with ants

Desired emergent behavior: identification of good paths Simple rules:

explore the net, seeking a destination lay a trace (pheromone) reflecting the quality of the path

taken traffic to be routed follows the traces with the strongest

pheromone

21© Babaoglu 2004

How ants find food

First, they explore at random

22© Babaoglu 2004

How ants find food

Individual ants mark their path by emitting a chemical substance — a pheromone — as they forage for food

23© Babaoglu 2004

How ants find food

Ants smell pheromone and they tend to choose path with strong pheromone concentration

24© Babaoglu 2004

How ants find food

Other ants use the pheromone to find the food source

25© Babaoglu 2004

How ants find food

When the “system” is interrupted, ants are able to adapt by rapidly adopting second best solutions

26© Babaoglu 2004

How ants find food

Social insects, following simple, individual rules, accomplish complex colony activities that are adaptive, robust and self-organizing

27© Babaoglu 2004

From ants to agents

Reverse-engineering of ant colony mechanisms leads to “Ant Colony Optimization” Combinatorial optimization Adaptive routing (AntNet)

Multiple autonomous/concurrent agents (ants) Solution constructed as a sequential decision process Stochastic decision policy depending on pheromone Use of solution outcomes to iteratively update pheromone

28© Babaoglu 2004

AntHocNet

Ant-based datagram routing: Data are routed stochastically as datagrams according to

pheromone tables Pheromone entry: estimated quality of next hop choice for a

destination Ant agents sample paths and update pheromone tables

• Reactive route setup• Proactive maintenance and updating

For each source-destination, a bundle of datagram paths is available (backup, multi-path spreading)

29© Babaoglu 2004

Arena long edge

Delivery ratio

AntHocNet – Performance

Number of nodes

30© Babaoglu 2004

Views, overlay networks

The set of nodes that a peer knows about is called its view Typically, views are a (very) small subset of all nodes Views are typically dynamic since the set of nodes and the

“knows” relation are highly dynamic (churn) Views define an overlay network with dynamic topology on

top of the physical network

31© Babaoglu 2004

Overlay networks

Physical network“who has a communication link to whom”

32© Babaoglu 2004

Overlay networks

Logical network“who can communicate with whom”

33© Babaoglu 2004

Overlay networks

Overlay network (ring)“who knows whom”

34© Babaoglu 2004

Overlay networks

Overlay network (binary tree)“who knows whom”

35© Babaoglu 2004

Functions over overlay networks

Load balancing Distribute “load” over the nodes as evenly as possible

Content searching Identify nodes that hold copies of certain “content”

Collective computation Compute arbitrary functions over local values

Topology management Build and maintain overlay networks with desired topologies

Despite extreme size and dynamism (churn)

36© Babaoglu 2004

Load balancingBiological inspiration: Dictyostelium

Dictyostelium (slime mold) typically grows as separate, independent cells but interact to form a coordinated multicellular structure when induced by starvation

Physical aggregation transforms multiple, independent cells into a single, multicellular organism

The underlying mechanism is called chemotaxis: motion (taxis) along increasing gradients guided by diffusing chemical signals

37© Babaoglu 2004

Fruiting body

Grazing single cells

Life cycle of Dictyostelium

PhysicalAggregation

38© Babaoglu 2004

Life cycle of Dictyostelium

39© Babaoglu 2004

Load balancing

Inverted chemotaxis that provokes motion along decreasing gradients and thus “anti-aggregate” — load balance

Instance of a more general technique employing two mechanisms operating at different time scales: rapidly diffusing “signal” (chemical substance, information) slowly diffusing “matter” (cells, load)

40© Babaoglu 2004

Immune recognition is based on the complementary structure of the antibody (B-cell) receptors and a portion of the antigen called the epitope

Antigen attack Immune recognition antibody activation, proliferation antibody mutation (diversification)

Immune system basics

EpitopesB-cell receptors

Antigen

41© Babaoglu 2004

ImmuneSearch

Content (file) being searched for: “antigen” Queries: mobile “antibodies” Pattern recognition: affinity measure, similarity between

content and query Search mechanisms:

Proliferation: antibodies replicate themselves when they find a good match

Mutation: antibodies undergo random transformations Clustering: matching content is moved closer to the source

of the query by rewiring the overlay network

42© Babaoglu 2004

Performance of ImmuneSearch

Performance: (proliferation+clustering) > proliferation > random walk > flooding

43© Babaoglu 2004

Collective computation (aggregation)

Each node has a (numeric) local state Compute (global) aggregate function over the initial values

at all nodes The aggregate value to be known (locally) at each node Examples of aggregate functions:

Average Min-max Geometric mean Variance Network size

44© Babaoglu 2004

Biological inspiration:Viruses, epidemic spreading

Each node periodically selects another (random) peer and exchanges local state information

Each node updates its local state based on the information exchanged

System fully symmetric — all nodes act identically Communication is symmetric — “push-pull” epidemic Proactive Many uses in distributed systems

45© Babaoglu 2004

Aggregation through epidemics

Local value Sp contains current estimate of the aggregate Suppose the (random) peer picked by node p is q Nodes p and q exchange current estimates Update local estimates

average: Sp ← geometric mean: Sp ← maximum: Sp ← max(Sp, Sq)

Other, more complex functions built by combining elementary functions

(Sp+Sq)2 √

(SpSq)

46© Babaoglu 2004

Aggregation example: averaging

16

10

4

36

2

8

47© Babaoglu 2004

Aggregation example: averaging

16

6

4

36

6

8

48© Babaoglu 2004

Aggregation example: averaging

10

6

10

36

6

8

49© Babaoglu 2004

Properties of epidemic-based aggregation

In epidemic-based averaging, if the selected peer is a globally random sample, then the variance of the set of estimates decreases exponentially

Extreme robustness to node and link failure and node dynamism (churn)

50© Babaoglu 2004

Exponential convergence of averaging

0.0001

0.001

0.01

0.1

1

10

100

1000

10000

100000

0 5 10 15 20 25 30

Est

imat

ed A

vera

ge

Cycles

ExperimentsMaximumMinimum

0.990.995

11.005

1.011.015

1.021.025

22 24 26 28 30

51© Babaoglu 2004

Network size estimation

How to count the number of peers in an overlay network Tens of million potential peers Continual flux (churn) Not allowed to “freeze” system

Similar to conducting a census in a country the size of Italy without requiring people to stay at home on a given day

52© Babaoglu 2004

Network size estimation using averaging

0

1

0

0

0

0

53© Babaoglu 2004

Network size estimation using averaging

0

0.5

0

0

0

0.5

54© Babaoglu 2004

Network size estimation using averaging

0.25

0

0.25

0.25

0

0.25

55© Babaoglu 2004

Network size estimation using averaging

0.125

0.25

0.125

0.125

0.125

0.25

56© Babaoglu 2004

Network size estimation using averaging

0.166

0.166

0.166

0.166

0.166

0.166

1/0.166 = 6.02 ≈ N

57© Babaoglu 2004

Network size estimation with node crashes

50000

100000

150000

200000

250000

300000

350000

400000

450000

0 5 10 15 20

Est

imat

ed S

ize

Cycle

Experiments

50% of the nodes crash at a given cycle

58© Babaoglu 2004

Network size estimation under churn

80000

100000

120000

140000

160000

180000

200000

220000

240000

260000

0 500 1000 1500 2000 2500

Est

imat

ed S

ize

Nodes Substituted per Cycle

Experiments

59© Babaoglu 2004

Network size estimation with multiple aggregation instances

90000

95000

100000

105000

110000

115000

120000

125000

130000

0 5 10 15 20 25 30 35 40 45 50

Est

imat

ed S

ize

Number of Aggregation Instances

Experiments (Max)Experiments (Min)

1000 nodes crash at the beginning of each cycle

60© Babaoglu 2004

Network size estimation with multiple aggregation instances

0

50000

100000

150000

200000

250000

300000

0 5 10 15 20 25 30 35 40 45 50

Est

imat

ed S

ize

Number of Aggregation Instances

Experiments (Max)Experiments (Min)

20% of messages are lost

61© Babaoglu 2004

Topology management

How to ensure that the overlay network topology satisfies certain properties: has a desired structure (connected, random graph, ring,

torus, binary tree, etc.) maintains the desired structure in a dynamic setting (churn)

Problem to be solved is topology management Solution based on epidemics

62© Babaoglu 2004

Topology management: example

BDE

SX

W

A E

63© Babaoglu 2004

Topology management: example

BDE

SX

W

A E

selectPeer()

64© Babaoglu 2004

Topology management: example

A E

Exchange views

65© Babaoglu 2004

Topology management: example

A E

Both peers apply updateState thereby redefining topology

66© Babaoglu 2004

Epidemic-inspired topology management

Newscast Unstructured, almost-random topologies

T-man Wide range of structured topologies including small and

large diameter, clustered, sorted, etc. Both protocols

Extremely robust to node and link failure and node dynamism (churn)

Scalable

67© Babaoglu 2004

T-Man example: after 3 cycles

68© Babaoglu 2004

T-Man example: after 5 cycles

69© Babaoglu 2004

T-Man example: after 8 cycles

70© Babaoglu 2004

T-Man example: after 15 cycles

71© Babaoglu 2004

Summary

Biology is a rich source of inspiration for developing solutions with “nice properties” to technological problems

To date, we have looked at five biological systems with interesting behavior: Ants: path finding using pheromone, gathering Slime mold amoebae: physical aggregation as a response

to collective hunger, using chemotaxis Immune cells: search, recognition, and response to

antigens Viruses: epidemic spreading, collective computation