statistical complexity analysis dr. dmitry nerukh giorgos karvounis

18
STATISTICAL COMPLEXITY STATISTICAL COMPLEXITY ANALYSIS ANALYSIS • Dr. Dmitry Nerukh • Giorgos Karvounis

Upload: rodger-clarke

Post on 05-Jan-2016

219 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

STATISTICAL COMPLEXITY STATISTICAL COMPLEXITY ANALYSISANALYSIS

• Dr. Dmitry Nerukh• Giorgos Karvounis

Page 2: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

What is Complexity?• Many different definitions.

• A natural system, is converted into a formal system that our mind can manipulate and we have a model.

• “Complexity is the property of a real world system that is manifest in the inability of any one formalism being adequate to capture all its properties…”

Page 3: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

WHY USE COMPLEXITY ANALYSIS (1)

• When a new state of matter emerges from a phase transition,

• …certain “pattern formation” takes on the this “newness” with respect to other structures .

• This process is defined as “intrinsic” emergence.

• There is an increase in intrinsic computational capability which can be capitalised and measured.

Page 4: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

WHY USE COMPLEXITY ANALYSIS (2)

• Contemporary physics can measure order (e.g. temperature) or ideal randomness (e.g. entropy, thermodynamics).

• No tools to address problems of innovation, or the discovery of patterns

• Measuring the computational capabilities of the system is the only way to address such questions:

• …discovering and quantifying emergence, pattern, information process and memory in quantitative units.

• The term intrinsic computation defines the way the system stores information with respect to time, transmits it between internal degrees of freedom and makes use of it in order to produce future behaviour.

Page 5: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

METHODOLOGY (1) • Complexity estimates how sophisticated are the

dynamical laws governing the time evolution of the system.

• We adopted the approach by Crutchfield et. al. termed “computational mechanics”.

• We implement ideas from both Shannon entropy and KC algorithmic complexity theories, measuring the size of the informational description of the process.

• This is a direct approach to reveal the symmetries possessed by any spatial patterns and to estimate the “minimum amount of memory required to reproduce any configuration ensemble”.

Page 6: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

……BUT WE ARE MODELLERS…BUT WE ARE MODELLERS…

…HOW DO WE DEAL WITH MODELS…??HOW DO WE DEAL WITH MODELS…??

Page 7: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

METHODOLOGY (2)

• We can reconstruct an algorithmic machine (termed as “ε-machine”) that provides the means to build the statistically minimal “optimally predictive model ” .

• In order to build this machine, we need the smallest possible set of predictive states, the “causal states”.

• We can state that two predictive states are equivalent (~) if and only if they give rise to same future values in terms of conditional probabilities:

)|Pr()|Pr(: 21 pastfuturepastfuture StateCausal

Page 8: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

COMPUTATIONAL IMPLEMENTATION(1) • The algorithm is based on the use of symbolic dynamics generated

from symbols assigned to discrete time steps.

• The crucial part in the implementation of the methodology is converting a continuous real signal into a sequence of symbols i.e “ signal symbolization” of the molecular trajectory. The one –dimensional case is shown below:

s0

s1

s1

s1

s0

s2

s1

t4

t3

t2

t0

sig

na

l

time

t1

s0

s2

s2

s2

s2

s2

s2

s2

s2

s0

s0

s0

s0

s0

Page 9: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

CAUSAL STATE

Consider the following sequence:

bla.bla.bla.lab.lba.bla.bla.lab.bal.bla.alb.alb.bla.bla…

Bla . BlaPASTPAST FUTUREFUTURE

Alb . BlaPASTPAST FUTUREFUTURE

BlaAlb Bla CAUSALCAUSAL

statestate SScausalcausal

Page 10: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

E-machine

• An -machine, the set of causal states and the probabilities of the transitions between them, provides a direct description of the patterns present in the system’s internal degrees of freedom.

E x am ple o f -m ach ine reconstruction : A and B a re tw o cau sa l sta tes o f the system .

N um bers on the a rrow s show s the transition p robab ility b etw een th e sta tes.

Page 11: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

FINITE STATISTICAL COMPLEXITY

• Finite Statistical Complexity can be defined as “the minimum amount of evolutionary information (or hidden memory) required to statistically reproduce a process.”

•It expresses the informational size of the distribution of the causal states as measured by the Shannon Entropy:

).Pr(log)Pr( aSSC 2

•Statistical Complexity is based on the assumption that randomness is statistically simple: an ideal random process has zero statistical complexity. Equally, simple periodic processes give low complexity values as well.

•Complex process is the one that lies between these extremes and is an amalgam of predictable and stochastic mechanisms.

Page 12: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

STATISTICAL COMPLEXITY OF A

ZWITTERION: a folding event

• We measured the statistical complexity of the dynamical trajectories of four significant atoms within a zwitterion.

• Attain insights regarding complexity and how this can be a useful tool to characterise or capture the folding event.

• Depending on the temperature of the simulation, the zwitterion adopts a stable folded conformation.

• Statistical Complexity Analysis of various atoms’ trajectories at the unfolded configuration and compare their values at the folded state.

Page 13: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

COMPLEXITY ANALYSIS OF THE EXTENDED STATE

0 2000 4000 6000 8000 10000 12000

2

3

4

5

6

7

8

9

time (fs)

Dista

nce

(nm

)

Distance between N+andO-

UNFOLDED STATE

0 2000 4000 6000 8000 10000 120006.0

6.5

7.0

7.5

8.0

8.5

9.0

Com

plex

ity

time (fs)

N+

Cb/bone

O-

1

4000KUNFOLDEDSTATE

At the extended state, there is no significant change on the complexity value, as the zwitterion remains as an extended chain, following basically the same pattern throughout the process.

Page 14: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

0 2000 4000 6000 8000 10000 120006.0

6.5

7.0

7.5

8.0

8.5

9.0Com

plex

ity

time (fs)

N+

O-

Cb/bone

4600K

FOLDED STATE

0 2000 4000 6000 8000 10000 12000

2

3

4

5

6

7

8

9Distance between N+and O-

Dista

nce

(nm)

time (fs)

FOLDED STATE

COMPLEXITY ANALYSIS OF THE FOLDED STATE

•In the folding event, there is a considerable drop in the complexity value, assigned to the transitional stage.

• Afterwards, there is a sudden rise in the complexity, until all atoms reach the same value, assigned to the pattern of the folded state.

Page 15: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

COMPLEXITY ANALYSIS(1)

-4

-2

0

2

-2

0

2

4

2

4

Z

YX

Page 16: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

COMPLEXITY ANALYSIS (2)

• The essentiality of complexity measurements is that we can distinguish those patterns in quantitative terms.

• Better insight to the mechanisms that underlie the formation of this structure and separate the more “ordered” regularities to those that are more “random”:

Ra nd o m ne ss 10

Sta

t. C

om

ple

xity

T RA N S I T I O N A LS T A GE

FO LDEDSTAG E

Page 17: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

FUTURE WORK

• Further development of the algorithm in order to achieve a better representation of the -machine.

• Apply Statistical Complexity Analysis to a larger system such as protein folding and polymers’ phase transitions.

Page 18: STATISTICAL COMPLEXITY ANALYSIS Dr. Dmitry Nerukh Giorgos Karvounis

ACKNOWLEDGMENTS

For this work we are grateful to:

• Prof. R. Glen

• The Newton Trust and UNILEVER for their financial support.