thumbnails · 2017-11-04 · bpres. chapter 2 leads the reader in to the world of bpres. it...

30

Upload: others

Post on 02-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and
Cover
File Attachment
Thumbnailsjpg

Discrete Time Branching Processes in Random Environment

Branching Processes Branching Random Walks and Branching Particle Fields Set

coordinated by Elena Yarovaya

Volume 1

Discrete Time Branching Processes in Random

Environment

Goumltz Kersting Vladimir Vatutin

First published 2017 in Great Britain and the United States by ISTE Ltd and John Wiley amp Sons Inc

Apart from any fair dealing for the purposes of research or private study or criticism or review as permitted under the Copyright Designs and Patents Act 1988 this publication may only be reproduced stored or transmitted in any form or by any means with the prior permission in writing of the publishers or in the case of reprographic reproduction in accordance with the terms and licenses issued by the CLA Enquiries concerning reproduction outside these terms should be sent to the publishers at the undermentioned address

ISTE Ltd John Wiley amp Sons Inc 27-37 St Georgersquos Road 111 River Street London SW19 4EU Hoboken NJ 07030 UK USA

wwwistecouk wwwwileycom

copy ISTE Ltd 2017 The rights of Goumltz Kersting and Vladimir Vatutin to be identified as the authors of this work have been asserted by them in accordance with the Copyright Designs and Patents Act 1988

Library of Congress Control Number 2017949423 British Library Cataloguing-in-Publication Data A CIP record for this book is available from the British Library ISBN 978-1-78630-252-6

Contents

Preface ix

List of Notations xiii

Chapter 1 Branching Processes in Varying Environment 1

11 Introduction 1

12 Extinction probabilities 6

13 Almost sure convergence 12

14 Family trees 15

141 Construction of the Geiger tree 20

142 Construction of the size-biased tree T lowast 21

15 Notes 22

Chapter 2 Branching Processes in Random Environment 25

21 Introduction 25

22 Extinction probabilities 28

23 Exponential growth in the supercritical case 30

24 Three subcritical regimes 35

25 The strictly critical case 43

26 Notes 45

Chapter 3 Large Deviations for BPREs 47

31 Introduction 47

32 A tail estimate for branching processes in a

varying environment 50

33 Proof of Theorem 31 55

34 Notes 59

vi Discrete Time Branching Processes in Random Environment

Chapter 4 Properties of Random Walks 61

41 Introduction 61

42 Sparre-Andersen identities 62

43 Spitzer identity 69

44 Applications of Sparre-Andersen and Spitzer

identities 70

441 Properties of ladder epochs and ladder heights 70

442 Tail distributions of ladder epochs 75

443 Some renewal functions 76

444 Asymptotic properties of Ln and Mn 79

445 Arcsine law 83

446 Large deviations for random walks 84

45 Notes 90

Chapter 5 Critical BPREs the Annealed Approach 91

51 Introduction 91

52 Changes of measures 92

53 Properties of the prospective minima 99

54 Survival probability 107

55 Limit theorems for the critical case (annealed

approach) 116

56 Environment providing survival 122

57 Convergence of logZn 126

58 Notes 127

Chapter 6 Critical BPREs the Quenched Approach 133

61 Introduction 133

62 Changes of measures 134

63 Probability of survival 138

64 Yaglom limit theorems 142

641 The population size at non-random moments 143

642 The population size at moments nt 0 lt t lt 1 149

643 The number of particles at moment τ(n) le nt 150

644 The number of particles at moment τ(n) gt nt 157

65 Discrete limit distributions 159

66 Notes 167

Chapter 7 Weakly Subcritical BPREs 169

71 Introduction 169

72 The probability measures P+ and Pminus 172

73 Proof of theorems 180

731 Proof of Theorem 71 188

Contents vii

732 Proof of Theorem 72 190

733 Proof of Theorem 73 191

74 Notes 195

Chapter 8 Intermediate Subcritical BPREs 197

81 Introduction 197

82 Proof of Theorems 81 to 83 201

83 Further limit results 209

84 Conditioned family trees 212

85 Proof of Theorem 84 219

86 Notes 225

Chapter 9 Strongly Subcritical BPREs 227

91 Introduction 227

92 Survival probability and Yaglom-type limit theorems 229

93 Environments providing survival and dynamics of the

population size 234

931 Properties of the transition matrix P lowast 240

932 Proof of Theorem 92 243

933 Proof of Theorem 93 248

94 Notes 251

Chapter 10 Multi-type BPREs 257

101 Introduction 257

102 Supercritical MBPREs 264

103 The survival probability of subcritical and

critical MBPREs 264

104 Functional limit theorem in the critical case 266

105 Subcritical multi-type case 266

Appendix 269

Bibliography 275

Index 285

Preface

Branching processes constitute a fundamental part in the theory of stochastic

processes Very roughly speaking the theory of branching processes deals with the

issue of exponential growth or decay of random sequences or processes Its central

concept consists of a system or a population made up of particles or individuals

which independently produce descendants This is an extensive topic dating back to a

publication of F Galton and H W Watson in 1874 on the extinction of family names

and afterwards dividing into many subareas Correspondingly it is treated in a

number of monographs starting in 1963 with T E Harrisrsquo seminal The Theory ofBranching Processes and supported in the middle of the 1970s by Sevastyanovrsquos

Verzweigungsprozesse Athreya and Neyrsquos Branching Processes Jagersrsquo BranchingProcesses with Biological Applications and others The models considered in these

books mainly concern branching processes evolving in a constant environment

Important tools in proving limit theorems for such processes are generating

functions renewal type equations and functional limit theorems

However these monographs rarely touch the matter of branching processes in a

random environment (BPREs) These objects form not so much a subclass but rather

an extension of the area of branching processes In such models two types of

stochasticity are incorporated on the one hand demographic stochasticity resulting

from the reproduction of individuals and on the other hand environmental

stochasticity stemming from the changes in the conditions of reproduction along

time A central insight is that it is often the latter component that primarily

determines the behavior of these processes Thus the theory of BPREs gains its own

characteristic appearance and novel aspects appear such as a phase transition From a

technical point of view the study of such processes requires an extension of the range

of methods to be used in comparison with the methods common in the classical

theory of branching processes Other techniques should be attracted in particular

from the theory of random walks which plays an essential role in proving limit

theorems for BPREs

x Discrete Time Branching Processes in Random Environment

With this volume we have two purposes in mind First we have to assert that the

basics of the theory of BPREs are somewhat scattered in the literature (since the late

1960s) from which they are not at all easily accessible Thus we start by presenting

them in a unified manner In order to simplify matters we confine ourselves from

the beginning to the case where the environment varies in an iid fashion the model

going back to the now classical paper written by Smith and Wilkinson in 1969 We

also put together that material which is required from the topic of branching processes

in a varying environment Overall the proofs are now substantially simplified and

streamlined but at the same time some of the theorems could be better shaped

Second we would like to advance some scientific work on branching processes in

a random environment conditioned on survival which was conducted since around

2000 by a German-Russian group of scientists consisting of Valery Afanasyev

Christian Boumlinghoff Elena Dyakonova Jochen Geiger Goumltz Kersting Vladimir

Vatutin and Vitali Wachtel This research was generously supported by the German

Research Association DFG and the Russian Foundation for Basic Research RFBR In

this book we again do not aim to present our results in their most general setting yet

an ample amount of technical context cannot be avoided

We start the book by describing in Chapter 1 some properties of branching

processes in a varying environment (BPVEs) In particular we give a (short) proof of

the theorem describing the necessary and sufficient conditions for the dichotomy in

the asymptotic behavior of a BPVE such a process should either die or its population

size should tend to infinity with time Besides the construction of family trees

size-biased trees and Geigerrsquos tree representing conditioned family trees are

described here in detail These trees play an important role in studying subcritical

BPREs Chapter 2 leads the reader in to the world of BPREs It contains

classification of BPREs describes some properties of supercritical BPREs and gives

rough estimates for the growth rate of the survival probability for subcritical BPREs

Conclusions of Chapter 2 are supported by Chapter 3 where the asymptotic behavior

of the probabilities of large deviations for all types of BPREs is investigated

Properties of BPREs are closely related to the properties of the so-called associated

random walk (ARW) constituted by the logarithms of the expected population sizes

of particles of different generations This justifies the appearance of Chapter 4 that

includes some basic results on the theory of random walks and a couple of findings

concerning properties of random walks conditioned to stay non-negative or negative

and probabilities of large deviations for different types of random walks

Chapters 5 through 9 deal with various statements describing the asymptotic

behavior of the survival probability and Yaglom-type functional conditional limit

theorems for the critical and subcritical BPREs and analyzing properties of the ARW

Preface xi

providing survival of a BPRE for a long time Here the theory of random walks

conditioned to stay non-negative or negative demonstrates its beauty and power

Thus it is shown in Chapter 5 (under the annealed approach) that if a critical

BPRE survives up to a distant moment n then the minimum value of the ARW on

the interval [0 n] is attained at the beginning of the evolution of the BPRE and the

longtime behavior of the population size of such BPREs (conditioned on survival)

resembles the behavior of the ordinary supercritical GaltonndashWatson branching

processes If however a critical BPRE is considered under the quenched approach

(Chapter 6) then given the survival of the process for a long time the evolution of

the population size in the past has an oscillating character the periods when the

population size was very big were separated by intervals when the size of the

population was small

Chapters 7ndash9 are devoted to the weakly intermediately and strongly subcritical

BPREs investigated under the annealed approach To study properties of such

processes it is necessary to make changes in the initial measures based on the

properties of the ARWs The basic conclusion of Chapters 7ndash8 is the survival

probability of the weakly and intermediately subcritical BPREs up to a distant

moment n is proportional to the probability for the corresponding ARW to stay

non-negative within the time interval [0 n] Finally it is shown in Chapter 9 that

properties of strongly subcritical BPREs are in many respect similar to the

properties of the subcritical GaltonndashWatson branching processes In particular the

survival probability of such a process up to a distant moment n is proportional to the

expected number of particles in the process at this moment

We do not pretend that this book includes all interesting and important results

established up to now for BPREs In particular we do not treat here BPREs with

immigration and multitype BPREs The last direction of the theory of BPREs is a

very promising field of investigation that requires study of properties of Markov chains

generated by products of random matrices To attract the attention of future researchers

to this field we give a short survey of some recent results for multitype BPREs in

Chapter 10 The book is concluded by an Appendix that contains statements of results

used in the proofs of some theorems but not fitting the main line of the monograph

Goumltz KERSTING

Vladimir VATUTIN

August 2017

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 2: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Discrete Time Branching Processes in Random Environment

Branching Processes Branching Random Walks and Branching Particle Fields Set

coordinated by Elena Yarovaya

Volume 1

Discrete Time Branching Processes in Random

Environment

Goumltz Kersting Vladimir Vatutin

First published 2017 in Great Britain and the United States by ISTE Ltd and John Wiley amp Sons Inc

Apart from any fair dealing for the purposes of research or private study or criticism or review as permitted under the Copyright Designs and Patents Act 1988 this publication may only be reproduced stored or transmitted in any form or by any means with the prior permission in writing of the publishers or in the case of reprographic reproduction in accordance with the terms and licenses issued by the CLA Enquiries concerning reproduction outside these terms should be sent to the publishers at the undermentioned address

ISTE Ltd John Wiley amp Sons Inc 27-37 St Georgersquos Road 111 River Street London SW19 4EU Hoboken NJ 07030 UK USA

wwwistecouk wwwwileycom

copy ISTE Ltd 2017 The rights of Goumltz Kersting and Vladimir Vatutin to be identified as the authors of this work have been asserted by them in accordance with the Copyright Designs and Patents Act 1988

Library of Congress Control Number 2017949423 British Library Cataloguing-in-Publication Data A CIP record for this book is available from the British Library ISBN 978-1-78630-252-6

Contents

Preface ix

List of Notations xiii

Chapter 1 Branching Processes in Varying Environment 1

11 Introduction 1

12 Extinction probabilities 6

13 Almost sure convergence 12

14 Family trees 15

141 Construction of the Geiger tree 20

142 Construction of the size-biased tree T lowast 21

15 Notes 22

Chapter 2 Branching Processes in Random Environment 25

21 Introduction 25

22 Extinction probabilities 28

23 Exponential growth in the supercritical case 30

24 Three subcritical regimes 35

25 The strictly critical case 43

26 Notes 45

Chapter 3 Large Deviations for BPREs 47

31 Introduction 47

32 A tail estimate for branching processes in a

varying environment 50

33 Proof of Theorem 31 55

34 Notes 59

vi Discrete Time Branching Processes in Random Environment

Chapter 4 Properties of Random Walks 61

41 Introduction 61

42 Sparre-Andersen identities 62

43 Spitzer identity 69

44 Applications of Sparre-Andersen and Spitzer

identities 70

441 Properties of ladder epochs and ladder heights 70

442 Tail distributions of ladder epochs 75

443 Some renewal functions 76

444 Asymptotic properties of Ln and Mn 79

445 Arcsine law 83

446 Large deviations for random walks 84

45 Notes 90

Chapter 5 Critical BPREs the Annealed Approach 91

51 Introduction 91

52 Changes of measures 92

53 Properties of the prospective minima 99

54 Survival probability 107

55 Limit theorems for the critical case (annealed

approach) 116

56 Environment providing survival 122

57 Convergence of logZn 126

58 Notes 127

Chapter 6 Critical BPREs the Quenched Approach 133

61 Introduction 133

62 Changes of measures 134

63 Probability of survival 138

64 Yaglom limit theorems 142

641 The population size at non-random moments 143

642 The population size at moments nt 0 lt t lt 1 149

643 The number of particles at moment τ(n) le nt 150

644 The number of particles at moment τ(n) gt nt 157

65 Discrete limit distributions 159

66 Notes 167

Chapter 7 Weakly Subcritical BPREs 169

71 Introduction 169

72 The probability measures P+ and Pminus 172

73 Proof of theorems 180

731 Proof of Theorem 71 188

Contents vii

732 Proof of Theorem 72 190

733 Proof of Theorem 73 191

74 Notes 195

Chapter 8 Intermediate Subcritical BPREs 197

81 Introduction 197

82 Proof of Theorems 81 to 83 201

83 Further limit results 209

84 Conditioned family trees 212

85 Proof of Theorem 84 219

86 Notes 225

Chapter 9 Strongly Subcritical BPREs 227

91 Introduction 227

92 Survival probability and Yaglom-type limit theorems 229

93 Environments providing survival and dynamics of the

population size 234

931 Properties of the transition matrix P lowast 240

932 Proof of Theorem 92 243

933 Proof of Theorem 93 248

94 Notes 251

Chapter 10 Multi-type BPREs 257

101 Introduction 257

102 Supercritical MBPREs 264

103 The survival probability of subcritical and

critical MBPREs 264

104 Functional limit theorem in the critical case 266

105 Subcritical multi-type case 266

Appendix 269

Bibliography 275

Index 285

Preface

Branching processes constitute a fundamental part in the theory of stochastic

processes Very roughly speaking the theory of branching processes deals with the

issue of exponential growth or decay of random sequences or processes Its central

concept consists of a system or a population made up of particles or individuals

which independently produce descendants This is an extensive topic dating back to a

publication of F Galton and H W Watson in 1874 on the extinction of family names

and afterwards dividing into many subareas Correspondingly it is treated in a

number of monographs starting in 1963 with T E Harrisrsquo seminal The Theory ofBranching Processes and supported in the middle of the 1970s by Sevastyanovrsquos

Verzweigungsprozesse Athreya and Neyrsquos Branching Processes Jagersrsquo BranchingProcesses with Biological Applications and others The models considered in these

books mainly concern branching processes evolving in a constant environment

Important tools in proving limit theorems for such processes are generating

functions renewal type equations and functional limit theorems

However these monographs rarely touch the matter of branching processes in a

random environment (BPREs) These objects form not so much a subclass but rather

an extension of the area of branching processes In such models two types of

stochasticity are incorporated on the one hand demographic stochasticity resulting

from the reproduction of individuals and on the other hand environmental

stochasticity stemming from the changes in the conditions of reproduction along

time A central insight is that it is often the latter component that primarily

determines the behavior of these processes Thus the theory of BPREs gains its own

characteristic appearance and novel aspects appear such as a phase transition From a

technical point of view the study of such processes requires an extension of the range

of methods to be used in comparison with the methods common in the classical

theory of branching processes Other techniques should be attracted in particular

from the theory of random walks which plays an essential role in proving limit

theorems for BPREs

x Discrete Time Branching Processes in Random Environment

With this volume we have two purposes in mind First we have to assert that the

basics of the theory of BPREs are somewhat scattered in the literature (since the late

1960s) from which they are not at all easily accessible Thus we start by presenting

them in a unified manner In order to simplify matters we confine ourselves from

the beginning to the case where the environment varies in an iid fashion the model

going back to the now classical paper written by Smith and Wilkinson in 1969 We

also put together that material which is required from the topic of branching processes

in a varying environment Overall the proofs are now substantially simplified and

streamlined but at the same time some of the theorems could be better shaped

Second we would like to advance some scientific work on branching processes in

a random environment conditioned on survival which was conducted since around

2000 by a German-Russian group of scientists consisting of Valery Afanasyev

Christian Boumlinghoff Elena Dyakonova Jochen Geiger Goumltz Kersting Vladimir

Vatutin and Vitali Wachtel This research was generously supported by the German

Research Association DFG and the Russian Foundation for Basic Research RFBR In

this book we again do not aim to present our results in their most general setting yet

an ample amount of technical context cannot be avoided

We start the book by describing in Chapter 1 some properties of branching

processes in a varying environment (BPVEs) In particular we give a (short) proof of

the theorem describing the necessary and sufficient conditions for the dichotomy in

the asymptotic behavior of a BPVE such a process should either die or its population

size should tend to infinity with time Besides the construction of family trees

size-biased trees and Geigerrsquos tree representing conditioned family trees are

described here in detail These trees play an important role in studying subcritical

BPREs Chapter 2 leads the reader in to the world of BPREs It contains

classification of BPREs describes some properties of supercritical BPREs and gives

rough estimates for the growth rate of the survival probability for subcritical BPREs

Conclusions of Chapter 2 are supported by Chapter 3 where the asymptotic behavior

of the probabilities of large deviations for all types of BPREs is investigated

Properties of BPREs are closely related to the properties of the so-called associated

random walk (ARW) constituted by the logarithms of the expected population sizes

of particles of different generations This justifies the appearance of Chapter 4 that

includes some basic results on the theory of random walks and a couple of findings

concerning properties of random walks conditioned to stay non-negative or negative

and probabilities of large deviations for different types of random walks

Chapters 5 through 9 deal with various statements describing the asymptotic

behavior of the survival probability and Yaglom-type functional conditional limit

theorems for the critical and subcritical BPREs and analyzing properties of the ARW

Preface xi

providing survival of a BPRE for a long time Here the theory of random walks

conditioned to stay non-negative or negative demonstrates its beauty and power

Thus it is shown in Chapter 5 (under the annealed approach) that if a critical

BPRE survives up to a distant moment n then the minimum value of the ARW on

the interval [0 n] is attained at the beginning of the evolution of the BPRE and the

longtime behavior of the population size of such BPREs (conditioned on survival)

resembles the behavior of the ordinary supercritical GaltonndashWatson branching

processes If however a critical BPRE is considered under the quenched approach

(Chapter 6) then given the survival of the process for a long time the evolution of

the population size in the past has an oscillating character the periods when the

population size was very big were separated by intervals when the size of the

population was small

Chapters 7ndash9 are devoted to the weakly intermediately and strongly subcritical

BPREs investigated under the annealed approach To study properties of such

processes it is necessary to make changes in the initial measures based on the

properties of the ARWs The basic conclusion of Chapters 7ndash8 is the survival

probability of the weakly and intermediately subcritical BPREs up to a distant

moment n is proportional to the probability for the corresponding ARW to stay

non-negative within the time interval [0 n] Finally it is shown in Chapter 9 that

properties of strongly subcritical BPREs are in many respect similar to the

properties of the subcritical GaltonndashWatson branching processes In particular the

survival probability of such a process up to a distant moment n is proportional to the

expected number of particles in the process at this moment

We do not pretend that this book includes all interesting and important results

established up to now for BPREs In particular we do not treat here BPREs with

immigration and multitype BPREs The last direction of the theory of BPREs is a

very promising field of investigation that requires study of properties of Markov chains

generated by products of random matrices To attract the attention of future researchers

to this field we give a short survey of some recent results for multitype BPREs in

Chapter 10 The book is concluded by an Appendix that contains statements of results

used in the proofs of some theorems but not fitting the main line of the monograph

Goumltz KERSTING

Vladimir VATUTIN

August 2017

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 3: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Branching Processes Branching Random Walks and Branching Particle Fields Set

coordinated by Elena Yarovaya

Volume 1

Discrete Time Branching Processes in Random

Environment

Goumltz Kersting Vladimir Vatutin

First published 2017 in Great Britain and the United States by ISTE Ltd and John Wiley amp Sons Inc

Apart from any fair dealing for the purposes of research or private study or criticism or review as permitted under the Copyright Designs and Patents Act 1988 this publication may only be reproduced stored or transmitted in any form or by any means with the prior permission in writing of the publishers or in the case of reprographic reproduction in accordance with the terms and licenses issued by the CLA Enquiries concerning reproduction outside these terms should be sent to the publishers at the undermentioned address

ISTE Ltd John Wiley amp Sons Inc 27-37 St Georgersquos Road 111 River Street London SW19 4EU Hoboken NJ 07030 UK USA

wwwistecouk wwwwileycom

copy ISTE Ltd 2017 The rights of Goumltz Kersting and Vladimir Vatutin to be identified as the authors of this work have been asserted by them in accordance with the Copyright Designs and Patents Act 1988

Library of Congress Control Number 2017949423 British Library Cataloguing-in-Publication Data A CIP record for this book is available from the British Library ISBN 978-1-78630-252-6

Contents

Preface ix

List of Notations xiii

Chapter 1 Branching Processes in Varying Environment 1

11 Introduction 1

12 Extinction probabilities 6

13 Almost sure convergence 12

14 Family trees 15

141 Construction of the Geiger tree 20

142 Construction of the size-biased tree T lowast 21

15 Notes 22

Chapter 2 Branching Processes in Random Environment 25

21 Introduction 25

22 Extinction probabilities 28

23 Exponential growth in the supercritical case 30

24 Three subcritical regimes 35

25 The strictly critical case 43

26 Notes 45

Chapter 3 Large Deviations for BPREs 47

31 Introduction 47

32 A tail estimate for branching processes in a

varying environment 50

33 Proof of Theorem 31 55

34 Notes 59

vi Discrete Time Branching Processes in Random Environment

Chapter 4 Properties of Random Walks 61

41 Introduction 61

42 Sparre-Andersen identities 62

43 Spitzer identity 69

44 Applications of Sparre-Andersen and Spitzer

identities 70

441 Properties of ladder epochs and ladder heights 70

442 Tail distributions of ladder epochs 75

443 Some renewal functions 76

444 Asymptotic properties of Ln and Mn 79

445 Arcsine law 83

446 Large deviations for random walks 84

45 Notes 90

Chapter 5 Critical BPREs the Annealed Approach 91

51 Introduction 91

52 Changes of measures 92

53 Properties of the prospective minima 99

54 Survival probability 107

55 Limit theorems for the critical case (annealed

approach) 116

56 Environment providing survival 122

57 Convergence of logZn 126

58 Notes 127

Chapter 6 Critical BPREs the Quenched Approach 133

61 Introduction 133

62 Changes of measures 134

63 Probability of survival 138

64 Yaglom limit theorems 142

641 The population size at non-random moments 143

642 The population size at moments nt 0 lt t lt 1 149

643 The number of particles at moment τ(n) le nt 150

644 The number of particles at moment τ(n) gt nt 157

65 Discrete limit distributions 159

66 Notes 167

Chapter 7 Weakly Subcritical BPREs 169

71 Introduction 169

72 The probability measures P+ and Pminus 172

73 Proof of theorems 180

731 Proof of Theorem 71 188

Contents vii

732 Proof of Theorem 72 190

733 Proof of Theorem 73 191

74 Notes 195

Chapter 8 Intermediate Subcritical BPREs 197

81 Introduction 197

82 Proof of Theorems 81 to 83 201

83 Further limit results 209

84 Conditioned family trees 212

85 Proof of Theorem 84 219

86 Notes 225

Chapter 9 Strongly Subcritical BPREs 227

91 Introduction 227

92 Survival probability and Yaglom-type limit theorems 229

93 Environments providing survival and dynamics of the

population size 234

931 Properties of the transition matrix P lowast 240

932 Proof of Theorem 92 243

933 Proof of Theorem 93 248

94 Notes 251

Chapter 10 Multi-type BPREs 257

101 Introduction 257

102 Supercritical MBPREs 264

103 The survival probability of subcritical and

critical MBPREs 264

104 Functional limit theorem in the critical case 266

105 Subcritical multi-type case 266

Appendix 269

Bibliography 275

Index 285

Preface

Branching processes constitute a fundamental part in the theory of stochastic

processes Very roughly speaking the theory of branching processes deals with the

issue of exponential growth or decay of random sequences or processes Its central

concept consists of a system or a population made up of particles or individuals

which independently produce descendants This is an extensive topic dating back to a

publication of F Galton and H W Watson in 1874 on the extinction of family names

and afterwards dividing into many subareas Correspondingly it is treated in a

number of monographs starting in 1963 with T E Harrisrsquo seminal The Theory ofBranching Processes and supported in the middle of the 1970s by Sevastyanovrsquos

Verzweigungsprozesse Athreya and Neyrsquos Branching Processes Jagersrsquo BranchingProcesses with Biological Applications and others The models considered in these

books mainly concern branching processes evolving in a constant environment

Important tools in proving limit theorems for such processes are generating

functions renewal type equations and functional limit theorems

However these monographs rarely touch the matter of branching processes in a

random environment (BPREs) These objects form not so much a subclass but rather

an extension of the area of branching processes In such models two types of

stochasticity are incorporated on the one hand demographic stochasticity resulting

from the reproduction of individuals and on the other hand environmental

stochasticity stemming from the changes in the conditions of reproduction along

time A central insight is that it is often the latter component that primarily

determines the behavior of these processes Thus the theory of BPREs gains its own

characteristic appearance and novel aspects appear such as a phase transition From a

technical point of view the study of such processes requires an extension of the range

of methods to be used in comparison with the methods common in the classical

theory of branching processes Other techniques should be attracted in particular

from the theory of random walks which plays an essential role in proving limit

theorems for BPREs

x Discrete Time Branching Processes in Random Environment

With this volume we have two purposes in mind First we have to assert that the

basics of the theory of BPREs are somewhat scattered in the literature (since the late

1960s) from which they are not at all easily accessible Thus we start by presenting

them in a unified manner In order to simplify matters we confine ourselves from

the beginning to the case where the environment varies in an iid fashion the model

going back to the now classical paper written by Smith and Wilkinson in 1969 We

also put together that material which is required from the topic of branching processes

in a varying environment Overall the proofs are now substantially simplified and

streamlined but at the same time some of the theorems could be better shaped

Second we would like to advance some scientific work on branching processes in

a random environment conditioned on survival which was conducted since around

2000 by a German-Russian group of scientists consisting of Valery Afanasyev

Christian Boumlinghoff Elena Dyakonova Jochen Geiger Goumltz Kersting Vladimir

Vatutin and Vitali Wachtel This research was generously supported by the German

Research Association DFG and the Russian Foundation for Basic Research RFBR In

this book we again do not aim to present our results in their most general setting yet

an ample amount of technical context cannot be avoided

We start the book by describing in Chapter 1 some properties of branching

processes in a varying environment (BPVEs) In particular we give a (short) proof of

the theorem describing the necessary and sufficient conditions for the dichotomy in

the asymptotic behavior of a BPVE such a process should either die or its population

size should tend to infinity with time Besides the construction of family trees

size-biased trees and Geigerrsquos tree representing conditioned family trees are

described here in detail These trees play an important role in studying subcritical

BPREs Chapter 2 leads the reader in to the world of BPREs It contains

classification of BPREs describes some properties of supercritical BPREs and gives

rough estimates for the growth rate of the survival probability for subcritical BPREs

Conclusions of Chapter 2 are supported by Chapter 3 where the asymptotic behavior

of the probabilities of large deviations for all types of BPREs is investigated

Properties of BPREs are closely related to the properties of the so-called associated

random walk (ARW) constituted by the logarithms of the expected population sizes

of particles of different generations This justifies the appearance of Chapter 4 that

includes some basic results on the theory of random walks and a couple of findings

concerning properties of random walks conditioned to stay non-negative or negative

and probabilities of large deviations for different types of random walks

Chapters 5 through 9 deal with various statements describing the asymptotic

behavior of the survival probability and Yaglom-type functional conditional limit

theorems for the critical and subcritical BPREs and analyzing properties of the ARW

Preface xi

providing survival of a BPRE for a long time Here the theory of random walks

conditioned to stay non-negative or negative demonstrates its beauty and power

Thus it is shown in Chapter 5 (under the annealed approach) that if a critical

BPRE survives up to a distant moment n then the minimum value of the ARW on

the interval [0 n] is attained at the beginning of the evolution of the BPRE and the

longtime behavior of the population size of such BPREs (conditioned on survival)

resembles the behavior of the ordinary supercritical GaltonndashWatson branching

processes If however a critical BPRE is considered under the quenched approach

(Chapter 6) then given the survival of the process for a long time the evolution of

the population size in the past has an oscillating character the periods when the

population size was very big were separated by intervals when the size of the

population was small

Chapters 7ndash9 are devoted to the weakly intermediately and strongly subcritical

BPREs investigated under the annealed approach To study properties of such

processes it is necessary to make changes in the initial measures based on the

properties of the ARWs The basic conclusion of Chapters 7ndash8 is the survival

probability of the weakly and intermediately subcritical BPREs up to a distant

moment n is proportional to the probability for the corresponding ARW to stay

non-negative within the time interval [0 n] Finally it is shown in Chapter 9 that

properties of strongly subcritical BPREs are in many respect similar to the

properties of the subcritical GaltonndashWatson branching processes In particular the

survival probability of such a process up to a distant moment n is proportional to the

expected number of particles in the process at this moment

We do not pretend that this book includes all interesting and important results

established up to now for BPREs In particular we do not treat here BPREs with

immigration and multitype BPREs The last direction of the theory of BPREs is a

very promising field of investigation that requires study of properties of Markov chains

generated by products of random matrices To attract the attention of future researchers

to this field we give a short survey of some recent results for multitype BPREs in

Chapter 10 The book is concluded by an Appendix that contains statements of results

used in the proofs of some theorems but not fitting the main line of the monograph

Goumltz KERSTING

Vladimir VATUTIN

August 2017

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 4: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

First published 2017 in Great Britain and the United States by ISTE Ltd and John Wiley amp Sons Inc

Apart from any fair dealing for the purposes of research or private study or criticism or review as permitted under the Copyright Designs and Patents Act 1988 this publication may only be reproduced stored or transmitted in any form or by any means with the prior permission in writing of the publishers or in the case of reprographic reproduction in accordance with the terms and licenses issued by the CLA Enquiries concerning reproduction outside these terms should be sent to the publishers at the undermentioned address

ISTE Ltd John Wiley amp Sons Inc 27-37 St Georgersquos Road 111 River Street London SW19 4EU Hoboken NJ 07030 UK USA

wwwistecouk wwwwileycom

copy ISTE Ltd 2017 The rights of Goumltz Kersting and Vladimir Vatutin to be identified as the authors of this work have been asserted by them in accordance with the Copyright Designs and Patents Act 1988

Library of Congress Control Number 2017949423 British Library Cataloguing-in-Publication Data A CIP record for this book is available from the British Library ISBN 978-1-78630-252-6

Contents

Preface ix

List of Notations xiii

Chapter 1 Branching Processes in Varying Environment 1

11 Introduction 1

12 Extinction probabilities 6

13 Almost sure convergence 12

14 Family trees 15

141 Construction of the Geiger tree 20

142 Construction of the size-biased tree T lowast 21

15 Notes 22

Chapter 2 Branching Processes in Random Environment 25

21 Introduction 25

22 Extinction probabilities 28

23 Exponential growth in the supercritical case 30

24 Three subcritical regimes 35

25 The strictly critical case 43

26 Notes 45

Chapter 3 Large Deviations for BPREs 47

31 Introduction 47

32 A tail estimate for branching processes in a

varying environment 50

33 Proof of Theorem 31 55

34 Notes 59

vi Discrete Time Branching Processes in Random Environment

Chapter 4 Properties of Random Walks 61

41 Introduction 61

42 Sparre-Andersen identities 62

43 Spitzer identity 69

44 Applications of Sparre-Andersen and Spitzer

identities 70

441 Properties of ladder epochs and ladder heights 70

442 Tail distributions of ladder epochs 75

443 Some renewal functions 76

444 Asymptotic properties of Ln and Mn 79

445 Arcsine law 83

446 Large deviations for random walks 84

45 Notes 90

Chapter 5 Critical BPREs the Annealed Approach 91

51 Introduction 91

52 Changes of measures 92

53 Properties of the prospective minima 99

54 Survival probability 107

55 Limit theorems for the critical case (annealed

approach) 116

56 Environment providing survival 122

57 Convergence of logZn 126

58 Notes 127

Chapter 6 Critical BPREs the Quenched Approach 133

61 Introduction 133

62 Changes of measures 134

63 Probability of survival 138

64 Yaglom limit theorems 142

641 The population size at non-random moments 143

642 The population size at moments nt 0 lt t lt 1 149

643 The number of particles at moment τ(n) le nt 150

644 The number of particles at moment τ(n) gt nt 157

65 Discrete limit distributions 159

66 Notes 167

Chapter 7 Weakly Subcritical BPREs 169

71 Introduction 169

72 The probability measures P+ and Pminus 172

73 Proof of theorems 180

731 Proof of Theorem 71 188

Contents vii

732 Proof of Theorem 72 190

733 Proof of Theorem 73 191

74 Notes 195

Chapter 8 Intermediate Subcritical BPREs 197

81 Introduction 197

82 Proof of Theorems 81 to 83 201

83 Further limit results 209

84 Conditioned family trees 212

85 Proof of Theorem 84 219

86 Notes 225

Chapter 9 Strongly Subcritical BPREs 227

91 Introduction 227

92 Survival probability and Yaglom-type limit theorems 229

93 Environments providing survival and dynamics of the

population size 234

931 Properties of the transition matrix P lowast 240

932 Proof of Theorem 92 243

933 Proof of Theorem 93 248

94 Notes 251

Chapter 10 Multi-type BPREs 257

101 Introduction 257

102 Supercritical MBPREs 264

103 The survival probability of subcritical and

critical MBPREs 264

104 Functional limit theorem in the critical case 266

105 Subcritical multi-type case 266

Appendix 269

Bibliography 275

Index 285

Preface

Branching processes constitute a fundamental part in the theory of stochastic

processes Very roughly speaking the theory of branching processes deals with the

issue of exponential growth or decay of random sequences or processes Its central

concept consists of a system or a population made up of particles or individuals

which independently produce descendants This is an extensive topic dating back to a

publication of F Galton and H W Watson in 1874 on the extinction of family names

and afterwards dividing into many subareas Correspondingly it is treated in a

number of monographs starting in 1963 with T E Harrisrsquo seminal The Theory ofBranching Processes and supported in the middle of the 1970s by Sevastyanovrsquos

Verzweigungsprozesse Athreya and Neyrsquos Branching Processes Jagersrsquo BranchingProcesses with Biological Applications and others The models considered in these

books mainly concern branching processes evolving in a constant environment

Important tools in proving limit theorems for such processes are generating

functions renewal type equations and functional limit theorems

However these monographs rarely touch the matter of branching processes in a

random environment (BPREs) These objects form not so much a subclass but rather

an extension of the area of branching processes In such models two types of

stochasticity are incorporated on the one hand demographic stochasticity resulting

from the reproduction of individuals and on the other hand environmental

stochasticity stemming from the changes in the conditions of reproduction along

time A central insight is that it is often the latter component that primarily

determines the behavior of these processes Thus the theory of BPREs gains its own

characteristic appearance and novel aspects appear such as a phase transition From a

technical point of view the study of such processes requires an extension of the range

of methods to be used in comparison with the methods common in the classical

theory of branching processes Other techniques should be attracted in particular

from the theory of random walks which plays an essential role in proving limit

theorems for BPREs

x Discrete Time Branching Processes in Random Environment

With this volume we have two purposes in mind First we have to assert that the

basics of the theory of BPREs are somewhat scattered in the literature (since the late

1960s) from which they are not at all easily accessible Thus we start by presenting

them in a unified manner In order to simplify matters we confine ourselves from

the beginning to the case where the environment varies in an iid fashion the model

going back to the now classical paper written by Smith and Wilkinson in 1969 We

also put together that material which is required from the topic of branching processes

in a varying environment Overall the proofs are now substantially simplified and

streamlined but at the same time some of the theorems could be better shaped

Second we would like to advance some scientific work on branching processes in

a random environment conditioned on survival which was conducted since around

2000 by a German-Russian group of scientists consisting of Valery Afanasyev

Christian Boumlinghoff Elena Dyakonova Jochen Geiger Goumltz Kersting Vladimir

Vatutin and Vitali Wachtel This research was generously supported by the German

Research Association DFG and the Russian Foundation for Basic Research RFBR In

this book we again do not aim to present our results in their most general setting yet

an ample amount of technical context cannot be avoided

We start the book by describing in Chapter 1 some properties of branching

processes in a varying environment (BPVEs) In particular we give a (short) proof of

the theorem describing the necessary and sufficient conditions for the dichotomy in

the asymptotic behavior of a BPVE such a process should either die or its population

size should tend to infinity with time Besides the construction of family trees

size-biased trees and Geigerrsquos tree representing conditioned family trees are

described here in detail These trees play an important role in studying subcritical

BPREs Chapter 2 leads the reader in to the world of BPREs It contains

classification of BPREs describes some properties of supercritical BPREs and gives

rough estimates for the growth rate of the survival probability for subcritical BPREs

Conclusions of Chapter 2 are supported by Chapter 3 where the asymptotic behavior

of the probabilities of large deviations for all types of BPREs is investigated

Properties of BPREs are closely related to the properties of the so-called associated

random walk (ARW) constituted by the logarithms of the expected population sizes

of particles of different generations This justifies the appearance of Chapter 4 that

includes some basic results on the theory of random walks and a couple of findings

concerning properties of random walks conditioned to stay non-negative or negative

and probabilities of large deviations for different types of random walks

Chapters 5 through 9 deal with various statements describing the asymptotic

behavior of the survival probability and Yaglom-type functional conditional limit

theorems for the critical and subcritical BPREs and analyzing properties of the ARW

Preface xi

providing survival of a BPRE for a long time Here the theory of random walks

conditioned to stay non-negative or negative demonstrates its beauty and power

Thus it is shown in Chapter 5 (under the annealed approach) that if a critical

BPRE survives up to a distant moment n then the minimum value of the ARW on

the interval [0 n] is attained at the beginning of the evolution of the BPRE and the

longtime behavior of the population size of such BPREs (conditioned on survival)

resembles the behavior of the ordinary supercritical GaltonndashWatson branching

processes If however a critical BPRE is considered under the quenched approach

(Chapter 6) then given the survival of the process for a long time the evolution of

the population size in the past has an oscillating character the periods when the

population size was very big were separated by intervals when the size of the

population was small

Chapters 7ndash9 are devoted to the weakly intermediately and strongly subcritical

BPREs investigated under the annealed approach To study properties of such

processes it is necessary to make changes in the initial measures based on the

properties of the ARWs The basic conclusion of Chapters 7ndash8 is the survival

probability of the weakly and intermediately subcritical BPREs up to a distant

moment n is proportional to the probability for the corresponding ARW to stay

non-negative within the time interval [0 n] Finally it is shown in Chapter 9 that

properties of strongly subcritical BPREs are in many respect similar to the

properties of the subcritical GaltonndashWatson branching processes In particular the

survival probability of such a process up to a distant moment n is proportional to the

expected number of particles in the process at this moment

We do not pretend that this book includes all interesting and important results

established up to now for BPREs In particular we do not treat here BPREs with

immigration and multitype BPREs The last direction of the theory of BPREs is a

very promising field of investigation that requires study of properties of Markov chains

generated by products of random matrices To attract the attention of future researchers

to this field we give a short survey of some recent results for multitype BPREs in

Chapter 10 The book is concluded by an Appendix that contains statements of results

used in the proofs of some theorems but not fitting the main line of the monograph

Goumltz KERSTING

Vladimir VATUTIN

August 2017

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 5: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Contents

Preface ix

List of Notations xiii

Chapter 1 Branching Processes in Varying Environment 1

11 Introduction 1

12 Extinction probabilities 6

13 Almost sure convergence 12

14 Family trees 15

141 Construction of the Geiger tree 20

142 Construction of the size-biased tree T lowast 21

15 Notes 22

Chapter 2 Branching Processes in Random Environment 25

21 Introduction 25

22 Extinction probabilities 28

23 Exponential growth in the supercritical case 30

24 Three subcritical regimes 35

25 The strictly critical case 43

26 Notes 45

Chapter 3 Large Deviations for BPREs 47

31 Introduction 47

32 A tail estimate for branching processes in a

varying environment 50

33 Proof of Theorem 31 55

34 Notes 59

vi Discrete Time Branching Processes in Random Environment

Chapter 4 Properties of Random Walks 61

41 Introduction 61

42 Sparre-Andersen identities 62

43 Spitzer identity 69

44 Applications of Sparre-Andersen and Spitzer

identities 70

441 Properties of ladder epochs and ladder heights 70

442 Tail distributions of ladder epochs 75

443 Some renewal functions 76

444 Asymptotic properties of Ln and Mn 79

445 Arcsine law 83

446 Large deviations for random walks 84

45 Notes 90

Chapter 5 Critical BPREs the Annealed Approach 91

51 Introduction 91

52 Changes of measures 92

53 Properties of the prospective minima 99

54 Survival probability 107

55 Limit theorems for the critical case (annealed

approach) 116

56 Environment providing survival 122

57 Convergence of logZn 126

58 Notes 127

Chapter 6 Critical BPREs the Quenched Approach 133

61 Introduction 133

62 Changes of measures 134

63 Probability of survival 138

64 Yaglom limit theorems 142

641 The population size at non-random moments 143

642 The population size at moments nt 0 lt t lt 1 149

643 The number of particles at moment τ(n) le nt 150

644 The number of particles at moment τ(n) gt nt 157

65 Discrete limit distributions 159

66 Notes 167

Chapter 7 Weakly Subcritical BPREs 169

71 Introduction 169

72 The probability measures P+ and Pminus 172

73 Proof of theorems 180

731 Proof of Theorem 71 188

Contents vii

732 Proof of Theorem 72 190

733 Proof of Theorem 73 191

74 Notes 195

Chapter 8 Intermediate Subcritical BPREs 197

81 Introduction 197

82 Proof of Theorems 81 to 83 201

83 Further limit results 209

84 Conditioned family trees 212

85 Proof of Theorem 84 219

86 Notes 225

Chapter 9 Strongly Subcritical BPREs 227

91 Introduction 227

92 Survival probability and Yaglom-type limit theorems 229

93 Environments providing survival and dynamics of the

population size 234

931 Properties of the transition matrix P lowast 240

932 Proof of Theorem 92 243

933 Proof of Theorem 93 248

94 Notes 251

Chapter 10 Multi-type BPREs 257

101 Introduction 257

102 Supercritical MBPREs 264

103 The survival probability of subcritical and

critical MBPREs 264

104 Functional limit theorem in the critical case 266

105 Subcritical multi-type case 266

Appendix 269

Bibliography 275

Index 285

Preface

Branching processes constitute a fundamental part in the theory of stochastic

processes Very roughly speaking the theory of branching processes deals with the

issue of exponential growth or decay of random sequences or processes Its central

concept consists of a system or a population made up of particles or individuals

which independently produce descendants This is an extensive topic dating back to a

publication of F Galton and H W Watson in 1874 on the extinction of family names

and afterwards dividing into many subareas Correspondingly it is treated in a

number of monographs starting in 1963 with T E Harrisrsquo seminal The Theory ofBranching Processes and supported in the middle of the 1970s by Sevastyanovrsquos

Verzweigungsprozesse Athreya and Neyrsquos Branching Processes Jagersrsquo BranchingProcesses with Biological Applications and others The models considered in these

books mainly concern branching processes evolving in a constant environment

Important tools in proving limit theorems for such processes are generating

functions renewal type equations and functional limit theorems

However these monographs rarely touch the matter of branching processes in a

random environment (BPREs) These objects form not so much a subclass but rather

an extension of the area of branching processes In such models two types of

stochasticity are incorporated on the one hand demographic stochasticity resulting

from the reproduction of individuals and on the other hand environmental

stochasticity stemming from the changes in the conditions of reproduction along

time A central insight is that it is often the latter component that primarily

determines the behavior of these processes Thus the theory of BPREs gains its own

characteristic appearance and novel aspects appear such as a phase transition From a

technical point of view the study of such processes requires an extension of the range

of methods to be used in comparison with the methods common in the classical

theory of branching processes Other techniques should be attracted in particular

from the theory of random walks which plays an essential role in proving limit

theorems for BPREs

x Discrete Time Branching Processes in Random Environment

With this volume we have two purposes in mind First we have to assert that the

basics of the theory of BPREs are somewhat scattered in the literature (since the late

1960s) from which they are not at all easily accessible Thus we start by presenting

them in a unified manner In order to simplify matters we confine ourselves from

the beginning to the case where the environment varies in an iid fashion the model

going back to the now classical paper written by Smith and Wilkinson in 1969 We

also put together that material which is required from the topic of branching processes

in a varying environment Overall the proofs are now substantially simplified and

streamlined but at the same time some of the theorems could be better shaped

Second we would like to advance some scientific work on branching processes in

a random environment conditioned on survival which was conducted since around

2000 by a German-Russian group of scientists consisting of Valery Afanasyev

Christian Boumlinghoff Elena Dyakonova Jochen Geiger Goumltz Kersting Vladimir

Vatutin and Vitali Wachtel This research was generously supported by the German

Research Association DFG and the Russian Foundation for Basic Research RFBR In

this book we again do not aim to present our results in their most general setting yet

an ample amount of technical context cannot be avoided

We start the book by describing in Chapter 1 some properties of branching

processes in a varying environment (BPVEs) In particular we give a (short) proof of

the theorem describing the necessary and sufficient conditions for the dichotomy in

the asymptotic behavior of a BPVE such a process should either die or its population

size should tend to infinity with time Besides the construction of family trees

size-biased trees and Geigerrsquos tree representing conditioned family trees are

described here in detail These trees play an important role in studying subcritical

BPREs Chapter 2 leads the reader in to the world of BPREs It contains

classification of BPREs describes some properties of supercritical BPREs and gives

rough estimates for the growth rate of the survival probability for subcritical BPREs

Conclusions of Chapter 2 are supported by Chapter 3 where the asymptotic behavior

of the probabilities of large deviations for all types of BPREs is investigated

Properties of BPREs are closely related to the properties of the so-called associated

random walk (ARW) constituted by the logarithms of the expected population sizes

of particles of different generations This justifies the appearance of Chapter 4 that

includes some basic results on the theory of random walks and a couple of findings

concerning properties of random walks conditioned to stay non-negative or negative

and probabilities of large deviations for different types of random walks

Chapters 5 through 9 deal with various statements describing the asymptotic

behavior of the survival probability and Yaglom-type functional conditional limit

theorems for the critical and subcritical BPREs and analyzing properties of the ARW

Preface xi

providing survival of a BPRE for a long time Here the theory of random walks

conditioned to stay non-negative or negative demonstrates its beauty and power

Thus it is shown in Chapter 5 (under the annealed approach) that if a critical

BPRE survives up to a distant moment n then the minimum value of the ARW on

the interval [0 n] is attained at the beginning of the evolution of the BPRE and the

longtime behavior of the population size of such BPREs (conditioned on survival)

resembles the behavior of the ordinary supercritical GaltonndashWatson branching

processes If however a critical BPRE is considered under the quenched approach

(Chapter 6) then given the survival of the process for a long time the evolution of

the population size in the past has an oscillating character the periods when the

population size was very big were separated by intervals when the size of the

population was small

Chapters 7ndash9 are devoted to the weakly intermediately and strongly subcritical

BPREs investigated under the annealed approach To study properties of such

processes it is necessary to make changes in the initial measures based on the

properties of the ARWs The basic conclusion of Chapters 7ndash8 is the survival

probability of the weakly and intermediately subcritical BPREs up to a distant

moment n is proportional to the probability for the corresponding ARW to stay

non-negative within the time interval [0 n] Finally it is shown in Chapter 9 that

properties of strongly subcritical BPREs are in many respect similar to the

properties of the subcritical GaltonndashWatson branching processes In particular the

survival probability of such a process up to a distant moment n is proportional to the

expected number of particles in the process at this moment

We do not pretend that this book includes all interesting and important results

established up to now for BPREs In particular we do not treat here BPREs with

immigration and multitype BPREs The last direction of the theory of BPREs is a

very promising field of investigation that requires study of properties of Markov chains

generated by products of random matrices To attract the attention of future researchers

to this field we give a short survey of some recent results for multitype BPREs in

Chapter 10 The book is concluded by an Appendix that contains statements of results

used in the proofs of some theorems but not fitting the main line of the monograph

Goumltz KERSTING

Vladimir VATUTIN

August 2017

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 6: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

vi Discrete Time Branching Processes in Random Environment

Chapter 4 Properties of Random Walks 61

41 Introduction 61

42 Sparre-Andersen identities 62

43 Spitzer identity 69

44 Applications of Sparre-Andersen and Spitzer

identities 70

441 Properties of ladder epochs and ladder heights 70

442 Tail distributions of ladder epochs 75

443 Some renewal functions 76

444 Asymptotic properties of Ln and Mn 79

445 Arcsine law 83

446 Large deviations for random walks 84

45 Notes 90

Chapter 5 Critical BPREs the Annealed Approach 91

51 Introduction 91

52 Changes of measures 92

53 Properties of the prospective minima 99

54 Survival probability 107

55 Limit theorems for the critical case (annealed

approach) 116

56 Environment providing survival 122

57 Convergence of logZn 126

58 Notes 127

Chapter 6 Critical BPREs the Quenched Approach 133

61 Introduction 133

62 Changes of measures 134

63 Probability of survival 138

64 Yaglom limit theorems 142

641 The population size at non-random moments 143

642 The population size at moments nt 0 lt t lt 1 149

643 The number of particles at moment τ(n) le nt 150

644 The number of particles at moment τ(n) gt nt 157

65 Discrete limit distributions 159

66 Notes 167

Chapter 7 Weakly Subcritical BPREs 169

71 Introduction 169

72 The probability measures P+ and Pminus 172

73 Proof of theorems 180

731 Proof of Theorem 71 188

Contents vii

732 Proof of Theorem 72 190

733 Proof of Theorem 73 191

74 Notes 195

Chapter 8 Intermediate Subcritical BPREs 197

81 Introduction 197

82 Proof of Theorems 81 to 83 201

83 Further limit results 209

84 Conditioned family trees 212

85 Proof of Theorem 84 219

86 Notes 225

Chapter 9 Strongly Subcritical BPREs 227

91 Introduction 227

92 Survival probability and Yaglom-type limit theorems 229

93 Environments providing survival and dynamics of the

population size 234

931 Properties of the transition matrix P lowast 240

932 Proof of Theorem 92 243

933 Proof of Theorem 93 248

94 Notes 251

Chapter 10 Multi-type BPREs 257

101 Introduction 257

102 Supercritical MBPREs 264

103 The survival probability of subcritical and

critical MBPREs 264

104 Functional limit theorem in the critical case 266

105 Subcritical multi-type case 266

Appendix 269

Bibliography 275

Index 285

Preface

Branching processes constitute a fundamental part in the theory of stochastic

processes Very roughly speaking the theory of branching processes deals with the

issue of exponential growth or decay of random sequences or processes Its central

concept consists of a system or a population made up of particles or individuals

which independently produce descendants This is an extensive topic dating back to a

publication of F Galton and H W Watson in 1874 on the extinction of family names

and afterwards dividing into many subareas Correspondingly it is treated in a

number of monographs starting in 1963 with T E Harrisrsquo seminal The Theory ofBranching Processes and supported in the middle of the 1970s by Sevastyanovrsquos

Verzweigungsprozesse Athreya and Neyrsquos Branching Processes Jagersrsquo BranchingProcesses with Biological Applications and others The models considered in these

books mainly concern branching processes evolving in a constant environment

Important tools in proving limit theorems for such processes are generating

functions renewal type equations and functional limit theorems

However these monographs rarely touch the matter of branching processes in a

random environment (BPREs) These objects form not so much a subclass but rather

an extension of the area of branching processes In such models two types of

stochasticity are incorporated on the one hand demographic stochasticity resulting

from the reproduction of individuals and on the other hand environmental

stochasticity stemming from the changes in the conditions of reproduction along

time A central insight is that it is often the latter component that primarily

determines the behavior of these processes Thus the theory of BPREs gains its own

characteristic appearance and novel aspects appear such as a phase transition From a

technical point of view the study of such processes requires an extension of the range

of methods to be used in comparison with the methods common in the classical

theory of branching processes Other techniques should be attracted in particular

from the theory of random walks which plays an essential role in proving limit

theorems for BPREs

x Discrete Time Branching Processes in Random Environment

With this volume we have two purposes in mind First we have to assert that the

basics of the theory of BPREs are somewhat scattered in the literature (since the late

1960s) from which they are not at all easily accessible Thus we start by presenting

them in a unified manner In order to simplify matters we confine ourselves from

the beginning to the case where the environment varies in an iid fashion the model

going back to the now classical paper written by Smith and Wilkinson in 1969 We

also put together that material which is required from the topic of branching processes

in a varying environment Overall the proofs are now substantially simplified and

streamlined but at the same time some of the theorems could be better shaped

Second we would like to advance some scientific work on branching processes in

a random environment conditioned on survival which was conducted since around

2000 by a German-Russian group of scientists consisting of Valery Afanasyev

Christian Boumlinghoff Elena Dyakonova Jochen Geiger Goumltz Kersting Vladimir

Vatutin and Vitali Wachtel This research was generously supported by the German

Research Association DFG and the Russian Foundation for Basic Research RFBR In

this book we again do not aim to present our results in their most general setting yet

an ample amount of technical context cannot be avoided

We start the book by describing in Chapter 1 some properties of branching

processes in a varying environment (BPVEs) In particular we give a (short) proof of

the theorem describing the necessary and sufficient conditions for the dichotomy in

the asymptotic behavior of a BPVE such a process should either die or its population

size should tend to infinity with time Besides the construction of family trees

size-biased trees and Geigerrsquos tree representing conditioned family trees are

described here in detail These trees play an important role in studying subcritical

BPREs Chapter 2 leads the reader in to the world of BPREs It contains

classification of BPREs describes some properties of supercritical BPREs and gives

rough estimates for the growth rate of the survival probability for subcritical BPREs

Conclusions of Chapter 2 are supported by Chapter 3 where the asymptotic behavior

of the probabilities of large deviations for all types of BPREs is investigated

Properties of BPREs are closely related to the properties of the so-called associated

random walk (ARW) constituted by the logarithms of the expected population sizes

of particles of different generations This justifies the appearance of Chapter 4 that

includes some basic results on the theory of random walks and a couple of findings

concerning properties of random walks conditioned to stay non-negative or negative

and probabilities of large deviations for different types of random walks

Chapters 5 through 9 deal with various statements describing the asymptotic

behavior of the survival probability and Yaglom-type functional conditional limit

theorems for the critical and subcritical BPREs and analyzing properties of the ARW

Preface xi

providing survival of a BPRE for a long time Here the theory of random walks

conditioned to stay non-negative or negative demonstrates its beauty and power

Thus it is shown in Chapter 5 (under the annealed approach) that if a critical

BPRE survives up to a distant moment n then the minimum value of the ARW on

the interval [0 n] is attained at the beginning of the evolution of the BPRE and the

longtime behavior of the population size of such BPREs (conditioned on survival)

resembles the behavior of the ordinary supercritical GaltonndashWatson branching

processes If however a critical BPRE is considered under the quenched approach

(Chapter 6) then given the survival of the process for a long time the evolution of

the population size in the past has an oscillating character the periods when the

population size was very big were separated by intervals when the size of the

population was small

Chapters 7ndash9 are devoted to the weakly intermediately and strongly subcritical

BPREs investigated under the annealed approach To study properties of such

processes it is necessary to make changes in the initial measures based on the

properties of the ARWs The basic conclusion of Chapters 7ndash8 is the survival

probability of the weakly and intermediately subcritical BPREs up to a distant

moment n is proportional to the probability for the corresponding ARW to stay

non-negative within the time interval [0 n] Finally it is shown in Chapter 9 that

properties of strongly subcritical BPREs are in many respect similar to the

properties of the subcritical GaltonndashWatson branching processes In particular the

survival probability of such a process up to a distant moment n is proportional to the

expected number of particles in the process at this moment

We do not pretend that this book includes all interesting and important results

established up to now for BPREs In particular we do not treat here BPREs with

immigration and multitype BPREs The last direction of the theory of BPREs is a

very promising field of investigation that requires study of properties of Markov chains

generated by products of random matrices To attract the attention of future researchers

to this field we give a short survey of some recent results for multitype BPREs in

Chapter 10 The book is concluded by an Appendix that contains statements of results

used in the proofs of some theorems but not fitting the main line of the monograph

Goumltz KERSTING

Vladimir VATUTIN

August 2017

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 7: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Contents vii

732 Proof of Theorem 72 190

733 Proof of Theorem 73 191

74 Notes 195

Chapter 8 Intermediate Subcritical BPREs 197

81 Introduction 197

82 Proof of Theorems 81 to 83 201

83 Further limit results 209

84 Conditioned family trees 212

85 Proof of Theorem 84 219

86 Notes 225

Chapter 9 Strongly Subcritical BPREs 227

91 Introduction 227

92 Survival probability and Yaglom-type limit theorems 229

93 Environments providing survival and dynamics of the

population size 234

931 Properties of the transition matrix P lowast 240

932 Proof of Theorem 92 243

933 Proof of Theorem 93 248

94 Notes 251

Chapter 10 Multi-type BPREs 257

101 Introduction 257

102 Supercritical MBPREs 264

103 The survival probability of subcritical and

critical MBPREs 264

104 Functional limit theorem in the critical case 266

105 Subcritical multi-type case 266

Appendix 269

Bibliography 275

Index 285

Preface

Branching processes constitute a fundamental part in the theory of stochastic

processes Very roughly speaking the theory of branching processes deals with the

issue of exponential growth or decay of random sequences or processes Its central

concept consists of a system or a population made up of particles or individuals

which independently produce descendants This is an extensive topic dating back to a

publication of F Galton and H W Watson in 1874 on the extinction of family names

and afterwards dividing into many subareas Correspondingly it is treated in a

number of monographs starting in 1963 with T E Harrisrsquo seminal The Theory ofBranching Processes and supported in the middle of the 1970s by Sevastyanovrsquos

Verzweigungsprozesse Athreya and Neyrsquos Branching Processes Jagersrsquo BranchingProcesses with Biological Applications and others The models considered in these

books mainly concern branching processes evolving in a constant environment

Important tools in proving limit theorems for such processes are generating

functions renewal type equations and functional limit theorems

However these monographs rarely touch the matter of branching processes in a

random environment (BPREs) These objects form not so much a subclass but rather

an extension of the area of branching processes In such models two types of

stochasticity are incorporated on the one hand demographic stochasticity resulting

from the reproduction of individuals and on the other hand environmental

stochasticity stemming from the changes in the conditions of reproduction along

time A central insight is that it is often the latter component that primarily

determines the behavior of these processes Thus the theory of BPREs gains its own

characteristic appearance and novel aspects appear such as a phase transition From a

technical point of view the study of such processes requires an extension of the range

of methods to be used in comparison with the methods common in the classical

theory of branching processes Other techniques should be attracted in particular

from the theory of random walks which plays an essential role in proving limit

theorems for BPREs

x Discrete Time Branching Processes in Random Environment

With this volume we have two purposes in mind First we have to assert that the

basics of the theory of BPREs are somewhat scattered in the literature (since the late

1960s) from which they are not at all easily accessible Thus we start by presenting

them in a unified manner In order to simplify matters we confine ourselves from

the beginning to the case where the environment varies in an iid fashion the model

going back to the now classical paper written by Smith and Wilkinson in 1969 We

also put together that material which is required from the topic of branching processes

in a varying environment Overall the proofs are now substantially simplified and

streamlined but at the same time some of the theorems could be better shaped

Second we would like to advance some scientific work on branching processes in

a random environment conditioned on survival which was conducted since around

2000 by a German-Russian group of scientists consisting of Valery Afanasyev

Christian Boumlinghoff Elena Dyakonova Jochen Geiger Goumltz Kersting Vladimir

Vatutin and Vitali Wachtel This research was generously supported by the German

Research Association DFG and the Russian Foundation for Basic Research RFBR In

this book we again do not aim to present our results in their most general setting yet

an ample amount of technical context cannot be avoided

We start the book by describing in Chapter 1 some properties of branching

processes in a varying environment (BPVEs) In particular we give a (short) proof of

the theorem describing the necessary and sufficient conditions for the dichotomy in

the asymptotic behavior of a BPVE such a process should either die or its population

size should tend to infinity with time Besides the construction of family trees

size-biased trees and Geigerrsquos tree representing conditioned family trees are

described here in detail These trees play an important role in studying subcritical

BPREs Chapter 2 leads the reader in to the world of BPREs It contains

classification of BPREs describes some properties of supercritical BPREs and gives

rough estimates for the growth rate of the survival probability for subcritical BPREs

Conclusions of Chapter 2 are supported by Chapter 3 where the asymptotic behavior

of the probabilities of large deviations for all types of BPREs is investigated

Properties of BPREs are closely related to the properties of the so-called associated

random walk (ARW) constituted by the logarithms of the expected population sizes

of particles of different generations This justifies the appearance of Chapter 4 that

includes some basic results on the theory of random walks and a couple of findings

concerning properties of random walks conditioned to stay non-negative or negative

and probabilities of large deviations for different types of random walks

Chapters 5 through 9 deal with various statements describing the asymptotic

behavior of the survival probability and Yaglom-type functional conditional limit

theorems for the critical and subcritical BPREs and analyzing properties of the ARW

Preface xi

providing survival of a BPRE for a long time Here the theory of random walks

conditioned to stay non-negative or negative demonstrates its beauty and power

Thus it is shown in Chapter 5 (under the annealed approach) that if a critical

BPRE survives up to a distant moment n then the minimum value of the ARW on

the interval [0 n] is attained at the beginning of the evolution of the BPRE and the

longtime behavior of the population size of such BPREs (conditioned on survival)

resembles the behavior of the ordinary supercritical GaltonndashWatson branching

processes If however a critical BPRE is considered under the quenched approach

(Chapter 6) then given the survival of the process for a long time the evolution of

the population size in the past has an oscillating character the periods when the

population size was very big were separated by intervals when the size of the

population was small

Chapters 7ndash9 are devoted to the weakly intermediately and strongly subcritical

BPREs investigated under the annealed approach To study properties of such

processes it is necessary to make changes in the initial measures based on the

properties of the ARWs The basic conclusion of Chapters 7ndash8 is the survival

probability of the weakly and intermediately subcritical BPREs up to a distant

moment n is proportional to the probability for the corresponding ARW to stay

non-negative within the time interval [0 n] Finally it is shown in Chapter 9 that

properties of strongly subcritical BPREs are in many respect similar to the

properties of the subcritical GaltonndashWatson branching processes In particular the

survival probability of such a process up to a distant moment n is proportional to the

expected number of particles in the process at this moment

We do not pretend that this book includes all interesting and important results

established up to now for BPREs In particular we do not treat here BPREs with

immigration and multitype BPREs The last direction of the theory of BPREs is a

very promising field of investigation that requires study of properties of Markov chains

generated by products of random matrices To attract the attention of future researchers

to this field we give a short survey of some recent results for multitype BPREs in

Chapter 10 The book is concluded by an Appendix that contains statements of results

used in the proofs of some theorems but not fitting the main line of the monograph

Goumltz KERSTING

Vladimir VATUTIN

August 2017

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 8: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Preface

Branching processes constitute a fundamental part in the theory of stochastic

processes Very roughly speaking the theory of branching processes deals with the

issue of exponential growth or decay of random sequences or processes Its central

concept consists of a system or a population made up of particles or individuals

which independently produce descendants This is an extensive topic dating back to a

publication of F Galton and H W Watson in 1874 on the extinction of family names

and afterwards dividing into many subareas Correspondingly it is treated in a

number of monographs starting in 1963 with T E Harrisrsquo seminal The Theory ofBranching Processes and supported in the middle of the 1970s by Sevastyanovrsquos

Verzweigungsprozesse Athreya and Neyrsquos Branching Processes Jagersrsquo BranchingProcesses with Biological Applications and others The models considered in these

books mainly concern branching processes evolving in a constant environment

Important tools in proving limit theorems for such processes are generating

functions renewal type equations and functional limit theorems

However these monographs rarely touch the matter of branching processes in a

random environment (BPREs) These objects form not so much a subclass but rather

an extension of the area of branching processes In such models two types of

stochasticity are incorporated on the one hand demographic stochasticity resulting

from the reproduction of individuals and on the other hand environmental

stochasticity stemming from the changes in the conditions of reproduction along

time A central insight is that it is often the latter component that primarily

determines the behavior of these processes Thus the theory of BPREs gains its own

characteristic appearance and novel aspects appear such as a phase transition From a

technical point of view the study of such processes requires an extension of the range

of methods to be used in comparison with the methods common in the classical

theory of branching processes Other techniques should be attracted in particular

from the theory of random walks which plays an essential role in proving limit

theorems for BPREs

x Discrete Time Branching Processes in Random Environment

With this volume we have two purposes in mind First we have to assert that the

basics of the theory of BPREs are somewhat scattered in the literature (since the late

1960s) from which they are not at all easily accessible Thus we start by presenting

them in a unified manner In order to simplify matters we confine ourselves from

the beginning to the case where the environment varies in an iid fashion the model

going back to the now classical paper written by Smith and Wilkinson in 1969 We

also put together that material which is required from the topic of branching processes

in a varying environment Overall the proofs are now substantially simplified and

streamlined but at the same time some of the theorems could be better shaped

Second we would like to advance some scientific work on branching processes in

a random environment conditioned on survival which was conducted since around

2000 by a German-Russian group of scientists consisting of Valery Afanasyev

Christian Boumlinghoff Elena Dyakonova Jochen Geiger Goumltz Kersting Vladimir

Vatutin and Vitali Wachtel This research was generously supported by the German

Research Association DFG and the Russian Foundation for Basic Research RFBR In

this book we again do not aim to present our results in their most general setting yet

an ample amount of technical context cannot be avoided

We start the book by describing in Chapter 1 some properties of branching

processes in a varying environment (BPVEs) In particular we give a (short) proof of

the theorem describing the necessary and sufficient conditions for the dichotomy in

the asymptotic behavior of a BPVE such a process should either die or its population

size should tend to infinity with time Besides the construction of family trees

size-biased trees and Geigerrsquos tree representing conditioned family trees are

described here in detail These trees play an important role in studying subcritical

BPREs Chapter 2 leads the reader in to the world of BPREs It contains

classification of BPREs describes some properties of supercritical BPREs and gives

rough estimates for the growth rate of the survival probability for subcritical BPREs

Conclusions of Chapter 2 are supported by Chapter 3 where the asymptotic behavior

of the probabilities of large deviations for all types of BPREs is investigated

Properties of BPREs are closely related to the properties of the so-called associated

random walk (ARW) constituted by the logarithms of the expected population sizes

of particles of different generations This justifies the appearance of Chapter 4 that

includes some basic results on the theory of random walks and a couple of findings

concerning properties of random walks conditioned to stay non-negative or negative

and probabilities of large deviations for different types of random walks

Chapters 5 through 9 deal with various statements describing the asymptotic

behavior of the survival probability and Yaglom-type functional conditional limit

theorems for the critical and subcritical BPREs and analyzing properties of the ARW

Preface xi

providing survival of a BPRE for a long time Here the theory of random walks

conditioned to stay non-negative or negative demonstrates its beauty and power

Thus it is shown in Chapter 5 (under the annealed approach) that if a critical

BPRE survives up to a distant moment n then the minimum value of the ARW on

the interval [0 n] is attained at the beginning of the evolution of the BPRE and the

longtime behavior of the population size of such BPREs (conditioned on survival)

resembles the behavior of the ordinary supercritical GaltonndashWatson branching

processes If however a critical BPRE is considered under the quenched approach

(Chapter 6) then given the survival of the process for a long time the evolution of

the population size in the past has an oscillating character the periods when the

population size was very big were separated by intervals when the size of the

population was small

Chapters 7ndash9 are devoted to the weakly intermediately and strongly subcritical

BPREs investigated under the annealed approach To study properties of such

processes it is necessary to make changes in the initial measures based on the

properties of the ARWs The basic conclusion of Chapters 7ndash8 is the survival

probability of the weakly and intermediately subcritical BPREs up to a distant

moment n is proportional to the probability for the corresponding ARW to stay

non-negative within the time interval [0 n] Finally it is shown in Chapter 9 that

properties of strongly subcritical BPREs are in many respect similar to the

properties of the subcritical GaltonndashWatson branching processes In particular the

survival probability of such a process up to a distant moment n is proportional to the

expected number of particles in the process at this moment

We do not pretend that this book includes all interesting and important results

established up to now for BPREs In particular we do not treat here BPREs with

immigration and multitype BPREs The last direction of the theory of BPREs is a

very promising field of investigation that requires study of properties of Markov chains

generated by products of random matrices To attract the attention of future researchers

to this field we give a short survey of some recent results for multitype BPREs in

Chapter 10 The book is concluded by an Appendix that contains statements of results

used in the proofs of some theorems but not fitting the main line of the monograph

Goumltz KERSTING

Vladimir VATUTIN

August 2017

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 9: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

x Discrete Time Branching Processes in Random Environment

With this volume we have two purposes in mind First we have to assert that the

basics of the theory of BPREs are somewhat scattered in the literature (since the late

1960s) from which they are not at all easily accessible Thus we start by presenting

them in a unified manner In order to simplify matters we confine ourselves from

the beginning to the case where the environment varies in an iid fashion the model

going back to the now classical paper written by Smith and Wilkinson in 1969 We

also put together that material which is required from the topic of branching processes

in a varying environment Overall the proofs are now substantially simplified and

streamlined but at the same time some of the theorems could be better shaped

Second we would like to advance some scientific work on branching processes in

a random environment conditioned on survival which was conducted since around

2000 by a German-Russian group of scientists consisting of Valery Afanasyev

Christian Boumlinghoff Elena Dyakonova Jochen Geiger Goumltz Kersting Vladimir

Vatutin and Vitali Wachtel This research was generously supported by the German

Research Association DFG and the Russian Foundation for Basic Research RFBR In

this book we again do not aim to present our results in their most general setting yet

an ample amount of technical context cannot be avoided

We start the book by describing in Chapter 1 some properties of branching

processes in a varying environment (BPVEs) In particular we give a (short) proof of

the theorem describing the necessary and sufficient conditions for the dichotomy in

the asymptotic behavior of a BPVE such a process should either die or its population

size should tend to infinity with time Besides the construction of family trees

size-biased trees and Geigerrsquos tree representing conditioned family trees are

described here in detail These trees play an important role in studying subcritical

BPREs Chapter 2 leads the reader in to the world of BPREs It contains

classification of BPREs describes some properties of supercritical BPREs and gives

rough estimates for the growth rate of the survival probability for subcritical BPREs

Conclusions of Chapter 2 are supported by Chapter 3 where the asymptotic behavior

of the probabilities of large deviations for all types of BPREs is investigated

Properties of BPREs are closely related to the properties of the so-called associated

random walk (ARW) constituted by the logarithms of the expected population sizes

of particles of different generations This justifies the appearance of Chapter 4 that

includes some basic results on the theory of random walks and a couple of findings

concerning properties of random walks conditioned to stay non-negative or negative

and probabilities of large deviations for different types of random walks

Chapters 5 through 9 deal with various statements describing the asymptotic

behavior of the survival probability and Yaglom-type functional conditional limit

theorems for the critical and subcritical BPREs and analyzing properties of the ARW

Preface xi

providing survival of a BPRE for a long time Here the theory of random walks

conditioned to stay non-negative or negative demonstrates its beauty and power

Thus it is shown in Chapter 5 (under the annealed approach) that if a critical

BPRE survives up to a distant moment n then the minimum value of the ARW on

the interval [0 n] is attained at the beginning of the evolution of the BPRE and the

longtime behavior of the population size of such BPREs (conditioned on survival)

resembles the behavior of the ordinary supercritical GaltonndashWatson branching

processes If however a critical BPRE is considered under the quenched approach

(Chapter 6) then given the survival of the process for a long time the evolution of

the population size in the past has an oscillating character the periods when the

population size was very big were separated by intervals when the size of the

population was small

Chapters 7ndash9 are devoted to the weakly intermediately and strongly subcritical

BPREs investigated under the annealed approach To study properties of such

processes it is necessary to make changes in the initial measures based on the

properties of the ARWs The basic conclusion of Chapters 7ndash8 is the survival

probability of the weakly and intermediately subcritical BPREs up to a distant

moment n is proportional to the probability for the corresponding ARW to stay

non-negative within the time interval [0 n] Finally it is shown in Chapter 9 that

properties of strongly subcritical BPREs are in many respect similar to the

properties of the subcritical GaltonndashWatson branching processes In particular the

survival probability of such a process up to a distant moment n is proportional to the

expected number of particles in the process at this moment

We do not pretend that this book includes all interesting and important results

established up to now for BPREs In particular we do not treat here BPREs with

immigration and multitype BPREs The last direction of the theory of BPREs is a

very promising field of investigation that requires study of properties of Markov chains

generated by products of random matrices To attract the attention of future researchers

to this field we give a short survey of some recent results for multitype BPREs in

Chapter 10 The book is concluded by an Appendix that contains statements of results

used in the proofs of some theorems but not fitting the main line of the monograph

Goumltz KERSTING

Vladimir VATUTIN

August 2017

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 10: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Preface xi

providing survival of a BPRE for a long time Here the theory of random walks

conditioned to stay non-negative or negative demonstrates its beauty and power

Thus it is shown in Chapter 5 (under the annealed approach) that if a critical

BPRE survives up to a distant moment n then the minimum value of the ARW on

the interval [0 n] is attained at the beginning of the evolution of the BPRE and the

longtime behavior of the population size of such BPREs (conditioned on survival)

resembles the behavior of the ordinary supercritical GaltonndashWatson branching

processes If however a critical BPRE is considered under the quenched approach

(Chapter 6) then given the survival of the process for a long time the evolution of

the population size in the past has an oscillating character the periods when the

population size was very big were separated by intervals when the size of the

population was small

Chapters 7ndash9 are devoted to the weakly intermediately and strongly subcritical

BPREs investigated under the annealed approach To study properties of such

processes it is necessary to make changes in the initial measures based on the

properties of the ARWs The basic conclusion of Chapters 7ndash8 is the survival

probability of the weakly and intermediately subcritical BPREs up to a distant

moment n is proportional to the probability for the corresponding ARW to stay

non-negative within the time interval [0 n] Finally it is shown in Chapter 9 that

properties of strongly subcritical BPREs are in many respect similar to the

properties of the subcritical GaltonndashWatson branching processes In particular the

survival probability of such a process up to a distant moment n is proportional to the

expected number of particles in the process at this moment

We do not pretend that this book includes all interesting and important results

established up to now for BPREs In particular we do not treat here BPREs with

immigration and multitype BPREs The last direction of the theory of BPREs is a

very promising field of investigation that requires study of properties of Markov chains

generated by products of random matrices To attract the attention of future researchers

to this field we give a short survey of some recent results for multitype BPREs in

Chapter 10 The book is concluded by an Appendix that contains statements of results

used in the proofs of some theorems but not fitting the main line of the monograph

Goumltz KERSTING

Vladimir VATUTIN

August 2017

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 11: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

List of Notations

P(N0) set of all probability measures f on N0 =0 1 2

f [z] f(s) weights and generating function of the measuref 2

f the mean of the measure f 2f the normalized second factorial moment 2flowast size-biased measure 21fmn = fm+1 middot middot middot fn convolutions of probability measures 3fmn = fm middot middot middot fn+1 convolutions of probability measures 107κ(a f) =(f )minus2

suminfiny=a y

2f [y]truncated second moment 102

θ the moment of extinction of a branching process6 28

T lowast size-biased tree 21V random environment 25X = logF logarithm of expected population size 27κ(λ) = logE[eλX ] cumulant generating function 36γ strict descending ladder epoch 61γprime weak descending ladder epoch 61Γ strict ascending ladder epoch 61Γprime weak ascending ladder epoch 62Ln = min(S0 S1 Sn) 38Mn = max(S1 Sn) 76τ(n) = min0 le k le n Sk = Ln

the left-most moment when the minimum value ofthe random walk on the interval [0 n] is attained70

P probability measure given the environment 3 26E expectation given the environment 3 26

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 12: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

xiv Discrete Time Branching Processes in Random Environment

P probability measure obtained by averaging withrespect to the environment 25

E expectation taken after averaging with respect tothe environment 25

P+ Pminus change of measure 93

Pplusmn change of measure 134

P change of measure 170 198P+ Pminus change of measure 172

Plowast change of measure 234

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 13: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

1

Branching Processes inVarying Environment

11 Introduction

Branching processes are a fundamental object in probability theory They serve as

models for the reproduction of particles or individuals within a collective or a

population Here we act on the assumption that the population evolves within clearly

distinguishable generations which allows us to examine the population at the

founding generation n = 0 and the subsequent generations n = 1 2 To begin

with we focus on the sequence of population sizes Zn at generation n n ge 0 Later

we shall study whole family trees

Various kinds of randomness can be incorporated into such branching models

For this monograph we have two such types in mind On the one hand we take

randomness in reproduction into account Here a main assumption is that different

individuals give birth independently and that their offspring distributions coincide

within each generation On the other hand we consider environmental stochasticity

This means that these offspring distributions may change at random from one

generation to the next A fundamental question concerns which one of the two

random components will dominate and determine primarily the modelrsquos long-term

behavior We shall get to know the considerable influence of environmental

fluctuations

This first chapter is of a preliminary nature Here we look at branching models

with reduced randomness We allow that the offspring distributions vary among the

generations but as a start in a deterministic fashion So to speak we consider the above

model conditioned by its environment

Discrete Time Branching Processes in Random Environment First EditionGoumltz Kersting and Vladimir Vatutincopy ISTE Ltd 2017 Published by ISTE Ltd and John Wiley amp Sons Inc

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 14: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

2 Discrete Time Branching Processes in Random Environment

We begin with introducing some notation Let P(N0) be the space of all probability

measures on the natural numbers N0 = 0 1 2 For f isin P(N0) we denote its

weights by f [z] z = 0 1 We also define

f(s) =infinsumz=0

f [z]sz 0 le s le 1

The resulting function on the interval [0 1] is the generating function of the

measure f Thus we take the liberty here to denote the measure and its generating

function by one and the same symbol f This is not just as probability measures and

generating functions uniquely determine each other but operations on probability

measures are often most conveniently expressed by means of their generating

functions Therefore for two probability measures f1 and f2 the expressions f1f2 or

f1 f2 do not only stand for the product or composition of their generating functions

but also stand for the respective operations with the associated probability measures

(in the first case it is the convolution of f1 and f2) Similarly the derivative f prime of the

function f may be considered as well as the measure with weights

f prime[z] = (z + 1)f [z + 1] (which in general is no longer a probability measure) This

slight abuse of notation will cause no confusions but on the contrary will facilitate

presentation Recall that the mean and the normalized second factorial moment

f =infinsumz=1

zf [z] and f =1

f2

infinsumz=2

z(z minus 1)f [z]

can be obtained from the generating functions as

f = f prime(1) f =f primeprime(1)f prime(1)2

NOTEndash Any operation we shall apply to probability measures (and more generally to

finite measures) on N0 has to be understood as an operation applied to their generating

functions

We are now ready for first notions Let (ΩF P) be the underlying probability

space

DEFINITION 11ndash A sequence v = (f1 f2 ) of probability measures on N0 is

called a varying environment

DEFINITION 12ndash Let v = (fn n ge 1) be a varying environment Then a stochastic

process Z = Zn n isin N0 with values in N0 is called a branching process withenvironment v if for any integers z ge 0 n ge 1

P(Zn = z | Z0 Znminus1) = (fZnminus1n )[z] P-as

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 15: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Branching Processes in Varying Environment 3

On the right-hand side we have the Znminus1th power of fn In particular Zn = 0P-as on the event that Znminus1 = 0 If we want to emphasize that probabilities P(middot) are

determined on the basis of the varying environment v we use the notation Pv(middot)

In probabilistic terms the definition says for n ge 1 that given Z0 Znminus1 the

random variable Zn may be realized as the sum of iid random variables Yin

i = 1 Znminus1 with distribution fn

Zn =

Znminus1sumi=1

Yin

This corresponds to the following conception of the process Z Zn is the number

of individuals of some population in generation n where all individuals reproduce

independently of each other and of Z0 and where fn is the distribution of the number

Yn of offspring of an individual in generation nminus 1 The distribution of Z0 which is

the initial distribution of the population may be arbitrary Mostly we choose it to be

Z0 = 1

EXAMPLE 11ndash A branching process with the constant environment f = f1 = f2 =middot middot middot is called a GaltonndashWatson process with offspring distribution f

The distribution of Zn is conveniently expressed via composing generating

functions For probability measures f1 fn on N0 and for natural numbers

0 le m lt n we introduce the probability measures

fmn = fm+1 middot middot middot fn [11]

Moreover let fnn be the Dirac measure δ1

PROPOSITION 11ndash Let Z be a branching process with initial size Z0 = 1 as andvarying environment (fn n ge 1) Then for n ge 0 the distribution of Zn is equal tothe measure f0n

PROOFndash Induction on n

Usually it is not straightforward to evaluate f0n explicitly The following example

contains an exceptional case of particular interest

EXAMPLE 12ndash LINEAR FRACTIONAL DISTRIBUTIONS A probability measure fon N0 is said to be of the linear fractional type if there are real numbers p a with

0 lt p lt 1 and 0 le a le 1 such that

f [z] = apqzminus1 for z = 0

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 16: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

4 Discrete Time Branching Processes in Random Environment

with q = 1minus p For a gt 0 this implies

f [0] = 1minus a f =a

p f =

2q

a

We shall see that it is convenient to use the parameters f and f instead of a and

p Special cases are for a = 1 the geometric distribution g with success probability

p and for a = 0 the Dirac measure δ0 at point 0 In fact f is a mixture of both ie

f = ag + (1 minus a)δ0 A random variable Z with values in N0 has a linear fractional

distribution if

P(Z = z | Z ge 1) = pqzminus1 for z ge 1

that is if its conditional distribution given Z ge 1 is geometric with success

probability p Then

P(Z ge 1) = a =( 1

f+

f

2

)minus1

For the generating function we find

f(s) = 1minus a1minus s

1minus qs 0 le s le 1

(leading to the naming of the linear fractional) It is convenient to convert it for f gt 0into

1

1minus f(s)=

1

f middot (1minus s)+

f

2 0 le s lt 1 [12]

Note that this identity uniquely characterizes the linear fractional measure f with

mean f and normalized second factorial moment f

The last equation now allows us to determine the composition f0n of linear

fractional probability measures fk with parameters fk fk 1 le k le n From

f0n = f1 f1n

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+

f12

Iterating this formula we obtain (with f1 middot middot middot fkminus1 = 1 for k = 1)

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

1

2

nsumk=1

fkf1 middot middot middot fkminus1

[13]

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 17: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Branching Processes in Varying Environment 5

It implies that the measure f0n itself is of the linear fractional type with a mean

and normalized second factorial moment

f0n = f1 middot middot middot fn f0n =nsum

k=1

fkf1 middot middot middot fkminus1

This property of perpetuation is specific for probability measures of the linear

fractional type

For further investigations we now rule out some cases of less significance

ASSUMPTION V1ndash The varying environment (f1 f2 ) fulfills 0 lt fn lt infin for alln ge 1

Note that in the case of fn = 0 the population will as be completely wiped out

in generation n

From Proposition 11 we obtain formulas for moments of Zn in a standard manner

Taking derivatives by means of Leibnizrsquos rule and induction we have for 0 le m lt n

f primemn(s) =

nprodk=m+1

f primek(fkn(s))

and f primenn(s) = 1 In addition using the product rule we obtain after some

rearrangements

f primeprimemn(s) = f prime

mn(s)2

nsumk=m+1

f primeprimek (fkn(s))

f primek(fkn(s))

2prodkminus1

j=m+1 fprimej(fjn(s))

[14]

and f primeprimenn(s) = 0 Evaluating these equations for m = 0 and s = 1 we get the

following formulas for means and normalized second factorial moments of Zn which

we had already come across in the case of linear fractional distributions (now the

second factorial moments may well take the value infin)

PROPOSITION 12ndash For a branching process Z with initial size Z0 = 1 as andenvironment (f1 f2 ) fulfilling V 1 we have

E[Zn] = f1 middot middot middot fnE[Zn(Zn minus 1)]

E[Zn]2=

nsumk=1

fkf1 middot middot middot fkminus1

[15]

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 18: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

6 Discrete Time Branching Processes in Random Environment

We note that these equations entail the similarly built formula

Var[Zn]

E[Zn]2=

nsumk=1

ρkf1 middot middot middot fkminus1

[16]

set up for the standardized variances

ρk =1

f2k

infinsumz=0

(z minus fk)2fk[z]

of the probability measures fk Indeed

nsumk=1

ρkf1 middot middot middot fkminus1

=

nsumk=1

f primeprimek (1) + f prime

k(1)minus f primek(1)

2

f1 middot middot middot fkminus1 middot f2k

=nsum

k=1

f primeprimek (1)

f1 middot middot middot fkminus1 middot f2k

+nsum

k=1

( 1

f1 middot middot middot fkminus 1

f1 middot middot middot fkminus1

)

=nsum

k=1

fkf1 middot middot middot fkminus1

+1

f1 middot middot middot fnminus 1

=E[Zn(Zn minus 1)]

E[Zn]2+

1

E[Zn]minus 1 =

Var[Zn]

E[Zn]2

12 Extinction probabilities

For a branching process Z let

θ = minn ge 1 Zn = 0

be the moment when the population dies out Then P(θ le n) = P(Zn = 0) and the

probability that the population becomes ultimately extinct is equal to

q = P(θ lt infin) = limnrarrinfinP(Zn = 0)

In this section we would like to characterize as extinction For a first criterion

we use the Markov inequality

P(θ gt n) = P(Zn ge 1) le E[Zn]

and the fact that P(Zn ge 1) is decreasing in n We obtain

lim infnrarrinfin f1 middot middot middot fn = 0 rArr q = 1 [17]

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 19: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Branching Processes in Varying Environment 7

On the other hand the PaleyndashZygmund inequality tells us that

P(θ gt n) = P(Zn gt 0) ge E[Zn]2

E[Z2n]

=E[Zn]

2

E[Zn] +E[Zn(Zn minus 1)]

which in combination with equation [16] yields the bound

1

P(θ gt n)le E[Z2

n]

E[Zn]2= 1 +

Var[Zn]

E[Zn|2= 1 +

nsumk=1

ρkf1 middot middot middot fkminus1

[18]

Thus the question arises as to which one of both bounds captures the size of

P(θ gt n) more adequately It turns out that under a mild extra assumption it is

the PaleyndashZygmund bound

ASSUMPTION V2ndash For the varying environment (f1 f2 ) there exists a constantc lt infin such that for all n ge 1

E[Yn(Yn minus 1)] le cE[Yn] middotE[Yn minus 1 | Yn gt 0]

where the random variables Y1 Y2 have the distributions f1 f2

This uniformity assumption is typically satisfied as illustrated by the following

examples

EXAMPLE 13ndash Assumption V 2 is fulfilled in the following cases

i) The Yn have arbitrary Poisson-distributions

ii) The Yn have arbitrary linear fractional distributions

iii) There is a constant c lt infin such that Yn le c as for all n

For the proof of (iii) rewrite V 2 as

E[Yn(Yn minus 1)] le cE[Yn | Yn gt 0] middotE[(Yn minus 1)+]

and observe that E[Yn | Yn gt 0] ge 1

Here comes the main result of this section

THEOREM 11ndash Let the branching process Z in a varying environment fulfillAssumption V 1 Then the conditions

i) q = 1

ii) E[Zn]2∣∣= o

(E[Z2

n])

as n rarr infin

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 20: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

8 Discrete Time Branching Processes in Random Environment

iii)infinsumk=1

ρkf1 middot middot middot fkminus1

= infin

are equivalent

Condition (ii) can be equivalently expressed as E[Zn] = o(radic

Var[Zn]) Thus

shortly speaking under Assumption V 2 we have as extinction whenever the random

fluctuations dominate the mean behavior of the process in the long run

For the proof we introduce a method of handling the measures f0n which will be

useful elsewhere too It mimics the calculation we got to know for linear fractional

distributions For a probability measure f isin P(N0) with mean 0 lt f lt infin we define

the function

ϕf (s) =1

1minus f(s)minus 1

f middot (1minus s) 0 le s lt 1

We also set

ϕf (1) = limsrarr1

ϕf (s) =f primeprime(1)2f prime(1)2

=f

2 [19]

where the limit arises by means of the Taylor expansion

f(s) = 1 + f prime(1)(sminus 1) +1

2f primeprime(t)(sminus 1)2 with some t isin (s 1)

From the convexity of the function f(s) we get that ϕf (s) ge 0 for all 0 le s le 1

Then for probability measures f1 fn with positive finite means we obtain

1

1minus f0n(s)=

1

f1 middot (1minus f1n(s))+ ϕf1(f1n(s))

Iterating the formula and having in mind the conventions fnn(s) = s and

f1 middot middot middot fkminus1 = 1 for k = 1 we arrive at the following expansion

PROPOSITION 13ndash For probability measures f1 fn with positive finite meansf1 fn we have

1

1minus f0n(s)=

1

f1 middot middot middot fn middot (1minus s)+

nsumk=1

ϕfk(fkn(s))

f1 middot middot middot fkminus1 0 le s lt 1

As seen from [12] the functions ϕfk are constant for linear fractional probability

measures In general we have the following sharp estimates

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 21: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Branching Processes in Varying Environment 9

PROPOSITION 14ndash Let f isin P(N0) with mean 0 lt f lt infin Then it follows for0 le s le 1

1

2ϕf (0) le ϕf (s) le 2ϕf (1) [110]

Note that ϕf is identical to zero if f [z] = 0 for all z ge 2 Otherwise ϕf (0) gt 0

and the lower bound of ϕf becomes strictly positive Choosing s = 1 and s = 0in [110] we obtain ϕf (0)2 le ϕf (1) and ϕf (0) le 2ϕf (1) Note that for f = δk(Dirac-measure at point k) and k ge 2 we have ϕf (1) = ϕf (0)2 implying that the

constants 12 and 2 in [110] cannot be improved

PROOFndash i) We prepare the proof by showing for g1 g2 isin P(N0) the following

statement If g1 and g2 have the same support and if for any k ge 0 with g1[k] gt 0

we have

g1[z]

g1[k]le g2[z]

g2[k]for all z gt k

then g1 le g2 Indeed for g1[k] gt 0

sumzgek g1[z]

1minussumzgek g1[z]

=

sumzgek g1[z]g1[k]sumzltk g1[z]g1[k]

lesum

zgek g2[z]g2[k]sumzltk g2[z]g2[k]

=

sumzgek g2[z]

1minussumzgek g2[z]

and consequently

sumzgek

g1[z] lesumzgek

g2[z]

It follows that this inequality holds for all k ge 0 since vanishing summands on

the left-hand side may be removed Summing the inequality over k ge 0 we arrive at

the claim

For a special case consider for 0 le s le 1 and r isin N0 the probability measures

gs[z] =srminusz

1 + s+ middot middot middot+ sr 0 le z le r

Then for 0 lt s le t k le r z gt k we have gs[z]gs[k] ge gt[z]gt[k] We

therefore obtain that

gs =srminus1 + 2srminus2 + middot middot middot+ r

1 + s+ middot middot middot+ sr

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 22: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

10 Discrete Time Branching Processes in Random Environment

is a decreasing function in s Moreover g0 = r and g1 = r2 and it follows for

0 le s le 1

r

2le r + (r minus 1)s+ middot middot middot+ srminus1

1 + s+ middot middot middot+ srle r [111]

ii) Next we derive a second representation for ϕ = ϕf We have

1minus f(s) =

infinsumz=1

f [z](1minus sz) = (1minus s)

infinsumz=1

f [z]

zminus1sumk=0

sk

and

f prime(1)(1minus s)minus (1minus f(s)) = (1minus s)infinsumz=1

f [z]zminus1sumk=0

(1minus sk)

= (1minus s)2infinsumz=1

f [z]zminus1sumk=1

kminus1sumj=0

sj

= (1minus s)2infinsumz=1

f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

Therefore

ϕ(s) =f prime(1)(1minus s)minus (1minus f(s))

f prime(1)(1minus s)(1minus f(s))

=

suminfinz=1 f [z]((z minus 1) + (z minus 2)s+ middot middot middot+ szminus2)

f middotsuminfink=1 f [k](1 + s+ middot middot middot+ skminus1)

From [111] it follows

ϕ(s) le ψ(s)

fle 2ϕ(s) [112]

with

ψ(s) =

suminfinz=1 f [z](z minus 1)(1 + s+ middot middot middot+ szminus1)suminfin

k=1 f [k](1 + s+ middot middot middot+ skminus1)

Now consider the probability measures gs isin P(N0) 0 le s le 1 given by

gs[z] =f [z + 1](1 + s+ middot middot middot+ sz)suminfink=0 f [k + 1](1 + s+ middot middot middot+ sk)

z ge 0

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 23: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Branching Processes in Varying Environment 11

Then for f [k + 1] gt 0 and z gt k after some algebra

gs[z]

gs[k]=

f [z + 1]

f [k + 1]

zminuskprodv=1

(1 +

1

sminus1 + middot middot middot+ sminuskminusv

)

which is an increasing function in s Therefore

ψ(s) = gs

is increasing in s In combination with [112] we get

ϕ(s) le ψ(s)

fle ψ(1)

fle 2ϕ(1) 2ϕ(s) ge ψ(s)

fge ψ(0)

fge ϕ(0)

This gives the claim of the proposition

PROOF (Proof of Theorem 11)ndash If q = 1 then P(θ gt n) rarr 0 as n rarr infin Therefore

the implications (i) rArr (ii) and (ii) rArr (iii) follow from formula [18] For the remaining

part of the proof note that P(θ gt n) = P(Zn = 0) = 1minusf0n(0) such that it follows

from Proposition 13

1

P(θ gt n)=

1

f1 middot middot middot fn+

nsumk=1

ϕfk(fkn(0))

f1 middot middot middot fkminus1 [113]

Moreover we observe that V 2 reads

fn(fn)2 le cfn middot fn minus (1minus fn[0])

1minus fn[0]

which can be converted to ϕfn(1) le cϕfn(0) Then Proposition 14 together with

ϕfn(1) = fn2 yields for 0 le s le 1

fn4c

le ϕfn(s)

Together with [113] and Proposition 12 this implies with d = max(1 4c)

1

P(θ gt n)ge 1

f1 middot middot middot fn+

1

4c

nsumk=1

fkf1 middot middot middot fkminus1

ge 1

d

E[Zn(Zn minus 1)] +E[Zn]

E[Zn]2=

1

d

(Var[Zn]

E[Zn]2+ 1

)

Now the implication (iii) rArr (i) follows from equation [16]

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 24: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

12 Discrete Time Branching Processes in Random Environment

13 Almost sure convergence

There are a few supermartingales which allow convergence considerations for

branching processes Z in a varying environment Under Assumption V 1 an obvious

choice is the process W = Wn n ge 0 given by

Wn =Zn

f1 middot middot middot fn n isin N0

which is easily seen to be a non-negative martingale Therefore there is an integrable

random variable W ge 0 such that

Zn

f1 middot middot middot fnrarr W as as n rarr infin

THEOREM 13ndash For a branching process Z with Z0 = 1 and in a varyingenvironment fulfilling the assumptions V 1 and V 2 we have

i) If q = 1 then W = 0 as

ii) If q lt 1 then E[W ] = 1

PROOFndash The first claim is obvious For the second one we observe that q lt 1 in view

of Theorem 11 implies

infinsumk=1

ρkf1 middot middot middot fkminus1

lt infin

From [16] it follows that

supn

E[Z2n]

E[Zn]2= sup

n

Var[Zn]

E[Zn]2+ 1 lt infin

Therefore W is a square-integrable martingale implying E[W ] = E[W0] = 1

The next theorem on the as convergence of the unscaled process is remarkable

also in that it requires no assumptions at all We name it the ChurchndashLindvall

theorem Among others it clarifies as to which condition is needed for Z with a

positive probability to stick forever in some state z ge 1 In its proof we shall

encounter a finer construction of a supermartingale

THEOREM 14ndash For a branching process Z = Zn n ge 0 in a varying environmentthere exists a random variable Zinfin with values in N0 cup infin such that as n rarr infin

Zn rarr Zinfin as

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 25: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

Branching Processes in Varying Environment 13

Moreover

P(Zinfin = 0 or infin) = 1 hArrinfinsum

n=1

(1minus fn[1]) = infin

PROOFndash i) We prepare the proof by showing that the sequence of probability

measures f0n is vaguely converging to a (possibly defective) measure g on N0 Note

that f0n[0] rarr q Thus either f0n rarr qδ0 vaguely (with the Dirac measure δ0 at

point 0) or else (by the HellyndashBray theorem) there exists a sequence of integers

0 = n0 lt n1 lt n2 lt middot middot middot such that as k rarr infin we have f0nkrarr g vaguely with

g = qδ0

In the latter case the limiting generating function g(s) is strictly increasing in s

and f0nk(s) rarr g(s) for all 0 le s lt 1 Then given n isin N0 we define ln =

nkmn = nk+1 with nk le n lt nk+1 thus ln le n lt mn We want to show

that flnn converges vaguely to δ1 For this purpose we consider a subsequence nprime

such that both flnprime nprime and fnprimemnprime converge vaguely to measures h1 and h2 Going in

f0mnprime = f0lnprime flnprime nprime fnprimemnprime to the limit we obtain

g(s) = g(h1(h2(s))) 0 le s lt 1

Since g is strictly increasing h1(h2(s)) = s which for generating functions

implies h1(s) = h2(s) = s Thus using the common sub-sub-sequence argument

flnn rarr δ1 as n rarr infin It follows that as n rarr infin

f0n(s) = f0ln(flnn(s)) rarr g(s) 0 le s lt 1

which means f0n rarr g vaguely as has been claimed

ii) We now turn to the proof of the first statement The case g(s) = 1 for all

0 le s lt 1 is obvious then g = δ0 and q = 1 and Zn is as convergent to 0 Thus

we are left with the case g(s) lt 1 for all s lt 1 Then there is a decreasing sequence

(bn n ge 0) of real numbers such that f0n(12) le bn le 1 and bn darr g(12) We

define the sequence (an n ge 0) using the following equation

f0n(an) = bn

Therefore 12 le an le 1 and we also have f0n+1(an+1) le f0n(an) or

equivalently fn+1(an+1) le an Then the process U = Un n ge 0 given by

Un = aZnn middot IZn gt 0

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0

Page 26: Thumbnails · 2017-11-04 · BPREs. Chapter 2 leads the reader in to the world of BPREs. It contains classification of BPREs, describes some properties of supercritical BPREs and

14 Discrete Time Branching Processes in Random Environment

is a non-negative supermartingale Indeed because of fn+1(0)Zn ge IZn = 0 and

fn+1(an+1) le an we have

E[Un+1 | Z0 Zn] = fn+1(an+1)Zn minus fn+1(0)

Zn le aZnn minus IZn = 0 = Un as

Thus Un is as convergent to a random variable U ge 0

Now we distinguish two cases Either g = qδ0 Then g(s) is strictly increasing

which implies an rarr 12 as n rarr infin Hence the as convergence of Un enforces the

as convergence of Zn with possible limit infin

Or g = qδ0 Then g(12) = q implying that for n rarr infin

E[Un] = f0n(an)minus f0n(0) = bn minusP(Zn = 0) rarr g(12)minus q = 0

and consequently U = 0 as implying Un rarr 0 as Since an ge 12 for all n this

enforces that Zn converges as to 0 or infin In both cases Zn rarr Zinfin as for some

random variable Zinfin

iii) For the second statement we use the representation Zn =sumZnminus1

i=1 Yin Define

the events Azn = sumzi=1 Yin = z Then for z ge 1

P(Azn) ge 3minusz(1minus fn[1])

Indeed if fn[1] ge 13 then

P(Azn) ge P(Y1n = 1 Y2n = middot middot middot = Yzn = 1)

ge (1minus fn[1])fn[1]z ge 3minusz(1minus fn[1])

and if fn[1] le 13 then either P(Yin gt 1) ge 13 or P(Yin = 0) ge 13 implying

P(Azn) ge P(min(Y1n Yzn) gt 1) +P(Y1n = middot middot middot = Yzn = 0)

ge 3minusz(1minus fn[1])

Now assumesuminfin

n=1(1 minus fn[1]) = infin As for fixed z the events Azn are

independent it follows by the BorelndashCantelli lemma that these events occur as

infinitely often From the as convergence of Zn we get for z ge 1

P(Zinfin = z) = P(Zn = z finitely often) le P(Azn occurs finitely often) = 0

This implies that P(1 le Zinfin lt infin) = 0