previously definition of a stationary process (a) constant mean (b) constant variance

previously

Definition of a stationary process (A) Constant mean (B) Constant variance(C) Constant covariance

White Noise Process: Example of Stationary Series.

Random Walk: Example of Non-stationary Series.

Economic time series are typically non-stationary(i) Share Prices(ii) Exchange Rate(iii) Income

time varying mean => non-stationary

Properties of time series II

Share Price Exchange Rate

Income


050100150200250300350400450 .25.5.7511.251.51.7522.252.5050100150200250 .35.4.45.5.55.6.65.71960196519701975 8.78.88.999.19.29.3

Why is non-stationarity important?

(A) Assumptions of the Classical Regression Model

(B) Spurious Regression Problem

Reading: Thomas 13.2 Non-stationary variables and the classical model

Other issues in this lecture(i) Order of Integration(ii) Dealing with non-stationarity

(a) Deterministic Trends(b) Stochastic Trends


Properties of time series II:

Why is stationarity important?

Stationarity is an assumptions about explanatory variables in the Classical Regression Model Yt = α + βXt + ut

- Typical regression model assumes that variance of time series (Xt) should tend to fixed finite constants in large samples

However, variables are typically stochastic in economics(mean and variance changes from sample to sample)

- probablility limit of these variances should equal fixed finite constants

Background Reading on Classical Regression Model R L Thomas Chapter Six.

Properties of time series II:

Classical Regression Model? Contd...

Inference (using t-statistics) in classical regression analysis is based on large sample theory.

- large sample theory is of no use if variance does not converge on a constant

- Consistency of OLS breaks down - sampling distribution takes non-standard form.- Can no longer use the t-distribution- normal hypothesis testing becomes invalid

Consequently, we can not use t-statistics in regression with non-stationary data.

Classical regression model was devised to deal with relationships between stationary variables. Should not be applied to non-stationary series.

Spurious Regression ProblemA significant problem associated with econometric estimation using non-stationary variables is that of spurious regression

- if an independent variable in a regression has a trend it is likely that the dependent variable will also have a trend

- for example the consumption function Ct = β0 + β1Yt + εt

- Trend dominated equations are likely to have (a) highly significant t-statistics(b) high value for the coefficient of determination R2

Spurious Regression Problem

yt = yt-1 + ut ut ~ iid(0,σ2)

xt = xt-1 + vt vt ~ iid(0,σ2)

ut and vt are serially and mutually uncorrelated

yt = β0 + β1xt + εt

since yt and xt are uncorrelated random walks we should expect R2

to tend to zero. However this is not the case.

Yule (1926): spurious correlation can persist in large samples with non-stationary time series.

- if two series are growing over time, they can be correlatedeven if the increments in each series are uncorrelated

Spurious Regression ProblemGranger and Newbold (1974)

- more recent study of non-stationary data and implications- simulation using repeated independent random walks yt = yt-1 + ut xt = xt-1 + vt


yt and xt are independent but strong correlation between yt

and yt-1, and also between xt and xt-1.

- regression of yt on xt gave high R2 but a low Durbin-Watson (DW) statistic- t-statistic often suggested a relationship when the series were independent- ran the regression in first difference there was a low R2 and a DW statistic which was close to 2

Phillips (1986) confirmed these simulation results theoretically. β1 converges on a random variable

Spurious Regression ProblemTwo random walks generated from Excel using RAND() commandhence independent



Two Random Walks

-4-202468101214

1 40 79 118 157 196 235 274 313 352 391 430 469

RW1 RW2

Spurious Regression ProblemPlot Correlogram using PcGive (Tools, Graphics, choose graph, Time series ACF, Autocorrelation Function)



0 5 10

0.25

0.50

0.75

1.00ACF-RW1

0 5 10

0.25

0.50

0.75

1.00 ACF-RW2

Spurious Regression ProblemEstimate regression using OLS in PcGive


based on two random walksyt = yt-1 + ut ut ~ iid(0,σ2)


EQ( 1) Modelling RW1 by OLS (using lecture 2a.in7) The estimation sample is: 1 to 498 Coefficient t-value Constant 3.147 25.8RW2 -0.302 -15.5

sigma 1.522 RSS 1148.534R^2 0.325 F(1,496) = 239.3 [0.000]**log-likelihood -914.706 DW 0.0411no. of observations 498 no. of parameters 2

Order of Integration

Definition A time series is said to be integrated of order d, written I(d), if after being difference d times it becomes stationary.

Series which are stationary without differencing are I(0).Many series are I(1) and hence they become stationary after differencing once.

For example, Yt ~I(1) implies Yt – Yt-1 = ΔYt ~I(0)

How do we deal with non-stationarity? Simple approaches(a) Deterministic Trend

- we can incorporate a deterministic time trend Zt = β1trend + ut => Zt* = Zt - β1

trend

Zt is a difference stationary process (DSP)

0 50 100 150 200 250 300 350 400 450 500

0

100

200

300

400

500 trend3

0 5 10

0.25

0.50

0.75

1.00 ACF-trend3

How do we deal with non-stationarity? Simple approaches(a) Deterministic Trend

Xt = Xt-1+ ut and Xt = β1trend + ut

=> Xt* = Xt - β1trend => (non-stationary)

- however, detrending a stochastic trend with a deterministic trend (i.e. time trend) does not result in a stationary variable

0 50 100 150 200 250 300 350 400 450 500

0.0

2.5

5.0

7.5

10.0

12.5RW2

0 50 100 150 200 250 300 350 400 450 500

100

200

300

400

500trend

How do we deal with non-stationarity? Simple approaches(b) First difference

Nelson and Plosser (1982) suggest that most time series have a stochastic trend. yt = yt-1 + ut

Hence if we first difference this produces a stationary variable.

Run regression with variables that are stationary by first differencing avoids spurious regression problem.

- t-statistics and R2 can be used for inference.

yt = β0 + β1xt + εt and subtracting yt-1 = β0 + β1xt-1 + εt-1

=> Δyt = θ1Δxt + vt

Dealing with non-stationarityFirst differenced random walks (i.e. difference stationary variables)

Δyt = yt - yt-1 = ut ut ~ iid(0,σ2)

Δxt = xt - xt-1 = vt vt ~ iid(0,σ2)

Plot correlograms and time series

0 5 10

0

1ACF-DRW1

0 5 10

0

1ACF-DRW2

0 50 100 150 200 250 300 350 400 450 500

0.0

0.5DRW1 DRW2

Dealing with non-stationarityEstimate regression in first differences using OLS

Δyt = θ1Δxt + vt

based on two first differenced random walks (ie difference stationary processes DSP)

Δyt = yt - yt-1 = ut ut ~ iid(0,σ2)

Δxt = xt - xt-1 = vt vt ~ iid(0,σ2)

EQ( 2) Modelling DRW1 by OLS (using lecture 2a.in7) The estimation sample is: 2 to 498

Coefficient t-valueDRW2 -0.028 -0.601

sigma 0.299 RSS 44.246R^2 0.001 F(1,495) = 0.3579 [0.550]log-likelihood -104.137 DW 1.92no. of observations 497 no. of parameters 1

How do we deal with non-stationarity? CaveatsTrend or Difference Stationary Processes? TSP or DSP?

(1) Although Nelson and Plosser (1982) suggest that most time series have a stochastic trend, literature is undecided eg Cochrane (1988) suggests US real GDP follows a deterministic trend. (TSP more applicable for real variables)

(2) May be preferable to work in logs for some time series.e.g., absolute growth rate when output is low will be much smaller than absolute growth rate when output is large. Percentage growth rate is much more constant.

(3) However it may not be so costly to overdifference.Evidence is more apparent for underdifferencing.

How do we deal with non-stationarity? CaveatsFirst differencing a Stochastic Trend - Warning

Assuming we have correctly identified a stochastic trend.It may not necessarily be worthwhile adopting the simple approach


Δyt = θ1Δxt + vt

We loose important information about β0. This would be the autonomous level of consumption if there was no income.

Also we do not get an idea of the coefficient on β1 the long run relationship.

Caveats about First DifferencingLoosing Long Run informationValid representation of equilibrium relationship between yt and xt


1. Three scenarios at end of period t-1(a) Y equals its equilibrium value yt-1 = β0 + β1xt-1

(b) Y is below its equilibrium value yt-1 < β0 + β1xt-1

(c) Y is above its equilibrium value yt-1 > β0 + β1xt-1

2. Three scenarios at end of period t(a) Y equals its equilibrium value yt = β0 + β1xt

(b) Y is below its equilibrium value yt < β0 + β1xt

(c) Y is above its equilibrium value yt > β0 + β1xt

Δyt = θ1Δxt + vt holds, only if 1(a) and 2(a) hold.

Change in Y depends on change in X and relationship between X and Y in the previous period.

Non-stationarity: some cautionary words.

Main conclusion: It is important to carefully consider whether time series is trend or difference stationary.

Test formally using the methods of Dickey and Fuller.

But do not loose sight of spurious regression problem.Preoccupation about stationarity lead researchers to loose sight of main objective.Estimating regression equations with non-stationary data which we can rely upon.

previously definition of a stationary process (a) constant mean (b) constant variance

Documents

regression of yt

nonstationary time series

xt t yt

classical regression

variance of time series

example of stationary

nonstationary variables

xt tsince yt