introduction to dynamic panel data: autoregressive models ... · “tests of specification for...

29
Introduction to Dynamic Panel Data: Autoregressive Models with Fixed Eects Eric Zivot Winter, 2013

Upload: vandan

Post on 21-Mar-2019

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Introduction to Dynamic Panel Data:

Autoregressive Models with Fixed Effects

Eric Zivot

Winter, 2013

Page 2: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Dynamic Panel Data (without covariates)

= −1 + + = 1 (individuals)

= 1 (time periods)

Typical assumptions

1. Stationarity: || 1

2. [|0 −1 ] = 0

3a. No serial correlation: ∼ (0 2)

3b. Homoskedasticity: ∼ (0 2)

Page 3: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Stationary Model Representation

= −1 + +

By recursive substitution

1 = 0 + + 1

2 = 1 + + 2

= [0 + + 1] + + 2

= 20 + (1 + ) + 1 + 2...

= 0 +

⎛⎝−1X=0

⎞⎠+−1X=0

Page 4: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Now

[|] = [0|] +

⎛⎝−1X=0

⎞⎠+

−1X=0

[−|]

= [0|] +

⎛⎝−1X=0

⎞⎠For large

≈ 0−1X=0

≈ 1

1−

Page 5: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

so that (for large )

[|] ≈1−

var[|] ≈∞X=0

var[−|]

=2

1− 2

Page 6: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Estimation

= −1 + +

= 0 +

⎛⎝−1X=0

⎞⎠+−1X=0

Can’t use RE estimation because

[−1 · ]

=

⎡⎣⎛⎝−10 +

⎛⎝−2X=0

⎞⎠+−2X=0

⎞⎠

⎤⎦ 6= 0

What about FE estimation?

Page 7: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Write the model in FE matrix notation as

y = y−1 + 1 + η

y×1

=

⎡⎢⎣ 1...

⎤⎥⎦ y−1×1

=

⎡⎢⎣ 0...

−1

⎤⎥⎦ η =⎡⎢⎣ 1...

⎤⎥⎦Define

Q = I −P1so that

Qy = y =

⎛⎜⎝ 1 − ...

⎞⎟⎠ =1

X=1

Page 8: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Then, the transformed model is

Qy = Qy−1 + Q1 +Qη⇒y = y−1 + η

The FE estimator of is the pooled OLS estimator on the transformed model

=

⎛⎝ X=1

y0−1y−1

⎞⎠−1 X=1

y0−1y

=

⎛⎝ X=1

y0−1Qy−1

⎞⎠−1 X=1

y0−1Qy

Page 9: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

For consistency, consider

− =

⎛⎝1

X=1

y0−1Qy−1

⎞⎠−1 1

X=1

y0−1Qη

will be consistent if

lim→∞

1

X=1

y0−1Qη = [y0−1Qη] = 0

Note: In the asymptotic analysis, the cross section dimension →∞ but thetime series dimension is held fixed!

Page 10: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Now

y0−1Qη =h0 −1

i⎡⎢⎣ 1 − ... −

⎤⎥⎦=

X=1

−1( − )

=1

X=1

−1 = −10 +

⎛⎝−2X=0

⎞⎠+−2X=0

Page 11: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Using

−1 = −10 +

⎛⎝−2X=0

⎞⎠+−2X=0

it follows that

[y0−1Qη] =X

=1

h−1( − )

i6= 0

because

[−1] =

⎡⎣−1⎛⎝ 1

X=1

⎞⎠⎤⎦ 6= 0

Page 12: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Result:

9 as →∞ for fixed

due to correlation between −1 and

Remark:

If both →∞ and →∞ then it can be shown that

because

=1

X=1

→ 0⇒ [−1] = 0

This is an example of “double asymptotic analysis” that is common in theanalysis of panel data models.

Page 13: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Bias of Fixed Effects Estimator

In the stationary model with ∼ iid (0 2) Nickell (1981, Ecta) showedthat for fixed and

[y0−1Qη] = −2()

() =1

1−

"1− 1

Ã1−

1−

!#⇒ is downward biased

Further, Nickell showed that for fixed as →∞

− → −(1− 2)

− 1

Ã1− 2()

− 1

!Notice that as →∞ and →∞ sequentially

− → 0

Page 14: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Example of Bias in FE estimation of dynamic panel model

FE bias: − / .05 .5 .952 -0.52 -0.75 -0.973 -0.35 -0.54 -0.7310 -0.11 -0.16 -0.2615 -0.07 -0.11 -0.17

Remarks

1. If 0 the bias is always negative, and massive for very small values of

2. As increases, the bias decreases but even with = 15 the bias is stillsubstantial for large .

Page 15: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

IV Estimation

Anderson and Hsiao (1981) suggested the following approach

1. First eliminate the fixed effect by taking first differences

− −1 = (−1 − −2) + − −1∆ = ∆−1 +∆ = 2

Note

[∆−1∆] 6= 0

due to correlation between −1 and −1 terms.

Page 16: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Stacking across time gives

∆y(−1)×1

= ∆y−1 +∆η

2. Do IV estimation using −2 as an instrument for ∆−1

[−2∆−1] = [−2(−1 − −2)] 6= 0[−2∆] = [−2

³ − −1

´] = 0

since is iid.

Page 17: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

The IV estimator is then

=

⎛⎝ X=1

y0−2∆y−1

⎞⎠−1 X=1

y0−2∆y

=

P=1

P=2 −2∆P

=1P=2 −2∆−1

which is consistent by construction.

Remark:

The Anderson-Hsiao estimator does not exploit all the relevant moment condi-tions so it is not the most efficient GMM estimator.

Page 18: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Arellano-Bond GMM Estimator

“Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review of Economic Studies, 58, 1991

Arellano and Bond (AB) derived all of the relevant moment conditions fromthe dynamic panel data model to be used in GMM estimation.

The moment condtions are based on the first differenced model

∆ = ∆−1 +∆ = 2

They showed that the number of moment conditions depends on (numberof time periods)

Page 19: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Example: = 4⇒ 6 moment condtions

= 2 : [∆20] = 0

= 3 :[∆30] = 0[∆31] = 0

= 4 :[∆40] = 0[∆41] = 0[∆42] = 0

For GMM estimation, define

g4 () =

⎛⎜⎜⎜⎜⎜⎜⎜⎜⎝

∆20∆30∆31∆40∆41∆42

⎞⎟⎟⎟⎟⎟⎟⎟⎟⎠=

⎛⎜⎜⎜⎜⎜⎜⎜⎜⎝

(∆2 − ∆1) 0(∆3 − ∆3) 0(∆3 − ∆2) 1(∆4 − ∆3) 0(∆4 − ∆3) 1(∆4 − ∆3) 2

⎞⎟⎟⎟⎟⎟⎟⎟⎟⎠

Page 20: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Notice that g4 () is a linear function of It may be re-expressed in matrixform as

g4 () =

⎡⎢⎢⎢⎢⎢⎢⎢⎢⎣

0 0 00 0 00 1 00 0 00 0 10 0 2

⎤⎥⎥⎥⎥⎥⎥⎥⎥⎦⎡⎢⎣ ∆2 − ∆1∆3 − ∆3∆4 − ∆3

⎤⎥⎦

= X40 [∆y4 − ∆y4−1]

Page 21: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

where

∆y4 =

⎡⎢⎣ ∆2∆3∆4

⎤⎥⎦ ∆y4−1 =

⎡⎢⎣ ∆1∆2∆3

⎤⎥⎦X4 =

⎡⎢⎣ 0 0 0 0 0 00 0 1 0 0 00 0 0 0 1 2

⎤⎥⎦

Page 22: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

The sample moments used for GMM estimation are then

g4() =1

X=1

X40 [∆y4 − ∆y4−1]

= S4∆ − S4∆−1

S4∆ =1

X=1

X40 ∆y4

S4∆−1 =1

X=1

X40 ∆y4−1

Page 23: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Because is randomly sampled over it follows that

S6×6

= [g4 ()g4 ()

0] = [X40 ∆η∆η0X4 ]

Under conditional heteroskedasticity, a consistent estimate is

S =1

X=1

X40 ∆η∆η0X4

∆η = [∆y4 − ∆y4−1]

For example, can use the Anderson and Hsiao IV estimate of

Page 24: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Under conditional homoskedasticity

S = [X40 ∆η∆η0X4 ] = [[X40 C

0ηη0CX

4 |X4 ]] =

= [X40 C0[ηη

0|X4 ]CX4 ] = 2[X

40 C

0CX4 ]

where

C0(−1)×

=

⎛⎜⎜⎜⎝−1 1 0 00 −1 1 0

0 0 0 −1 1

⎞⎟⎟⎟⎠So a consistent estimate of S is

S =1

X=1

X40 C0CX4

Estimation of 2 is not required for GMM estimation because it cancels out inthe resulting estimator.

Page 25: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

The efficient GMM estimator solves

min

( S−1) = g4()S−1g4()

= [S4∆ − S4∆−1]0S−1[S4∆ − S4∆−1]

Since ( S−1) is linear in the analytic solution is

(S−1) = (S40∆−1S

−1S4∆−1)S40∆−1S

−1S4∆

This estimator is known as the Arellano-Bond GMM estimator.

Page 26: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Example: International Difference in Output Growth Rates (Hayashi,Section 5.4)

Q: Do poor countries grow faster than rich countries? If so how much faster?

Neoclassical growth theory foundations

() = output per effective labor at time for a country

→ ∗ = steady state

The log-linear approximation around ∗ gives the adjustment equation

ln(())

= · [ln(∗)− ln(()]

= speed of convergence 0

Page 27: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Between any two time periods −1 and , the log-linear adjustment equationimplies

ln(()) = (1− ) · ln(∗) + ln((−1))

= exp [− · ( − −1)]

Define

() = ()

()() () = aggregate output

() = aggregate hours worked

() = level of labor augmenting technical progress

Page 28: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Assume () grows at a constant rate Then () = (0) exp( · ) and

ln () = ln

à ()

()

!− ln((0))− ·

Substituting into the output equation gives

ln

à ()

()

!= ln

à (−1)(−1)

!+ (1− ) [ln(∗)− ln((0))] +

= · ( − · −1)

Subtracting lnµ (−1)(−1)

¶from both sides gives the growth equation

∆ ln

à ()

()

!= (− 1) ln

à (−1)(−1)

!+(1− ) [ln(∗)− ln((0)] +

Because 1, the level of per capita output has a negative effect on growth.Hence, poor countries should grow faster than rich countries.

Page 29: Introduction to Dynamic Panel Data: Autoregressive Models ... · “Tests of Specification for Panel Data: Monte Carlo Evidence and an Applica-tion to Employment Equations”, Review

Turning the output equation into a dynamic panel data model

Assume

ln

à ()

()

!= ln

à (−1)(−1)

!+ (1− ) [ln(∗)− ln((0)] +

= · ( − · −1)

holds for every country where and are the same for every country but that∗ and might differ. Then we can write

= + −1 + +

= ln ( ()()) = log per capita output at time = (1− ){ln(∗ )− ln((0))} for country

= country and time specific shock (e.g. business cycle)