arima report

32
ARIMA Autoregressive Integrated Moving Average

Upload: joshy29

Post on 20-Nov-2014

342 views

Category:

Documents


9 download

DESCRIPTION

Introduction to ARIMA with EViews tutorial

TRANSCRIPT

Page 1: ARIMA Report

ARIMA

Autoregressive Integrated Moving Average

Page 2: ARIMA Report

Introduction - ARMA

ARMA - Auto Regressive Moving Average Introduced by Box and Jenkins in 1976. Box-Jenkins model. Used to develop a model that will forecast an

element based on its historical values. For example, the exchange rate in time t can

be forecasted based on its values in time t-2 and time t-5 plus stochastic error terms.

Page 3: ARIMA Report

Two Common Processes

Autoregressive process.

Moving average process.

It is likely that X has characteristics of both AR and MA and is therefore ARMA. In general, in an ARMA (p,q) process, there will be p autoregressive and q moving average.

Page 4: ARIMA Report

Introduction - ARIMA

Most economic time series (i.e. GDP) are nonstationary, that is, they are integrated.

In general, if a time series has to be differenced d times, it is integrated of order d or I(d).

On the other hand, if d=0, the resulting to I(0) process corresponds a stationary time series.

Therefore, if we have to difference a time series d times to make it stationary and then apply the ARMA(p,q) model to it, we say that the original time series is ARIMA (p,d,q).

Page 5: ARIMA Report

P, D, Q Model

P: the autoregressive parameter D: the integrated parameter of the number of

differencing passes Q: the moving average parameter For example: (0,1,2) model

Page 6: ARIMA Report

The Constants in ARIMA

If there are no autoregressive parameters in the model, then the expected value of the constant is , the mean of the series.

If there are autoregressive parameters in the series, then the constant represents the intercept.

If the series is differenced, then the constant represents the mean or intercept of the differenced series.

Page 7: ARIMA Report

Assumptions

Stationarity. No uncontrolled correlation. Arbitrary model lag order. No outliers. Randomly distributed shocks. Uncorrelated random errors.

Page 8: ARIMA Report

Stationarity

The input series should have a constant mean, variance, and autocorrelation through time.

This assumption is tested through the Augmented Dickey-Fuller Test and non-stationarity is fixed through differencing.

Assumptions

Page 9: ARIMA Report

No Uncontrolled Correlation

Autocorrelation means that the value of a given datum is largely determined by the value of the preceding datum in the series.

Assumption is tested through the Durbin-Watson Coefficient, with range of value from 0 to 4. A value of 2 indicates no autocorrelation, 0 indicates autocorrelation, and 4 indicates negative autocorrelation.

Assumptions

Page 10: ARIMA Report

Arbitrary Model Lag Order

The researcher must have a theoretical basis to establish the face validity of the order of the model.

Assumptions

Page 11: ARIMA Report

No Outliers

As in other forms of regression, outliers may affect conclusions strongly and misleadingly.

Assumptions

Page 12: ARIMA Report

Randomly Distributed Shocks

If shocks are present in the time series, they are assumed to be randomly distributed with a mean of 0 and a constant variance.

Assumptions

Page 13: ARIMA Report

Uncorrelated Random Errors

Residuals are randomly and normally distributed, have non-significant autocorrelations and partial autocorrelations, and have a mean of 0 and homogeneity of variance over time.

The Durbin-Watson test is the standard test for correlated error.

Assumptions

Page 14: ARIMA Report

Procedure

Test for the assumptions. The Box-Jenkins Methodology

Identification. Estimation. Diagnostic Checking. Forecasting.

Page 15: ARIMA Report

Test for Stationarity

Visual plot. Correlogram. Unit root test.

Augmented Dickey-Fuller Test Ho: Series is non-stationary Ha: Series is stationary. If absolute value of ADF > absolute value of the

critical regions, reject Ho

Page 16: ARIMA Report

Differencing Differencing is a procedure which attempts to de-trend

the data in order to control autocorrelation and achieve stationarity.

It does this by subtracting each datum in a series from its predecessor.

The number of times a series needs to be differenced to achieve stationarity is reflected in the d parameter.

In order to determine the necessary level of differencing, one should examine the plot of the data and autocorrelogram

Caution: Some time series may require little or no differencing. An over differenced series produce less stable coefficient estimates.

Procedure

Page 17: ARIMA Report

Identification Major tools: ACF and PACF One autoregressive (p) parameter: ACF - exponential

decay; PACF - spike at lag 1, no correlation for other lags. Two autoregressive (p) parameters: ACF - a sine-wave

shape pattern or a set of exponential decays; PACF - spikes at lags 1 and 2, no correlation for other lags.

One moving average (q) parameter: ACF - spike at lag 1, no correlation for other lags; PACF - damps out exponentially.

Two moving average (q) parameters: ACF - spikes at lags 1 and 2, no correlation for other lags; PACF - a sine-wave shape pattern or a set of exponential decays.

One autoregressive (p) and one moving average (q) parameter: ACF - exponential decay starting at lag 1; PACF - exponential decay starting at lag 1.

Procedure

Page 18: ARIMA Report

Estimation

approximate maximum likelihood method the fastest method should be used for very long time series (e.g., with more

than 30,000 observations) approximate maximum likelihood method with

backcasting must use this method first to establish initial parameter

estimates that are very close to the actual final values exact maximum likelihood method

may be inefficient when used to estimate parameters for seasonal models with long seasonal lags (e.g., with yearly lags of 365 days)

Procedure

Page 19: ARIMA Report

Diagnostic Checking

Test for the significance of the parameter estimates.

Use partial data to generate forecasts. Analysis of residuals.

Page 20: ARIMA Report

Limitations

The ARIMA method is appropriate only for a time series that is stationary.

At least 50 observations are recommended for the input data.

It is also assumed that the values of the estimated parameters are constant throughout the series.

Page 21: ARIMA Report

Illustration

Background: US GDP data from 1970 – 1996

Frequency: quarterly

Page 22: ARIMA Report

Extract Data From Excel

1

3

2

To Extract Data: File Open Foreign Data as Workfile Choose File Open

Page 23: ARIMA Report

Step 1: Check for Stationarity

Visual Plot

To Plot Data: Series Box View Graph Line

1

2

Page 24: ARIMA Report

Step 1: Check for Stationarity ADF / Unit Root Test

Series Box View Unit Root Test Level

1

3

2

Page 25: ARIMA Report

Step 2: Difference the Data ADF Test

Series Box View Unit Root Test 1st Difference

1

3

2

Page 26: ARIMA Report

Step 3: Estimate the P and Q• Correlogram – ACF and PACF

Series Box ViewCorrelogram 1st Difference

1

3

2

Page 27: ARIMA Report

Step 3: Estimate the P and Q

Correlogram – ACF and PACF

Page 28: ARIMA Report

Possible AR and MA models

And so on…

Page 29: ARIMA Report

Step 4: Estimate several modelsQuick Estimate EquationType equation: u_s_gdp c ar(1) ok1

3

2

Page 30: ARIMA Report

Quick Estimate EquationType Equation: u_s_gdp c ar(1) ma(1) ok1

3

2

Page 31: ARIMA Report

Step 5: Determine the model

Model AIC value SC value

AR(1) 10.04537 10.10206m AR(1)MA(1) 9.985905 10.07094

Page 32: ARIMA Report

Step 6: Checking

Use the t-test to check for the significance of the parameters.

Use the Durbin-Watson test to check for the autocorrelation of the error terms.