comparing automatic modeling procedures of tramo and x-12-arima, an update kathy mcdonald-johnson,...

Comparing Automatic Modeling Procedures of TRAMO andX-12-ARIMA, an Update

Kathy McDonald-Johnson, U.S. Census Bureau

Catherine Hood, Catherine Hood Consulting

Brian Monsell, U.S. Census Bureau

Chak Li, U.S. Census Bureau

ICES III June 2007

Acknowledgments

• Agustín Maravall

• Víctor Gómez

• Rita Petroni

• James Gomish

Update

• Similar comparisons in the past, especially Farooque, Hood, and Findley (2001)

»X-12-ARIMA chose models of similar quality to TRAMO models

»X-12-ARIMA perhaps better at identifying trading day effects than TRAMO

Update, Similar Approach

• We used a similar approach to that of Farooque, Hood, and Findley (2001), but we used improved versions of both programs

Outline

• Background on Automatic Modeling

• Methods

• Results

»Actual time series

»Simulated time series

• Conclusions

Background

Automatic ARIMA Modeling

• X-11-ARIMA from Statistics Canada, Dagum

»Picks best model from list

• TRAMO (Time series Regression with ARIMA noise, Missing observations and Outliers) from the Bank of Spain, Gómez and Maravall

»Multiple steps to obtain a model

TRAMO Automatic Modeling

• Gómez and Maravall (2000) gives description

• FORTRAN code from Gómez and Maravall provides additional detail

»Generously provided to U. S. Census Bureau for X-12-ARIMA Version 0.3 development

X-12-ARIMA Version 0.3

• Retains pick model method

»Pickmdl specification

• Adds step-through method based on the TRAMO method

»Automdl specification

X-12-ARIMA Comparisons

• Dent, Hood, McDonald-Johnson, and Feldpausch (2005) compared the step-through method to the pick model method

»Models of similar quality

»Step-through method more flexible

X-12-ARIMA's Automatic Transformation Selection

• Identification with the transform specification

• Fit a default model with the log transformation and with no transformation»Usually the airline model (0 1 1)(0 1 1) from Box

and Jenkins (1976)

• The model is chosen whose maximum likelihood value is larger»Likelihood of the log data is adjusted to be a

likelihood of the untransformed data

•By default, slight bias toward the log transformation

, ,ˆ ˆ 1Y YUntransformed N Log NL L

X-12-ARIMA's Automatic Regression Selection

• Trend Constant

»Identification with the step-through method, automdl specification

• Outliers

»Identification with the outlier specification

X-12-ARIMA's Automatic Regression Selection

Trading-Day Effect

Easter Effect

• Identification with the regression specification, aictest argument

• Test uses the AICC

»No bias (user can set bias)

• AIC Corrected (for sample size)

• Note: As N gets larger, AICC approaches the AIC

LAICC NN 11

Trading-Day Effects

• User specifies type

»Flow (cumulative)

»Stock (inventory)

• X-12-ARIMA compares AICC with and without the effect

Easter Effects

• Default tests for Easter effects of length 1, 8, and 15 days»User can specify length

• X-12-ARIMA compares four AICC values»No effect vs. each of the three different

length effects

Modeling Diagnostics

• Ljung-Box Q»Goodness-of-fit diagnostics (Ljung and

Box 1978)

• Spectrum of the model residuals»Diagnostic indicating seasonal or trading

day effects remaining in the model residuals

»Trading day frequencies defined in Cleveland and Devlin (1980)

Ljung-Box Q

• Based on sample autocorrelation of the regARIMA model residuals

• Residuals should behave like white noise

• Each Ljung-Box Q statistic of positive degrees of freedom has a corresponding p value

• An individual lag fails if the p value for the Q statistic for the lag is less than 0.05

Ljung-Box Q Failure

For this study

• If seven or more of the first 12 lags failor

• If 13 or more of the first 24 lags failor

• If lag 12 fails

Then the model fails according to this diagnostic

Spectrum of the Model Residuals

• Diagnostic indicating strength at frequencies of interest

• Visually significant peaks at seasonal or trading day frequencies indicate possible model problems

Significant Spectrum Peaks

• A spectral peak is considered to be significant if it

»Reaches a height beyond the median height of all the frequency measures

»Are taller than nearest neighbors by a visually significant amount

Significant Spectrum Peaks

For this study, any significant peak at

»seasonal frequencies one, two, three, for or five cycles per year and

»At either of the two trading-day frequencies

indicates model failure according to this diagnostic

Spectrum Diagnostic Information

• Graphical form»Output file line printer graph

»Higher resolution graph

• Text form»Log file

»Diagnostics file

• Failure warnings listed onscreen when X-12-ARIMA runs

Methods

Automatic Modeling Settings

• Test for log transformation

• Automatic regARIMA model identification

• Automatic outlier detection

• Test for

»Usual trading day

»Leap year

»Easter effects

Settings for X-12-ARIMA

• We expected some quarterly effects (higher autocorrelation three months apart), so we chose the maximum nonseasonal model order (maximum p, q) to be three»Default is two

• We chose to prefer balanced models to have an approach more like the TRAMO procedure»Default is not to prefer balanced models

Model Choices

• Ran TRAMO, X-12-ARIMA to identify transformation, model

• Hard-coded results into X-12-ARIMA input specification files

• Compared diagnostics from X-12-ARIMA

Clarification

• "TRAMO model" results are from X-12-ARIMA runs

»Initial TRAMO runs determined the transformation and model choices

Changes to Models

• Used X-12-ARIMA outlier set

• If any Easter regressor chosen, used X-12-ARIMA Easter effect of eight days

Actual Time Series

457 U. S. Census Bureau Series

• U. S. Building Permits

• Manufacturing

• Retail Sales

• Import/Export data

Descriptions available at

www.census.gov/cgi-bin/briefroom/BriefRm

Transformation Choice

• TRAMO and X-12-ARIMA agreed for 91% (417) of the series

• 40 series differed»85% (34/40) TRAMO chose log and X-12-

ARIMA chose no transformation

»15% (6/40) X-12-ARIMA chose log and TRAMO chose no transformation

Transformation Choice

• Transformation choice is fundamental

• We did not want to favor one program’s transformation over the other

• We dropped the 40 series of disagreement from further comparisons

Full Model Agreement(of 417 Series)

• 30% (124) of the regARIMA models agreed

»Any length Easter considered match

• 293 series to compare diagnostics

ARIMA Model Agreement(of 293 Series)

• 24% (70) of the ARIMA models agreed, showing differences only in the chosen regression effects

Easter Effects

• 76% (222) Easter effect agreement»13% (37) both programs chose an Easter effect

»63% (185) neither program chose an Easter effect

• 24% (71) Easter effect disagreement»24% (70) X-12-ARIMA chose Easter and TRAMO

did not

»0.3% (1) TRAMO chose Easter and X-12-ARIMA did not

Why Does X-12-ARIMA Include Easter Effects More Often?

• TRAMO checks for an Easter effect of one length

• X-12-ARIMA checks for three different lengths

• Do more possible regressors raise the chance of including an Easter effect?

Are the Easter Effects Appropriate?

• These economic series could indeed have Easter effects, but the results for X-12-ARIMA show Easter effects to be more prevalent than we would have expected

Trading-day Effects

• 57% (166) trading day agreement»24% (70) both programs chose trading-day

effects

»33% (96) neither program chose trading-day effects

• 43% (127) trading day disagreement»35% (104) X-12-ARIMA chose trading-day effects

and TRAMO did not

»8% (23) TRAMO chose trading-day effects and X-12-ARIMA did not

Appropriate Trading-day Effects

• Under specific conditions, we can evaluate whether a trading-day effect was missed

»One model includes a trading-day effect but the other does not

»The model with a trading-day effect has no spectrum peak at either trading-day frequency, but the model without a trading-day effect results in a peak at one or both of the trading-day frequencies

Trading Day Omitted

• 22% (64) of the series had this omission problem»20% (60) TRAMO omission

»1% (4) X-12-ARIMA omission

• Using a binomial distribution, the probability of seeing 60 out of 64 failures for one method if the probability of failure were equally 0.5 for each method is less than 0.01

Ljung-Box Q Model Failures

• 24% (69) one model passed and the other model failed»17% (50) TRAMO model failed

»6% (19) X-12-ARIMA model failed

• Binomial probability that 50 of 69 failures would be from one method is less than 0.01

Seasonal Spectrum Model Failures

• 14% (41) one model passed and the other model failed»8% (24) TRAMO model failed

»6% (17) X-12-ARIMA model failed

• Binomial probability of 24 of 41 failures being from one method is not significant at the 10% level, so there was no significant difference in the seasonal spectrum results

Simulated Time Series

Airline Model Series (0 1 1)(0 1 1)

• 3,500 monthly series

• 15 years long

• Nonseasonal moving average coefficient 0.6

• Seasonal moving average coefficient 0.9

• Start date 1980 (arbitrary choice)

No Model

• 0.6% (21) X-12-ARIMA did not choose a model

• TRAMO identified a model for each series

Fully Correct Model Identification

• Airline model with no trading day or Easter effects

• 66% (2,305) TRAMO correct

• 72% (2,516) X-12-ARIMA correct

Correct ARIMA Identification

• Also identified trading day or Easter effects

• 85% (2,978) TRAMO correct ARIMA

• 90% (3,159) X-12-ARIMA correct ARIMA

Nonseasonal Differencing

• 99% (3,480) TRAMO chose nonseasonal difference of order 1

• 99% (3,466) X-12-ARIMA chose nonseasonal difference of order 1

Seasonal Differencing

• 97% (3,378) TRAMO chose seasonal differencing of order 1

• 99% (3,470) X-12-ARIMA chose seasonal differencing of order 1

Easter Effect Identification

• 4% (148) TRAMO chose Easter effect

• 11% (392) X-12-ARIMA chose Easter effect

• No Easter effect present

• Binomial probability is less than 0.01 that we would see such a difference assuming equal probabilities of selection

Trading-day Effect Identification

• 13% (460) TRAMO chose trading-day effect

• 4% (138) X-12-ARIMA chose trading-day effect

• Binomial probability is less than 0.01 that we would see such a difference assuming equal probabilities of selection

Conclusions

• X-12-ARIMA mistakenly chooses an Easter effect more often than TRAMO

• As noted in Farooque, Hood, and Findley (2001), X-12-ARIMA still seems to choose trading-day effects more appropriately than TRAMO

Conclusions

• For known airline model simulations, X-12-ARIMA performed as well as TRAMO in identifying the ARIMA model

• X-12-ARIMA models performed as well as TRAMO when measured by the standard model diagnostics»Ljung-Box Q

»Spectrum of the model residuals

Newer Version of X-12-ARIMA

• We now have an improved version of X-12-ARIMA and hope to rerun the model identification to see if there are any changes to these results

Future Work

• Expand the study of simulated series to perform a more thorough evaluation of X-12-ARIMA’s new automatic modeling procedure using more varied models, model coefficients, regression effects, and series lengths

• Investigate how to improve the selection of the Easter effect

Disclaimer

This report is released to inform interested parties of ongoing research and to encourage discussion of work in progress. Any views expressed on statistical, methodological, technical, or operational issues are those of the authors and not necessarily those of the U.S. Census Bureau.

Much of the data analysis for this paper was generated using Base SAS® software, SAS/AF® software, and SAS/GRAPH® software, Versions 8 and 9 of the SAS System for Windows. Copyright © 1999-2003 SAS Institute Inc. SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc., Cary, NC, USA.

We used R to simulate the airline model time series. Additional analysis was performed using Microsoft® Excel 2000. Copyright © 1985-1999 Microsoft Corporation. We checked our own calculations of the binomial probabilities involving the actual data using the Binomial Calculator at onlinestatbook.com/java/binomialProb.html (home page at onlinestatbook.com), and we used it alone for the comparisons involving the simulated data.

comparing automatic modeling procedures of tramo and x-12-arima, an update kathy mcdonald-johnson,...

model slide

bias slide

tramo slide

diagnostic slide

background slide

aic slide

flexible slide

development slide

Documents

world bank - tramo...

why does communication and consultation style matter ·...

tramo-seats · contents of the library (i) time series...

catherine catherine catherine - wordpress.com · 2012. 10....

gaskets hood baffle insert for factory … hood seals ......

hood shields chrome hood shield

robin hood leveled book • z robin hood

detalle empalme prfv - caasd.gov.do bombeo prfv.pdf ·...

mapas transnomad17 05 low...dia -day 1 jueves / thursday 14...

análisis micromorfológico de un tramo de camino inca en la...

catherine benincasa-letters of catherine benincasa

gimnastica faciala - catherine pez - cdn4.libris.ro faciala...

determinacion de pci-tramo huaycha

fixed ufm - gpi · up1 conexión para el transductor del...

sgt3510tl sgt3510tl - · pdf filetramo de torre 6 mts tower...

catherine kim poon catherine catherinekimpoon

€¦ · javier heraud - ca. mayta capac, tramo 1287 -...

catherine durose, catherine mangan, catherine needham and...

tramo p.o 7

el proceso constructivo del tramo atirantado del puente de