selecting an accounting-based valuation model 77305 ... · aegv_l, for 51.42 percent of the...
TRANSCRIPT
1
Selecting an Accounting-based Valuation Model*
Woo-Jin Chang INSEAD
Boulevard de Constance 77305, Fontainebleau
France [email protected]
+33 (0)1 60 72 92 48 (Telephone) +33 (0)1 60 72 92 53 (Facsimile)
Wayne R. Landsman
University of North Carolina at Chapel Hill CB# 3490 McColl Building
Chapel Hill, NC 27599-3490
[email protected] +1 919-962-3221 (Telephone) +1 919-962-4727 (Facsimile)
Steven J. Monahan
INSEAD Boulevard de Constance 77305, Fontainebleau
France [email protected]
+33 (0)1 60 72 92 14 (Telephone) +33 (0)1 60 72 92 53 (Facsimile)
January, 2012
Keywords: Abnormal earnings growth valuation, accounting-based valuation, non-steady-state
forecasting, residual income valuation.
JEL classification codes: G11, G12, G17, and M41.
* We benefited from the comments of Hubert Gatignon and workshop participants at INSEAD, Nanyang Technological University, National University of Singapore, and Singapore Management University.
2
Selecting an Accounting-based Valuation Model
Abstract
We evaluate the relative valuation accuracy of six accounting-based valuation models. We begin
by demonstrating that there is no single best model. The model with the lowest mean unsigned
valuation error does not have the lowest median unsigned valuation error; and, the model with
the lowest mean (median) unsigned valuation error is the most accurate for only 10.41 (14.41)
percent of our sample observations. In light of these facts, we evaluate whether firm attributes
can be used to identify the most accurate model ex ante. Using fitted probabilities from a set of
ordered logistic regressions we are able to identify the most accurate model for 35.80 percent of
the observations in our sample. Moreover, our model-selection algorithm is superior to several
alternative algorithms. We also provide evidence regarding the factors underlying the firm
attributes that we use to predict relative accuracy; and, we show that a lifecycle factor has strong
predictive power. This result is important as it suggests a parsimonious and practical criterion
that researchers and practitioners can use to choose among valuation models.
1
1. Introduction
Accounting-based valuation models are popular because they can be used to develop
rigorous estimates of value that do not rely on arbitrary assumptions about dividend policy.
As a result, equity analysts often rely (directly or indirectly) on these models when
developing price estimates. In addition, academic researchers and practitioners frequently
use accounting-based valuation models to estimate the expected rate of return on equity
capital (see Easton [2007] for a review of the literature).
An issue that users of accounting-based valuation models confront is: In terms of
accuracy, which model is best? Clearly, this is a moot issue if it is possible to develop a set
of unbiased, steady-state forecasts of future accounting numbers. However, this is often
infeasible, particularly for young, high-growth firms and firms that are undergoing structural
or life-cycle changes. Hence, users have to make assumptions about the forecast horizon and
the type of model (i.e., residual income valuation or abnormal earnings growth valuation) to
employ.
We evaluate the relative unsigned valuation errors of six accounting-based valuation
models that are representative of the models commonly used by practitioners and academics.
The models we evaluate differ along two dimensions: type and forecast horizon. Regarding
model type, we consider price estimates derived from either the residual income valuation
(RIV) model or the abnormal earnings growth valuation (AEGV) model. For each of these
two model types, we consider three different forecast horizons: (1) a naïve horizon in which
our RIV (AEGV) estimate equals contemporaneous equity book value per share (capitalized
forward earnings per share); (2) a short horizon of five years; and, (3) a long horizon of
fifteen years.
We begin our analyses by determining the mean and median unsigned valuation error
of each model. For a particular model and observation, we define the unsigned valuation
2
error as the ratio of the absolute value of the difference between the estimated share price and
the contemporaneous share price to contemporaneous share price. We demonstrate that the
RIV model assuming a short forecast horizon, RIV_S, and the AEGV model assuming a
short forecast horizon, AEGV_S, have the lowest mean and median unsigned valuation
errors, respectively. Hence, the model that is best for the typical firm, AEGV_S, is not best
on average, RIV_S. Moreover, and perhaps more importantly, for many of our sample
observations neither of these models is best. In particular, RIV_S (AEGV_S) has the lowest
unsigned valuation error for only 10.41 (14.41) percent of our sample observations.
The results described above lead us to conclude that there is no single best model.
Although this conclusion may be uncontroversial, it goes against an implicit assumption
made by many researchers. For example, when evaluating accounting-based estimates of the
expected rate of return on equity capital, Easton and Monahan [2005] implicitly assumes
there is a single model that generates the most reliable estimate. More recently, Jorgensen,
Lee, and Yoo [2011] (hereafter JLY) evaluates several accounting-based valuation models.
However, because the study primarily focuses on mean and median valuation errors, JLY
implicitly assumes there is a single, best model.
With the above in mind, we develop and evaluate an algorithm that uses observable
firm attributes to select a valuation model for a particular observation. Our algorithm
involves six steps. First, we create two random samples, which we refer to as the estimation
sample and the holdout sample. Second, for each observation in the estimation sample we
assign a rank to each of the six valuation models. This rank corresponds to the relative
magnitude of the unsigned valuation error obtained when we use that particular model to
estimate the price for the observation. The Nth most accurate model is assigned a rank of N -
i.e., the most accurate model is assigned a rank of one; the second most accurate model is
assigned a rank of two; etc.
3
Third, using the estimation sample we estimate six ordered logistic regressions, i.e.,
one regression for each valuation model. The dependent variable in each regression is the
rank described above. The independent variables are 23 firm attributes motivated by extant
analytical and empirical research. Fourth, for each holdout-sample observation and each
valuation model, we use the estimated coefficients from the relevant ordered logistic
regression to estimate the fitted probability that the valuation model in question will have
rank N. Fifth, for each holdout-sample observation and each valuation model, we create the
variable FIT_RANK, which equals the rank with the highest fitted probability. Finally, for
each holdout-sample observation, we select the valuation model with the lowest value of
FIT_RANK.
The above algorithm selects the AEGV model assuming a long forecast horizon,
AEGV_L, for 51.42 percent of the holdout-sample observations, the AEGV model assuming
a short forecast horizon, AEGV_S, for 28.27 percent of the holdout-sample observations, and
the naïve residual income model, RIV_N, for 17.82 percent of the holdout-sample
observations. Hence, on an ex ante basis, the algorithm tends to select one of two AEGV
models or a naïve RIV model. Even though the algorithm leads to a fairly distinct
categorization of observations, it still performs well. For 35.80 percent of the holdout-sample
observations the selected valuation model per the algorithm generates the most accurate
estimate of price. Moreover, it performs better than several alternative algorithms. For
example, for 70.70 (48.16) percent of the holdout-sample observations, the model selected by
algorithm is no less accurate (is more accurate) than a model selected randomly after
assigning each valuation model a selection probability equal to the actual percentage of the
estimation-sample observations for which that model is the most accurate.
A shortcoming of the algorithm described above is that it only provides indirect
evidence regarding the types of firms for which a particular accounting-based valuation
4
model should be used. To address this shortcoming, we factor analyze the independent
variables used in the ordered logistic regressions; and, we use the results of these analyses to
estimate factor scores. When doing this we only use observations from the estimation
sample. The analysis reveals that there are three latent factors, which relate to: (1)
investment opportunities; (2) lifecycle stage; and, (3) net operating asset turnover.
After performing the factor analysis, we modify the selection algorithm. First, for
each estimation-sample observation we replace the original independent variables used in the
ordered logistic regressions with the factor scores; and, we re-estimate the ordered logistic
regressions. We show that the lifecycle factor has strong explanatory power. In particular,
we show that for large, mature, stable firms: (1) AEGV models are superior to RIV models,
and (2) models that assume relatively long forecast horizons are more accurate.
Second, we use the factor patterns obtained from the estimation sample to estimate a
factor score for each holdout-sample observation. We combine this factor score with the
coefficients from the modified ordered logistic regressions to arrive at a fitted probability for
each holdout-sample observation and valuation model. We then recalculate the variable
FIT_RANK and we select the valuation model with the lowest value of FIT_RANK. Similar
to the original algorithm, the modified algorithm tends to select the AEGV_L (48.37 percent
of the holdout-sample observations), AEGV_S (39.49 percent of the holdout-sample
observations), or RIV_N (12.15 percent of the holdout-sample observations) model. It also
performs well. For 30.32 percent of the holdout-sample observations the selected valuation
model per the modified algorithm generates the most accurate estimate of price. Moreover,
for 67.80 (46.32) percent of the holdout sample observations, the predicted model from the
modified algorithm is no less accurate (is more accurate) than a model selected randomly
after assigning each valuation model a selection probability equal to the actual percentage of
the estimation-sample observations for which that model is the most accurate. Additional
5
analyses reveal that the lifecycle factor is the key determinant of the identity of the model
selected by the modified algorithm.
Our study extends the literature in three ways. First, we demonstrate that there is no
single, best valuation model. In particular, models that are considered “best” per extant
research frequently perform poorly for many firms. Second, we show that relative valuation
accuracy is predictable ex ante based on knowledge of a firm’s attributes. Finally, we show
that where a firm is its lifecycle is a key determinant of the identity of the appropriate
valuation model for that firm. This final result is important as it suggests that researchers and
practitioners should evaluate a firm’s lifecycle stage when deciding on which approach to use
to estimate the value of the firm’s equity.
The remainder of the paper proceeds as follows. In section two we briefly discuss the
related literature. In section three we describe our six accounting-based valuation models,
how we estimate the inputs to these models, and how we construct our samples. In sections
four and five we discuss our main empirical results and our sensitivity analyses. We
conclude in section six. Finally, there is one appendix.
2. Related Literature
A key analytical result in the literature is provided by Ohlson [1995], who solves a
conundrum: While value equals the present value of expected future dividends (i.e., the
dividend discount model or DDM), dividend policy is irrelevant from a first-order
perspective (Modigliani and Miller [1961]). Hence, to estimate value via the DDM, arbitrary
assumptions about future dividend policy must be made. Ohlson [1995] solves this
conundrum by showing that accounting-based valuation models can be used to develop
rigorous estimates of value that do not rely on assumptions about future dividend policy.
6
Penman and Sougiannis [1998] (hereafter PS) is the first study to provide empirical
support for the analytical results developed by Ohlson [1995]. PS shows that price estimates
obtained from accounting-based valuation models are less biased (i.e., have lower signed
errors) than estimates of value obtained from either the DDM or a cash-flow based model.
PS also performs conditional tests in which they evaluate how bias varies with certain
portfolio-level attributes. Although PS makes a clear contribution, the study is not definitive.
PS focuses on bias rather than accuracy, performs its analyses at the portfolio level, and uses
ex post realizations of the relevant payoffs to generate their value estimates. Hence, the PS
results do not shed light on either the firm-level determinants of relative accuracy or the
tradeoff between forecast accuracy and horizon length. Finally, when conducting the
conditional analyses, PS uses several conditioning variables that are a function of
contemporaneous price; hence, there is an issue of circularity.
Francis, Olsson, and Oswald [2000] (hereafter FOO) builds on the PS study in several
ways. FOO focuses on accuracy (i.e., unsigned valuation errors) and associations, uses Value
Line forecasts, and conducts analyses at the firm level. FOO shows that accounting-based
models are more accurate; however, they only consider one accounting-based model. Hence,
although FOO makes a clear contribution to the literature, it neither sheds light on the relative
accuracy of different accounting-based models nor attempts to identify firm-level attributes
that are associated with relative accuracy. Similarly, as discussed above, JLY evaluates
several accounting-based models; however, JLY primarily focuses on mean and median
unsigned errors and does not attempt to develop an approach for selecting between models.
Finally, a related stream of research pertains to studies that evaluate the reliability of
accounting-based measures of the expected rate of return on equity capital (Easton and
Monahan [2010] provides a review and discussion). Although these studies focus on
7
different phenomena such as the relation between accounting-based proxies and realized
returns, they also shed light on the relative merits of the underlying valuation models.
3. Empirical Proxies and Sample Construction
In section 3.1, we provide a general description of the six accounting-based valuation
models that we evaluate. In section 3.2, we provide details about how we estimate model
inputs. In section 3.3, we describe our sample construction algorithm.
3.1 Accounting-based Valuation Models
We evaluate three versions of the RIV model and three versions of the AEGV model.
We begin by describing our RIV models. To do this we refer to equation (1), which is shown
below.
RIV_ INFi ,0 = Bi ,0 +ROEi ,t − ri( ) × Bi ,t −1
1+ ri( )tt =1
∞∑ (1)
In equation (1), RIV_INFi,0 is the estimate of firm i’s share price at time zero obtained from a
residual income valuation model in which an infinite forecast horizon is adopted; Bi,t is firm
i’s expected equity book value per share at the end of time t; ROEi,t is firm i’s expected
accounting return on equity for time t, which equals the ratio of firm i’s expected earnings per
share for year t (i.e., epsi,t) to Bi,t-1; and, ri is the expected rate of return on equity capital for
firm i. All expectations are formed at time zero.
As discussed in Ohlson [1995], equation (1) is identical to the DDM as long as there
is clean surplus accounting and the discount rate is neither stochastic nor time varying.
Equation (1) is not directly implementable as it requires an infinite series of forecasts.
Hence, a finite forecast horizon must be adopted and a terminal value correction must be
made. With this in mind we consider the following models.
0,0,_ ii BNRIV = (1.A)
8
RIV_Si ,0 = Bi ,0 +ROEi ,t − ri( ) × Bi ,t −1
1+ ri( )tt=1
5∑ (1.B)
RIV_Li ,0 = Bi ,0 +ROEi ,t − ri( ) × Bi ,t−1
1+ ri( )tt =1
15∑ (1.C)
RIV_Ni,0 is the estimate of firm i’s share price at time zero after making a naïve assumption
about the forecast horizon; RIV_Si,0 is the estimate of firm i’s share price at time zero after
assuming a short horizon of five years; and, RIV_Li,0 is the estimate of firm i’s share price at
time zero after assuming a long horizon of fifteen years. In all cases we assume a terminal
value of zero.
The different versions of equation (1) reflect a tradeoff. Ceteris paribus, lengthening
the forecast horizon improves a model’s accuracy. Specifically, economic rents and the bias
in ROE attributable to conservative accounting are both expected to dissipate over time.
Hence, as discussed in Zhang [2000], in the long run the expected growth rate in residual
income converges to a constant. However, lengthening the forecast horizon is not costless as
long-horizon forecasts tend to be less precise than short-horizon forecasts. Hence, there is a
tradeoff and the issue of which forecast horizon to adopt is an empirical question.
A second tradeoff relates to the choice of the valuation anchor. In particular, rather
than using RIV and anchoring on contemporaneous equity book value per share, we can use
AEGV and anchor on forward earnings per share.
AEGV_ INFi ,0 =epsi ,1
ri+
RONECi ,t +1 − ri( ) × epsi ,t − dpsi ,t( )ri × 1+ ri( )t
t =1
∞∑ (2)
In equation (2), AEGV_INFi,0 is the estimate of firm i’s share price at time zero obtained
from an abnormal earnings growth valuation model in which an infinite forecast horizon is
adopted; epsi,t is firm i’s expected earnings per share for time t; RONECi,t+1 is firm i’s
expected accounting return on new equity capital for time t+1, which equals the ratio of the
9
expected change in firm i’s earnings per share for year t+1 (i.e., ∆epsi,t+1) to expected
reinvested earnings in time t (i.e., epsi,t-dpsi,t, dpsi,t denotes expected dividends per share for
firm i in time t); and, ri is firm i’s expected rate of return on equity capital. All expectations
are formed at time zero.
Equation (2) has two potential advantages vis-à-vis equation (1). First, ignoring
unconditional conservatism and assuming economic rents are expected to dissipate over time,
expected future RONEC’s, which primarily relates to the expected performance of future
investments, will converge towards ri faster than expected future ROE’s, which reflect the
expected performance of both past and future investments. Second, unconditional
conservatism only affects earnings per share and RONEC when there is non-zero growth in
assets accounted for conservatively. On the other hand, equity book value per share and ROE
are always affected by unconditional conservatism.
Despite its advantages, it is not obvious that the AEGV model dominates the RIV
model. Because the RIV model is anchored on equity book value per share it reduces the
importance of forecasting and, consequently, mitigates the impact of forecast errors.
Moreover, equity book value per share is a function of the firm’s past investments; hence, it
reflects management’s expectations regarding future growth and profitability.
In light of the above, the relative accuracy of equation (1) and equation (2) is an
empirical question. To address this question, we adjust equation (2) so that it can be
implemented. In particular, as with equation (1), equation (2) requires that an infinite
sequence of forecasts be developed, which is infeasible. Hence, we consider the following
versions of equation (2).
i
ii r
epsNAEGV 1,
0,_ = (2.A)
( ) ( )( )∑
=
+
+×−×−
+=4
1
,,1,1,0,
1_
tt
ii
titiiti
i
ii
rr
dpsepsrRONEC
r
epsSAEGV (2.B)
10
( ) ( )( )∑
=
+
+×−×−
+=14
1
,,1,1,0,
1_
tt
ii
titiiti
i
ii
rr
dpsepsrRONEC
r
epsLAEGV (2.C)
AEGV_Ni,0 is the estimate of firm i’s share price at time zero after making a naïve
assumption about the forecast horizon; AEGV_Si,0 is the estimate of firm i’s share price at
time zero after assuming a short horizon of five years; and, AEGV_Li,0 is the estimate of firm
i’s share price at time zero after assuming a long horizon of fifteen years. In all cases we
assume a terminal value of zero.
As discussed above, the different versions of equation (2) reflect a tradeoff between
the costs and benefits of lengthening the forecast horizon. Again, the nature of this tradeoff is
an empirical question.
3.2 Estimation of Model Inputs
Price at time zero, Pi,0, equals the share price per I/B/E/S on the date immediately
after the date that the last consensus I/B/E/S forecast was made available in fiscal year zero.
Equity book value per share at time zero, Bi,0, equals the ratio of COMPUSTAT data item
CEQ to COMPUSTAT data item CSHO (we adjust for stock splits and dividends). Earnings
per share at time zero, epsi,0, equals the ratio of COMPUSTAT data item IB to
COMPUSTAT data item CSHO (we adjust for stock splits and dividends). Dividends per
share at time zero, dpsi,0, equals COMPUSTAT data item DVPSX_F. Expected earnings per
share for years one through five (i.e., epsi,t ∀ t ∈ [1,5]) equals the median analysts’ consensus
forecast of earnings per share. We determine median forecasts from the available analysts’
forecasts in the last I/B/E/S file released prior to the end of fiscal year zero. If there is no
explicit consensus forecast of epsi,t, we set it equal to epsi,t-1×(1+gi,ltg). gi,ltg equals the
analysts’ consensus long-term growth rate in earnings per share. Expected dividends per
share, dpsi,t ∀ t > 0, equals to Ki,0×epsi,t. The variable Ki,0 is determined in the following
manner. If epsi,0 > 0, Ki,0 = MAX(0,MIN(dpsi,0÷epsi,0,1)). However, if epsi,0 ≤ 0, Ki,0 =
11
MAX(0,MIN(dpsi,0÷(0.06×Bi,0),1)). We use the clean surplus relation to determine expected
equity book value per share (i.e, Bi,t ∀ t ∈ [1,5]); in particular, Bi,t = Bi,t-1+epsi,t-dpsi,t.
Expected return on equity for years one through five (i.e., ROEi,t ∀ t ∈ [1,5]) equals
epsi,t÷Bi,t-1. Expected return on new equity capital for years two through five (i.e.,
RONECi,t+1 ∀ t ∈ [2,5]) equals ∆epsi,t+1÷(epsi,t-dpsi,t).
Based on results in Nissim and Penman [2001], we assume that during years six
through fifteen expected ROE fades to a constant. We implement this assumption in the
following manner. First, for each year between 1963 and 1999 we identify all COMPUSTAT
firms with the requisite data to compute ROE. Second, for each year we sort firms on the
basis of ROE and we form 20 portfolios. Third, for each annual portfolio we calculate the
median ROE for the year in which the portfolio was formed and the subsequent ten years.
We refer to these ROE’s as MROEk,p,t (i.e., the median ROE in year t+k of firms in portfolio
p formed in year t). Fourth, for each k and each p, we calculate the grand mean of the
portfolio medians. We refer to this variable as ∑=
=37
1
,,, 37
_t
tpkpk
MROEMROEGM . Finally, for
the i’th observation in our sample, we identify the portfolio p with the value GM_MROE0,p
that is closest in absolute value to the expected ROE for observation i in year 5 (i.e., ROEi,5)
and we set ROEi,t equal to GM_MROEt-5,p ∀ t ∈ [6,15].
In Figure One we graph GM_MROEk,p for the twenty portfolios. The value of k,
which varies between zero and ten, is shown on the x-axis. The value of GM_MROE0,p for
each value of p ∈ [1,20] is shown on the y-axis. As shown in the graph, ROE mean reverts
rapidly and begins to level off by year eight; hence, our assumption about the evolution of
ROE between years six and 15 appears reasonable.
12
Expected RONEC for years six through 15 is calculated as follows: RONECi,t =
RONECi,t-1-((RONECi,5-ri)÷10) ∀ t ∈ [6,15]. Hence, we assume that RONEC fades at a
constant rate until it ultimately equals ri in year 15.
To implement the RIV_L we need expected equity book value per share (i.e., Bi,t) for
years five through 14. As discussed above, we obtain Bi,5 via the clean surplus relation, the
analysts’ consensus forecast of epsi,5, and our assumption about dividends. To obtain Bi,t for t
> 5 we continue to use the clean surplus relation; however, epsi,t = ROEi,t×Bi,t-1 (the
determination of ROEi,t ∀ t > 5 is described above). We continue to set dpsi,t ∀ t > 5 equal to
Ki×epsi,t. To implement AEGV_L we need expected reinvested earnings per share (i.e.,
epsi,t-dpsi,t) for years five through 14. We use the analysts’ consensus forecast of epsi,5 and
our assumption about the dividend payout ratio to calculate the reinvested earnings per share
for year five. For years six through 14, (epsi,t-dpsi,t) = epsi,t-1×(RONECi,t×(1-Ki)+1)×(1-Ki).
Finally, we estimate ri as follows. First, for each industry j we estimate the following
weighted least squares regression for each month m between January, 1980 and December,
2010. The weights, which are not shown in the equation, equal the contemporaneous equity
market values of the observations. We use the industry definitions shown in Fama and
French [1997].
reti ,τ = α j ,m + βLEV_ IND j ,m × rM ,τ − rf ,τ( ) + εi ,τ (3)
In equation (3), reti,τ is realized stock return for firm i in month τ; αj,m is the estimated
intercept for industry j in month m; βLEV_IND j,m is the estimated levered beta for industry j
in month m; rM,τ is the realized return for month τ on the value-weighted market portfolio; rf,τ
is the market yield on ten-year, constant-maturity, U.S. treasury bonds issued in month τ;
and, εi,τ is an error term. We estimate the regression using all members of industry j in the
CRSP monthly files that have at least 30 monthly returns between month m-60 and month m.
13
Second, because βLEV_IND j,m reflects industry leverage, which likely differs from
firm-level leverage, we unlever it using the industry leverage ratio in month m. We refer to
this measure of beta as βUNLEV_IND j,m. Third, for sample observation i, we select the
month m that corresponds to the year and month in which our value estimate pertains and we
use i’s leverage ratio to re-lever the unlevered industry beta. We refer to this estimated firm-
level beta as βLEV_FIRMi,m. Finally, to determine ri, we multiply βLEV_FIRMi,m by five
percent (i.e., our assumed equity premium) and then add the contemporaneous market yield
on ten-year, constant-maturity, U.S. treasury bonds.1
3.3 Sample Construction
We begin by combining the 2010 COMPUSTAT annual file with the summary 2010
I/B/E/S tapes. We eliminate all observations with a fiscal year of 1979 or earlier because the
I/B/E/S data are sparse prior to 1980. We also eliminate observations with a standard
industrial classification (i.e., SIC) code that does not belong to any of the 48 industry groups
described in Fama and French [1997]. We do this because we cannot estimate
βLEV_FIRMi,m for these observations. We also eliminate observations for which
βLEV_FIRMi,m ∉ [0,2].
Finally, for each industry-year we randomly assign half of the observations to the
estimation sample and the remaining half to the holdout sample, which we use to evaluate the
out-of-sample performance of our selection algorithm. This leaves us with 35,473
observations for the estimation sample; however, because of data limitations, some of our
tests are based on smaller samples. The tenor of our results does not change if we estimate
all of our tests on a common sample.
1 We evaluate the sensitivity of our results to the assumptions underlying our estimate of ri. First, we assume four alternative values of the equity premium: three percent, four percent, six percent, and seven percent. Second, we consider alternative approaches for estimating beta. For example, we estimate firm-specific betas and we calculate the industry-level beta by value weighting the betas of the firms in the industry. Finally, we assume ri equals 10 percent for all firm-years. None of these alternative approaches for estimating ri affect the tenor of our results.
14
4. Main Empirical Results
4.1 Descriptive Statistics
Panel A of Table One contains descriptive statistics on key attributes of the
observations in the estimation sample. Two comments are warranted. First, the estimation
sample consists of large firms most of which are expected (at least by analysts who provide
forecasts to I/B/E/S) to perform well. In particular, the mean (median) of ROE5 is 0.199
(0.160) and the mean (median) of RONEC5 is 207.066 (0.175). These numbers are
considerably higher than the mean (median) of r, which equals 0.113 (0.111). Second, both
ROE and RONEC take on extreme values. For example, the first and 99th percentiles of
ROE5 are −0.379 and 1.028; moreover, the first and 99th percentiles of RONEC5 are 0 and
1.173. These extreme values suggest we will obtain extreme valuation errors for some of our
sample observations. However, because we focus on relative ranks, these extreme errors
should not confound our statistical tests.
In Panel B of Table One we provide descriptive statistics on the signed valuation
errors of each of our six valuation models. These statistics pertain to estimation-sample
observations. To determine the signed valuation error for particular observation and
valuation model we divide the difference between the model’s value estimate and the
contemporaneous stock price by the contemporaneous stock price. All of our models tend to
underestimate equity value. Moreover, with the exception of AEGV_L, which has mean
signed error of 0.941, the mean signed errors are also negative. Not surprisingly, models
with longer forecast horizons exhibit mean (median) errors that are less negative than models
with short forecast horizons. Nonetheless, even RIV_L, which assumes a fifteen-year
forecast horizon, has a median (mean) signed error of −0.310 (−0.258). Finally, consistent
with the extreme values of ROE and RONEC shown in Panel A of Table One, the absolute
15
magnitudes of some of the signed errors are large. For example, the first and 99th percentiles
of the signed errors obtained from the AEGV_L model are -4.506 and 20.919.
Finally, in Panel C of Table One we show correlations between contemporaneous
price and each of our value estimates as well as the correlations between the value estimates.
Pearson product moment (Spearman rank order) correlations are shown above (below) the
diagonal. The correlations and t-statistics shown in the table relate to the estimation sample.
The correlations equal the mean of the annual correlations and each t-statistic equals the
relevant mean correlation divided by its temporal standard error. Because of the extreme
observations noted above, we focus on the Spearman rank order correlations. Two comments
are warranted. First, all of the value estimates have a significant, positive relation with price.
Second, the correlations between our naïve value estimates and price are roughly equal to, or
larger than, the correlations between price and the value estimates that rely on more
sophisticated assumptions about the forecast horizon. For example, the correlation between
price and RIV_N is 0.71, which is larger than the correlation between price and AEGV_L,
0.70. Taken together these results suggest that: (1) all of the value estimates are associated
with price and (2) naïve valuation models perform roughly as well as more sophisticated
approaches. As shown below, this second result is knife-edged.
4.2 Relative Unsigned Valuation Errors
Panel A of Table Two contains descriptive statistics on unsigned valuation errors.
These statistics pertain to the estimation sample. To determine the unsigned valuation error
for a particular observation and valuation model we divide the absolute value of the
difference between the model’s value estimate and the contemporaneous stock price by the
contemporaneous stock price. We also provide descriptive statistics for the best model,
which, for a particular observation, we define as the model with the lowest unsigned
valuation error for that observation. Several comments bear mentioning.
16
First, ignoring the best model, the mean and median unsigned valuation errors are
large for all of the models. For example, the mean (median) unsigned valuation error for
AEGV_L is 2.195 (0.465). Second, and related to the first point, some of the unsigned
valuation errors are extremely large. For example, ignoring the best model, the unsigned
valuation errors of RIV_N have the lowest 99th percentile. Nonetheless, the 99th percentile of
RIV_N’s unsigned valuation errors exceeds one (i.e., 1.337). Moreover, the 75th percentiles
of the valuation errors for AEGV_S and AEGV_L are 0.633 and 0.875, respectively. Third,
the model with the lowest median error (i.e., AEGV_S with a median unsigned valuation
error of 0.376) does not have the lowest mean unsigned valuation error. In particular,
AEGV_S has a mean unsigned valuation error of 0.662 whereas RIV_S has the lowest mean
unsigned valuation error (i.e., 0.470). Finally, ex post using the best model for each
observation is clearly superior to using the same model for each observation. Specifically,
the mean (median) unsigned valuation error for the best model is 0.203 (0.124), which is
orders of magnitude smaller than the mean (median) unsigned valuation errors of each of the
other models.
In Panel B of Table Two we further explore the potential improvement in accuracy
obtained from using the most accurate model for each estimation-sample observation. In
particular, for each observation we compute the difference between the unsigned valuation
error of each model and the unsigned valuation error of the best model. In Panel B of Table
Two, we provide descriptive statistics for these differences. These results illustrate that there
is considerable potential for improvement. For example, the AEGV_S (RIV_S) has the
smallest median (mean) difference, which equals a nontrivial 0.162 (0.267).
Finally, in Table Three we show the percentage of estimation-sample observations for
which each valuation model ranks first, second, third, etc., in terms of relative unsigned
valuation errors. Note that lower rankings imply higher relative accuracy. First, only three
17
models (i.e., AEGV_L, RIV_N, and AEGV_N) rank first for more than 16.67 percent (i.e.,
one-sixth) of the observations. The one-sixth cutoff is relevant because it represents the
frequency of observations for which each model would rank first if relative accuracy was
purely random. Second, AEGV_L ranks first most often. It is the best model for 28.53
percent of the observations; whereas, RIV_N and AEGV_N are the most accurate for 19.10
and 17.06 percent of the observations, respectively. Finally, although AEGV_L, RIV_N, and
AEGV_N are the most accurate, they are also the least accurate. In fact, for a randomly
chosen observation each of these models is more likely be the least accurate and not the most
accurate. In particular, for 37.58, 39.73, and 18.06 percent of the observations AEGV_L,
RIV_N, and AEGV_N ranks last in terms of valuation accuracy.
Based on the results shown in Table Two and Table Three we conclude that there is
no single, best model. The model that is best for the typical firm is not best on average.
Moreover, models that have the highest frequency of being the most accurate also have the
highest frequency of being the least accurate. In fact, these models are more likely to be the
least accurate not the most accurate.
4.3 Predicting Ranked Accuracy
As discussed in section 4.2, there is considerable potential for improvement vis-à-vis
an algorithm of using the same valuation model for each observation. Whether this potential
can be realized ex ante, however, is not obvious. With this in mind we develop an algorithm
for predicting model ranks. In particular, for each valuation model we estimate an ordered
logistic regression in which for each observation the dependent variable is the model’s rank
in terms of accuracy and the independent variables are listed below. We estimate the ordered
logistic regression using observations from the estimation sample.
18
Variable Description LOGSALE Natural log of contemporaneous sales AGE Duration in months since CRSP initiated coverage of the firm VOL Volatility of market-adjusted stock returns RET Market-adjusted, buy-hold stock return ROE Contemporaneous return on equity LOSS Equals one if contemporaneous net income is negative, zero otherwise NOATO Contemporaneous net operating asset turnover
∆NOATO Change in net operating asset turnover OPM Contemporaneous operating profit margin
∆OPM Change in operating profit margin SGROW Contemporaneous sales growth CASHFLOW Contemporaneous cash flow to sales (cash flow equals the sum of GAAP cash
flow from operations and research and development expenditures) INVEST Contemporaneous investment to sales (investment equals the sum of research
and development expenditures and capital expenditures) PAYER Equals one if the firm paid dividends in the current year, zero otherwise LEV Contemporaneous ratio of financial obligations to total assets
∆LEV Change in LEV CASH Contemporaneous ratio of financial assets to total assets
∆CASH Change in CASH ACCRUALS Contemporaneous ratio of working-capital accruals to sales C-SCORE Contemporaneous C-score per Penman and Zhang [2000] REV Ratio of most recent forecast revision to equity book value per share LTG The contemporaneous analysts' consensus long-term growth rate in earnings
per share ∆FOLLOW Change in analysts' following during the last three months before the end of
year zero.
Table A1 of Appendix A provides descriptive statistics on the variables shown above. We
motivate our choice of variables as follows.
First, the variables LOGSALE, AGE, and VOL relate to firm lifecycle. In particular,
larger, older firms with low return volatility are more established; hence, ceteris paribus,
forecast precision is higher for these firms. Second, to the extent current stock returns and
accounting performance is indicative of future performance, variables such as RET, ROE,
LOSS, NOATO, ∆NOATO, OPM, ∆OPM, SGROW, and CASHFLOW are relevant. In
particular, higher performance implies higher economic rents, which are not reflected in
either contemporaneous equity book value per share or expected earnings per share in year
19
+1. Similarly, to the extent current investment choices reflect manager’s expectations about
future performance, INVEST is relevant. Third, the variables PAYER, LEV, ∆LEV, CASH,
and ∆CASH relate to financial policy, which is linked to investment policy as well as
performance. Fourth, ACCRUALS and C-SCORE relate to earnings quality (e.g., Sloan
[1996]) and the effect of unconditional conservatism on contemporaneous equity book value
per share (e.g., Feltham and Ohlson [1996]). Finally, extant research (e.g., Hughes, Liu, and
Su [2008]) shows that REV and LTG are related to analysts’ forecast errors, and we add the
variable ∆FOLLOW as changes in coverage may affect the quality of the consensus forecast.
Although our variables are motivated by analytical and empirical results, they are ad
hoc; and, each variable relates to several phenomena. For example, Hughes, Liu, and Su
[2008] shows that RET is related to future analysts’ forecast errors; firms that pay dividends
are typically more stable and mature; firm lifecycle and investment opportunities are related;
etc. Moreover, the a priori relation between the independent variables and the relative
accuracy of our accounting-based valuation models is unclear. For example, the forecasts of
early-stage firms’ earnings are likely less precise. However, these firms are also likely to
have relatively valuable investment opportunities. Hence, earnings forecasts are more
relevant.
In light of the above, we consider the relation between the independent variables and
the relative valuation accuracy of the different accounting-based valuation models to be an
empirical issue. Hence, we estimate the ordered logistic regressions and evaluate their
predictive power. Given the degree of complexity involved (i.e., six regressions with 23
independent variables each), we do not discuss the estimated coefficients for each of the
ordered logistic regressions in the main text. Rather, in the main text we focus on their
predictive power. We relegate the discussion of the details regarding how we estimate each
regression, the estimated coefficients, etc. to Appendix A and Table A2.
20
For each observation in the holdout sample, we use the following approach to arrive
at the valuation model that is predicted to be the most accurate ex ante. First, for each
observation and each valuation model, we determine the fitted probability of each rank (i.e.,
one through six). Second, for each observation and each valuation model we create the
variable FIT_RANK that equals the rank with the highest fitted probability for that
observation and valuation model. Finally, for each observation we select the valuation model
that has the lowest value of FIT_RANK (i.e., lower ranks imply greater accuracy). If, for a
particular observation, two or more valuation models have the lowest value of FIT_RANK,
we choose the valuation model with the highest cumulative probability assigned to
FIT_RANK per the fitted probabilities from the estimated ordered logistic regressions.
We provide the following hypothetical example to clarify our selection algorithm.
For purposes of exposition but without loss of generality we assume there are only three
valuations models: RIV_N, RIV_S, and RIV_L. Hence, there are only three possible ranks.
For a hypothetical observation in the holdout sample assume the fitted probabilities implied
by the estimated coefficients from the ordered logistic regressions are as follows:
Fitted Probability that the Model will have Rank:
One Two Three RIV_N 0.25 0.00 0.75 RIV_S 0.21 0.79 0.00 RIV_L 0.00 0.80 0.20
To choose the valuation model for the hypothetical observation we begin by
determining the value of FIT_RANK for each valuation model. These values are three, two,
and two for the RIV_N, RIV_S, and RIV_L models, respectively. Next, we select the
valuation model with the lowest value of FIT_RANK. However, there is a tie – i.e., both
RIV_S and RIV_L have a FIT_RANK of two. Hence, we select RIV_S because the
21
cumulative probability associated with its FIT_RANK is 1.00, which exceeds the cumulative
probability of 0.80 associated with the FIT_RANK of RIV_L.
As the above example illustrates, our algorithm reduces the likelihood of choosing a
model that will exhibit extreme performance. In particular, the RIV_N model is not chosen
even though it has the highest probability of ranking first (i.e., 0.25). The reason for this is
that it has an even higher probability of ranking last (i.e., 0.75). Moreover, even though
RIV_L model has a higher likelihood of ranking second than the RIV_S model, it is not
chosen because the cumulative probability of RIV_S ranking second is higher.
In Panel A of Table Four we show the percentage of holdout-sample observations for
which each of the six valuation models is selected per the algorithm described above. Two
AEGV models dominate: AEGV_L and AEGV_S, which are selected for 51.42 and 28.27
percent of the observations. Only one RIV model, RIV_N, is selected for more than one-
sixth of the observations (it is selected for 17.82 percent of the observations).
In Panel B of Table Four we evaluate the effectiveness of our selection algorithm. In
particular, we show the percentage of holdout-sample observations for which the selected
model has realized rank one, two, etc. Our algorithm performs well. For 35.80 and 16.56
percent of the observations, the selected model is the best and second best model. Moreover,
although the selected model is the worst model for 14.34 percent of the observations, this is
an improvement vis-à-vis an approach in which one of the most accurate models per Table
Three is used exclusively (i.e., only AEGV_L, only RIV_N, or only AEGV_N). In particular,
as shown in Table Three, these three models are the most inaccurate for 37.58, 39.73, and
18.06 percent of the observations, respectively.
We further evaluate our model-selection algorithm in Panel C of Table Four. We
begin by providing descriptive statistics for the unsigned valuation errors obtained assuming
that for each observation we use the model selected by the algorithm. We refer to this as SM
22
(i.e., the selected model). The median unsigned valuation error for SM is 0.354, which is
smaller than the median unsigned valuation errors of each of the separate models; however,
the mean unsigned valuation error for SM is 0.502. This exceeds the mean unsigned
valuation error for RIV_S (i.e., 0.470), which per Panel B of Table Two is the model with
lowest mean unsigned valuation error.
Given the above results are not clear-cut, we conduct further analyses. In particular,
in the remaining rows we provide descriptive statistics for the difference in the unsigned
valuation error of SM and the unsigned valuation errors obtained from a model selected via
one of five alternative algorithms.
1. An algorithm in which we select the best ex post model for each observation. We
refer to the valuation model selected using this algorithm as BEST. This algorithm
provides us with the lower bound of the valuation error.
2. A random algorithm in which, for each observation, each valuation model has a 1/6th
probability of being selected. We refer to the valuation model selected using this
algorithm as RAN1. This algorithm embeds the assumption that relative accuracy of
each of the valuation models does not depend on firm-level attributes.
3. A random algorithm in which, for each observation, each valuation model has a
probability of being selected that equals the percentage of estimation-sample
observations for which that model is best per the results shown in Table Three. We
refer to valuation the model selected using this algorithm as RAN2. This algorithm
embeds the assumption that the percentage of observations for which a particular
valuation model is best is known ex ante but that the identity of the specific
observations for which the model is best is unknown.
4. A random algorithm in which, for each observation, each valuation model has a
probability of being selected that equals the percentage of estimation-sample
23
observations for which that model is selected by the algorithm that is based on the
results of the ordered logistic regressions. We refer to the valuation model selected
using this algorithm as RAN3. This algorithm embeds the assumption that the
percentage of observations for which a particular valuation model is selected by our
algorithm is known ex ante but that the identity of the specific observations for which
the model is selected is unknown.
5. A random algorithm in which we always use the AEGV_L model, which, per Table
Three, is the valuation model that is most frequently ranked first. This algorithm
embeds the assumption that the model that is most frequently the best ex post is the
best choice for each observation ex ante.
Ignoring the best model, SM (i.e., the selected model per our algorithm) outperforms
all of the models selected via the alternative algorithms. It is more accurate (no less accurate)
than: (1) RAN1 for 50.89 (67.36) percent of the observations; (2) RAN2 for 48.16 (70.70)
percent of the observations; (3) RAN3 for 38.96 (76.86) percent of the observations; and, (4)
AEGV_L 32.56 (83.98) percent of the observations. Moreover, the median difference
between the unsigned valuation error of SM and the BEST model is 0.104. This is roughly
64 percent of the smallest median shown in Panel B of Table Two (i.e., the median of
(|AEGV_S-P|−|BEST-P|)÷P, which equals 0.162).
4.4 Factor Analysis
The algorithm discussed above performs well; however, because of the large number
of independent variables in the logistic regressions, it is difficult to use the results shown in
Table A2 to draw conclusions about the types of firms that are matched to a particular
valuation model. With this in mind we perform exploratory factor analysis on the 23
independent variables. When conducting our factor analysis we only use observations from
the estimation sample. We retain factors with eigenvalues greater than one and we find three
24
factors. The coefficients associated with the raw factor patterns are presented in the first
three columns of Table Five. To clarify our interpretation of the factors we rotate them using
varimax rotation. Factor patterns after rotation are presented in the last three columns of
Table Five. Numbers shown in bold font relate to coefficients that: (1) have a correlation of
0.25 or higher with the factor in the column heading and (2) are not highly correlated with
either of the remaining factors.
First, regarding factor one, OPM, ∆OPM, and CASHFLOW have strong positive
loadings and INVEST has a strong negative loading. Hence, we interpret factor one as
relating to investment opportunities and we refer to it as INV_OPS. This interpretation is
based on the logic that firms with high (low) profits and cash flow but that aren’t (are)
investing arguably have poor (good) investment opportunities. Second, we interpret factor
two as relating to firm lifecycle and we refer to it as LIFECYCLE. This interpretation
reflects the fact that factor two has high positive loadings on LOGSALE, AGE, PAYER, and
LEV but high negative loadings on VOL, LOSS, CASH, and LTG. Finally, factor three loads
positively on NOATO and ∆ΝΟΑΤΟ; hence, we refer to it as NOA_TURN.
To evaluate the factors we modify the selection algorithm described in Section 4.3.
We refer to this as the modified algorithm. First, for each observation in the both the
estimation and holdout sample and each factor we compute a factor score. Second, for each
valuation model we estimate an ordered logistic regression in which the dependent variable is
the rank of the valuation model and the independent variables are the factor scores related to
INV_OPS, LIFECYCLE, and NOA_TURN. When estimating these regressions we only use
observations from the estimation sample. Third, as discussed in section 4.3, we compute
FIT_RANK for each holdout-sample observation and valuation model. Finally, for each
holdout-sample observation, we select the valuation model with the highest value of
FIT_RANK.
25
Table Six shows the results obtained from estimating the modified ordered logistic
regressions on the estimation sample. For each of the three factors we show the estimated
coefficient and two marginal effects. ME_1 (ME_6) is the marginal effect on the probability
that the related valuation model will rank first (sixth) assuming a change of one standard
deviation in the relevant factor and setting the remaining two factors equal to their mean
values. Note that, as show in Table Three, RIV_S and AEGV_S rank sixth for only 0.01 and
0.74 percent of the observations; hence, it is not feasible to calculate the marginal effects of
the changes in the factors on the probability that these models will rank sixth.
The results shown in Table Six illustrate that the factor LIFECYCLE has strong
explanatory power regarding the relative rankings of the different models, and it is the only
factor for which this is the case. Two observations are noteworthy. First, for all of the RIV
models, an increase in LIFECYCLE leads to a significant reduction in the probability that the
model will rank first and a significant increase in the probability that a particular RIV model
will rank last. On the other hand, for all of the AEGV models, an increase in LIFECYCLE
leads to a significant increase in the probability that the model will rank first and, with the
exception of AEGV_S, a significant decrease in the probability that a particular AEGV model
will rank last. Second, differences in LIFECYCLE are also related to differences in forecast
horizon. For example, a one standard deviation change in LIFECYCLE changes the
probability of AEGV_L ranking first (last) by 0.089 (−0.091). However, a one standard
deviation change in LIFECYCLE changes the probability of AEGV_N ranking first by only
0.055 (−0.068).
Finally, the pseudo r-squared of each of the ordered logistic regressions is less than
five percent. Hence, whether the factors, particularly LIFECYCLE, provide useful
information about the relative accuracy of the valuation models is unclear. With this in mind,
we evaluate the accuracy of the modified algorithm. To do this we follow the same
26
procedure we used to evaluate the accuracy of the original algorithm based on the full set of
23 variables.
In Panel A of Table Seven we show the percentage of holdout-sample observations
for which each of the valuation models is selected by the modified algorithm. Similar to the
original algorithm, the modified algorithm selects either the AEGV_S and AEGV_L models
(39.49 and 48.37 percent of the observations) or the RIV_N model (12.15 percent of the
observations). However, as shown in Panel B of Table Seven, the modified algorithm does
not perform as well as the original algorithm, which is not surprising given there is a loss of
information when the 23 independent variables are reduced to three factors. Nonetheless, the
modified algorithm performs better than a simple, one-size-fits-all approach. For example,
the modified algorithm selects the best model for 30.32 percent of the observations, which, as
shown in Table Three, is higher than the percentage related to any specific model. Moreover,
the modified algorithm selects the worst model much less frequently than any of the three
best models per Table Three (i.e., AEGV_L, RIV_N, and AEGV_N). In particular, the
modified algorithm selects the worst model for 13.93 percent of the observations; whereas, as
shown in Table Three, AEGV_L, RIV_N, and AEGV_N is the worst model for 37.58, 39.73,
and 18.06 percent of the estimation-sample observations.
In Panel C of Table Seven we provide additional results regarding the performance of
the modified algorithm. In particular, we provide descriptive statistics regarding the
difference between the unsigned valuation errors of the selected model via the modified
algorithm (i.e., SM) and the unsigned valuation errors of models selected by various
alternative criteria, which we describe in Section 4.3. The findings are similar to those based
on original algorithm reported in Panel D of Table Four. In particular, the modified
algorithm is more accurate (no less accurate) than: (1) RAN1 for 48.28 (64.89) percent of the
observations; (2) RAN2 for 46.32 (67.80) percent of the observations; (3) RAN3 for 34.33
27
(74.93) percent of the observations; and, (4) AEGV_L for 32.22 (80.59) percent of the
observations. Moreover, the median difference between the unsigned valuation error of SM
and the BEST model is 0.144.
Finally, in Table Eight we present analyses that are analogous to those in Table
Seven. The key difference is that in Table Eight we only use the LIFECYLCE factor to
determine the selected model.2 The findings are virtually identical to those in Table Seven.
Hence, these results demonstrate a key result: Where a firm lies in its lifecycle is the primary
determinant of the best valuation model for that firm.
5. Sensitivity Analyses
5.1 Alternative Approaches to Forecasting Earnings
As discussed in section 3.2, we obtain our earnings forecasts for years t+1 through t+5
from I/B/E/S. This is a common approach. Nonetheless, recent results in Hou, van Dijk, and
Zhang [2011] (HVZ hereafter) show that (abstract): “earnings forecasts generated [from a]
cross-sectional model are superior to analysts’ forecasts in terms of coverage, forecast bias,
and earnings response coefficient. Moreover, the model-based ICC [(i.e., implied cost of
capital)] is a more reliable proxy for expected returns than the ICC based on analysts’
forecasts.”
The above quotation implies that it is possible that our results lack generality. In
particular, if I/B/E/S-based forecasts are not representative of the earnings expectations
embedded in price and the effect that these errors have on valuation accuracy varies across
the valuation models we evaluate, our results may be an artifact of using I/B/E/S data.
2 In particular, using the estimation sample we estimate univariate ordered logistic regressions for each valuation model in which LIFECYCLE is the only independent variable. Next, we use the estimated coefficients from the univariate ordered logistic regressions to determine FIT_RANK for each holdout-sample observation and valuation model. Finally, for each holdout-sample observation we select the valuation model with the lowest value of FIT_RANK.
28
To address this potential concern, we compute six alternative value estimates based
on one of the models described in section 3.1, however, using earnings forecasts implied by a
modified version of the cross-sectional forecasting model described in HVZ.3 We refer to
these value estimates as the HVZ-based estimates and we refer to our original value estimates
as the I/B/E/S-based estimates. Next, we compare the valuation accuracy of each HVZ-based
estimate to its I/B/E/S-based counterpart. Untabulated results show that the I/B/E/S-based
estimate per AEGV_N, AEGV_S, AEGV_L, RIV_N, RIV_S, RIV_L is more accurate than
the HVZ-based estimate for 81.2, 75.4, 67.9, 0.0, 81.2, and 77.1 percent of the estimation-
sample observations.4 Hence, we conclude that the I/B/E/S-based value estimates are more
reliable than HVZ-based value estimates, and that our results do not appear to be an artifact
of using I/B/E/S data.
5.2 Potential Hindsight Bias
As discussed in section 3.3, the estimation sample is formed by randomly selecting
half of the observations from each industry-year. This implies that the coefficients estimated
via the logistic regressions reflect information from all the years between 1980 and 2010
inclusive. These coefficients are then used to determine the selected model for each
observation in the holdout sample regardless of the date of the observation. Hence, our
results may be affected by hindsight bias.
To determine whether hindsight bias is an issue we compare the selected model per
our algorithm (i.e., the original algorithm) to the selected model from an alternative algorithm
that is based on “rolling” regressions (i.e., the rolling algorithm). In particular, for each year t
∈ [1990, 2010] we estimate logistic regressions that are identical to the regressions discussed
3 In particular, we use a modified version of equation (1) in HVZ. Our modification is to add an additional regressor that equals the interaction of Ei,t (i.e., earnings of firm i in year t) and NegEi,t (an indicator that equals one if Ei,t < 0 and zero otherwise). 4 The RIV_N model does not rely on earnings forecasts; hence, the HVZ-based estimate is identical to the I/B/E/S-based estimate. Consequently, the I/B/E/S-based estimate is neither more nor less accurate than the HVZ-based estimate.
29
in section 4.3 with the exception that we only use data from year t−10 through year t−1
inclusive.5 Hence, the estimates obtained from the rolling regressions are not affected by
hindsight bias.
Untabulated results show that the rolling algorithm yields very similar results as the
original algorithm. When the rolling algorithm is used the RIV_N, RIV_S, RIV_L,
AEGV_N, AEGV_S, and AEGV_L model is selected for 18.01, 1.24, 1.62, 1.11, 22.65, and
55.37 percent of the observations that relate to the years occurring between 1990 and 2010
inclusive. These percentages are very similar to the percentages shown in Panel A of Table
Four, which are 17.83, 1.73, 0.36, 0.39, 28.87, and 51.42 for the RIV_N, RIV_S, RIV_L,
AEGV_N, AEGV_S, and AEGV_L model, respectively.6 Moreover, the selected model per
the rolling algorithm has rank one, two three, four, five, or six for 36.39, 14.00, 9.29, 9.92,
13.63, and 16.78 percent of the observations that relate to the years occurring between 1990
and 2010 inclusive. These percentages are very similar to the percentages shown in Panel B
of Table, which are 35.80, 16.56, 9.23, 10.37, 13.72, and 16.78 for rank one, two, three, four,
five, and six, respectively. In light of these facts we conclude that our results based on the
original algorithm are not attributable to hindsight bias.
6. Conclusion
Valuation is ultimately a practical endeavor. Accounting-based valuation models are
practical because they can be used to develop rigorous estimates of value without having to
make impractical and arbitrary assumptions about dividend policy. Nonetheless, even when
accounting-based models are used, another practical question remains: When steady-state 5 Because we require ten years of historical data for the rolling regressions, we can only use the rolling algorithm to select a valuation model for observations that relate to the year 1990 or later. 6 We also identify the observations that have a selected model per both original and rolling algorithm. Untabulated results show that for 79.04 percent of these observations the selected model per the rolling algorithm is identical to the selected model per the original algorithm.
30
forecasting is infeasible, which model is most appropriate? To address this question, we
evaluate two widely used classes of models, residual income and abnormal earnings growth,
and three forecast horizons for each model class.
Our study has three primary findings that are relevant to both academics and
practitioners who use accounting-based valuation models. First, we demonstrate that there is
no single, best valuation model; and, that models that are considered “best” per extant
research frequently perform poorly. Second, we develop a model selection algorithm and we
show that relative valuation accuracy is predictable ex ante based on knowledge of a firm’s
attributes. Third, we demonstrate that where a firm is its lifecycle is a key determinant of the
identity of the appropriate valuation model for that firm. This final result is important as it
suggests a parsimonious and practical criterion that researchers and practitioners can use to
choose among valuation models.
31
Appendix A
In this appendix we provide descriptive statistics and results of estimating the ordered
logistic regressions that we use in our original selection algorithm (i.e., the algorithm that is
described in section 4.3). Descriptive statistics are shown in Table A1. The results of
estimating the ordered logistic regressions are shown in Table A2.
Variables are defined as follows. Please note that in the definitions, we refer to year
zero, which is pertinent because our value estimates are made as of the end of year zero. In
addition, we omit firm subscripts. LOGSALE is the log of COMPUSTAT data item SALE
for year zero. AGE is the number of months between the date the observation first appears
on CRSP and the last fiscal month of year zero. VOL is the volatility of daily, market-
adjusted stock returns for year zero. RET is the buy-hold, market adjusted stock return for
year zero. The data for both VOL and RET are from the CRSP daily stock return files. ROE
is return on equity and equals the ratio of the year zero value of COMPUSTAT data item IB
(i.e., income before extraordinary items) to the year -1 value of COMPUSTAT data item
CEQ (i.e., common equity). LOSS is an indicator variable that equals one if IB in year zero
is negative and zero otherwise. The definitions NOATO, ∆NOATO, OPM, and ∆OPM are
provided in the appendix on pp. 149-151 of Nissim and Penman [2001]. SGROW equals the
annual growth rate in year zero of the COMPUSTAT data item SALE. CASHFLOW equals
the sum of COMPUSTAT data item OANCFC in year zero (i.e., net cash flow from
continuing operations) and COMPUSTAT data item RDIP in year zero (i.e., research and
development expense) divided by COMPUSTAT data item SALE in year zero. INVEST
equals the sum of COMPUSTAT data item RDIP in year zero and COMPUSTAT data item
PPENT in year zero (i.e., net property, plant, and equipment). PAYER equals one if
COMPUSTAT data item DVPSX_F (dividends per share – ex date – fiscal) is positive in
year zero, and zero otherwise. LEV (∆LEV) equal the ratio of (the change in) financial
32
obligations for year zero to COMPUSTAT data item AT (i.e., total assets) in year zero.
CASH, and ∆CASH equal the ratio of (the change in) financial assets for year zero to
COMPUSTAT data item AT in year zero. The definitions financial obligations and financial
assets are provided in the appendix on pp. 149-151 of Nissim and Penman [2001].
ACCRUALS equals the ratio of accruals in year zero, which are measured via the balance-
sheet method described in Sloan [1996], to COMPUSTAT data item SALE in year zero. We
use the definition of C-score shown in Penman and Zhang [2002]. REV equals the change in
the analysts’ consensus forecast of earnings per share in year plus one. The change is
measured by subtracting the value at the end of third fiscal quarter of year zero from the
value at the end of the fourth fiscal quarter of year zero. We deflate REV by COMPUSTAT
data item CEQ as of year -1. LTG is the analysts’ consensus long-term growth rate in
earnings per share as of the last fiscal month of year zero. ∆FOLLOW is the change in
analyst following during year zero.
33
References
Easton, P. 2007. Estimating the cost of capital implied by market prices and accounting data.
Foundations and Trends in Accounting 241-364.
Easton, P., and S. J. Monahan. 2005. An evaluation of accounting based measures of
expected returns. The Accounting Review 80 (2): 501-538.
Easton, P., and S. J. Monahan. 2010. Evaluating accounting-based measures of expected
returns: Easton and Monahan and Botosan and Plumlee redux. Unpublished Manuscript.
University of Notre Dame and INSEAD.
Fama, E., and K. French. 1997. Industry costs of equity. Journal of Financial Economics 43:
154-194.
Feltham, G., and J. Ohlson. 1996. Uncertainty resolution and the theory of depreciation
measurement. Journal of Accounting Research 34 (2): 209-234.
Francis, J., P. Olsson, and D. Oswald. 2000. Comparing the accuracy and explainability of
dividend, free cash flow, and abnormal earnings equity value estimates. Journal of
Accounting Research 38 (1): 45-70.
Hou, K., M. van Dijk, and Y. Zhang. 2011. The implied cost of capital: A new approach.
Forthcoming Journal of Accounting and Economics.
Hughes, J., J. Liu, and W. Su. 2008. On the relation between predictable market returns and
predictable analyst forecast errors. Review of Accounting Studies 13: 266-291.
Jorgensen, B., Y. G. Lee, and Y. K. Yoo. 2011. The valuation accuracy inferred from
empirical implementations of the abnormal earnings growth model: U.S. evidence.
Journal of Business Finance & Accounting 38 (4): 446-471.
Modigliani, F., and M. H. Miller. 1961. Dividend policy, growth, and the valuation of shares.
The Journal of Business 34 (4): 411-433.
Nissim, D., and S. H. Penman. 2001. Ratio analysis and equity valuation: From research to
practice. Review of Accounting Studies 6: 109-154.
Ohlson, J. 1995. Earnings, book values, and dividends in equity valuation. Contemporary
Accounting Research 11: 661-687.
34
Penman, S. H., and T. Sougiannis, 1998. A comparison of dividend, cash flow, and earnings
approaches to equity valuation. Contemporary Accounting Research 15 (3): 343-383.
Penman, S. H., and X. Zhang. 2002. Accounting conservatism, quality of earnings, and stock
returns. The Accounting Review 77 (2): 237-264.
Sloan, R. 1996. Do stock prices fully reflect information in accruals and cash flows about
future earnings? The Accounting Review 71: 289-315.
Zhang, X. 2000. Conservative accounting and equity valuation. Journal of Accounting and
Economics 29 (1): 125-149.
35
Figure One
-150%
-100%
-50%
0%
50%
100%
0 1 2 3 4 5 6 7 8 9 10
Gra
nd M
ean
of th
e M
edia
ns o
f RO
E fo
r P
ortfo
lio p
in th
e k'
th Y
ear
F
ollo
win
g th
e P
ortfo
lio-f
orm
atio
n Ye
ar
Year Relative to Porfolio Formation Year
36
Table One – Descriptive Statistics and Correlations (Estimation Sample) Panel A – Descriptive Statistics for Model Inputs and Key Firm Attributes
MEAN STD MIN P1 P5 P10 Q1 P50 Q3 P90 P95 P99 MAXMVE 3,238.6 14,034.2 0.6 11.4 28.3 45.5 113.1 371.0 1,429.8 5,341.4 12,627.9 56,856.0 460,767.9ASSET 5,062.7 38,945.0 0.9 9.8 24.9 41.8 107.4 384.8 1,762.07,018.6 16,339.0 75,889.0 1,916,658.4SALE 2,604.4 11,076.8 -299.3 0.8 13.5 29.6 91.9 330.2 1,307.6 4,887.3 10,467.0 42,539.6 425,071.0BM 0.575 0.455 -11.961 -0.064 0.105 0.169 0.301 0.494 0.750 1.068 1.334 2.051 11.633EP 0.063 0.115 -5.153 -0.291 -0.052 0.013 0.046 0.070 0.098 0.134 0.162 0.232 5.331βLEV_FIRM 0.962 0.446 0.000 0.065 0.267 0.400 0.632 0.933 1.269 1.597 1.763 1.944 2.000r 0.113 0.036 0.024 0.046 0.058 0.068 0.087 0.111 0.136 0.1610.178 0.210 0.249
ROE1 0.130 3.987 -300.403 -1.200 -0.174 0.024 0.094 0.144 0.204 0.291 0.386 0.962 519.786
ROE2 0.169 4.319 -250.557 -1.273 -0.042 0.052 0.107 0.152 0.208 0.291 0.386 1.402 426.775
ROE3 0.146 16.387 -2,368.667 -0.932 0.012 0.064 0.113 0.156 0.211 0.295 0.398 1.382 1,687.318
ROE4 0.174 3.345 -529.862 -0.645 0.028 0.070 0.115 0.158 0.214 0.297 0.404 1.266 161.842
ROE5 0.199 3.392 -87.718 -0.379 0.039 0.074 0.116 0.160 0.216 0.299 0.400 1.028 560.541
RONEC2 328.739 34,750.513 -4,866.412 -1.846 -0.111 0.000 0.062 0.177 0.275 0.500 0.894 3.923 6,223,529.409
RONEC3 267.489 22,325.700 -4,866.412 -0.759 0.000 0.000 0.097 0.175 0.250 0.380 0.566 2.130 3,620,370.360
RONEC4 400.836 48,095.302 -4,866.412 -0.356 0.000 0.000 0.100 0.173 0.250 0.355 0.500 1.729 8,831,999.990
RONEC5 207.066 14,556.055 -4,866.412 0.000 0.000 0.000 0.100 0.175 0.250 0.350 0.500 1.173 1,839,999.997
37
Panel B – Descriptive Statistics for Signed Valuation Errors
MEAN STD MIN P1 P5 P10 Q1 P50 Q3 P90 P95 P99 MAX(RIV_N-P)÷P -0.415 0.523 -10.023 -1.064 -0.893 -0.829 -0.696 -0.502 -0.243 0.084 0.351 1.075 32.822(RIV_S-P)÷P -0.374 0.572 -32.398 -1.621 -0.888 -0.748 -0.587 -0.400 -0.180 0.078 0.273 0.913 23.541(RIV_L-P)÷P -0.258 0.983 -81.628 -2.109 -0.924 -0.743 -0.541 -0.310 -0.026 0.358 0.721 1.864 19.988(AEGV_N-P)÷P -0.452 1.742 -192.344 -4.813 -1.546 -0.876 -0.581 -0.348 -0.077 0.245 0.530 1.427 35.340(AEGV_S-P)÷P -0.156 2.092 -152.269 -3.604 -1.068 -0.743 -0.479 -0.213 0.126 0.655 1.204 3.423 69.589(AEGV_L-P)÷P 0.941 22.897 -1993.076 -4.506 -1.058 -0.806 -0.451 -0.063 0.497 1.597 2.937 20.919 1570.048 Panel C – Correlations
P RIV_N RIV_S RIV_L AEGV_N AEGV_S AEGV_LP CORR 1.00 0.88 0.74 0.73 0.58 0.61 0.51
T-STAT 24.18 7.53 7.46 4.61 5.31 5.88RIV_N CORR 0.71 1.00 0.84 0.81 0.65 0.66 0.54
T-STAT 43.57 10.62 10.06 5.99 6.45 6.17RIV_S CORR 0.81 0.90 1.00 0.99 0.87 0.89 0.68
T-STAT 49.98 123.23 369.23 13.10 13.72 11.41RIV_L CORR 0.80 0.80 0.96 1.00 0.88 0.91 0.72
T-STAT 50.06 73.02 656.99 13.23 14.12 12.67AEGV_N CORR 0.78 0.75 0.91 0.93 1.00 0.94 0.68
T-STAT 37.53 48.95 171.45 204.14 66.78 11.76AEGV_S CORR 0.78 0.68 0.90 0.97 0.91 1.00 0.80
T-STAT 49.06 40.37 170.83 450.80 135.61 20.37AEGV_L CORR 0.70 0.56 0.79 0.89 0.78 0.95 1.00
T-STAT 46.25 27.08 82.11 149.70 65.60 315.60
38
The estimation sample consists of 35,473 firm-years. MVE is the market capitalization (CSHO × PRCC_F) at the end of year zero. ASSET is the total assets (AT) at the end of year zero. SALE is sales during the year zero. BM is the book to market ratio, defined as the book value of common equity (CEQ) over the market capitalization. EP is lead earnings to price ratio defined as I/B/E/S analysts’ consensus (median) forecast of one-year ahead earnings per share over stock price at the end of year zero. βLEV_FIRM is the estimated firm-level beta. See section 3.2 for detailed definitions. r is the cost of equity capital. See section 3.2 for detailed definitions. ROE1, ROE2, ROE3, ROE4, and ROE5 are expected return on equity, defined as earnings per share over the beginning balance of book value per share, for year one through five, respectively. RONEC 2, RONEC 3, RONEC 4, and RONEC 5 are expected return on new equity capital for years two through five. P is the share price at the I/B/E/S database release date. RIV_N, RIV_S, and RIV_L are the estimates of a firm’s share price at time zero based on RIV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. AEGV_N, AEGV_S, and AEGV_L are the estimates of a firm’s share price at time zero based on AEGV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. See section 3.1 and 3.2 for detailed definitions. Regarding the correlations shown in Panel C, Pearson product moment (Spearman rank order) correlations are shown above (below) diagonal. Correlations are calculated as the means of annual the correlations and t-statistics based on the temporal standard errors of the annual coefficients.
39
Table Two – Evaluation of Unsigned Valuation Errors (Estimation Sample) Panel A – Descriptive Statistics for Unsigned Valuation Errors
MEAN STD MIN P1 P5 P10 Q1 P50 Q3 P90 P95 P99 MAX|RIV_N-P|÷P 0.529 0.408 0.000 0.014 0.070 0.138 0.312 0.5280.712 0.847 0.920 1.337 32.822|RIV_S-P|÷P 0.470 0.497 0.000 0.011 0.053 0.105 0.238 0.4230.603 0.771 0.950 1.870 32.398|RIV_L-P|÷P 0.507 0.881 0.000 0.008 0.041 0.079 0.200 0.3960.607 0.860 1.149 2.944 81.628|AEGV_N-P|÷P 0.628 1.687 0.000 0.009 0.042 0.083 0.211 0.408 0.631 1.000 1.746 5.071 192.344|AEGV_S-P|÷P 0.662 1.991 0.000 0.007 0.035 0.072 0.184 0.376 0.633 1.143 1.902 5.719 152.269|AEGV_L-P|÷P 2.195 22.811 0.000 0.008 0.041 0.081 0.211 0.465 0.875 2.006 3.759 28.979 1993.076|BEST-P|÷P 0.203 0.246 0.000 0.001 0.008 0.016 0.046 0.124 0.288 0.497 0.650 0.928 14.776 Panel B – Comparison of Each Model to the Best Model
MEAN STD MIN P1 P5 P10 Q1 P50 Q3 P90 P95 P99 MAX(|RIV_N-P|-|BEST-P|)÷P 0.326 0.361 0.000 0.000 0.000 0.000 0.067 0.299 0.510 0.668 0.755 1.032 30.229(|RIV_S-P|-|BEST-P|)÷P 0.267 0.385 0.000 0.000 0.000 0.000 0.090 0.215 0.370 0.508 0.608 1.251 20.949(|RIV_L-P|-|BEST-P|)÷P 0.304 0.776 0.000 0.000 0.000 0.000 0.085 0.190 0.332 0.549 0.828 2.342 66.852(|AEGV_N-P|-|BEST-P|)÷P 0.425 1.587 0.000 0.000 0.000 0.000 0.055 0.209 0.394 0.722 1.346 4.494 177.569(|AEGV_S-P|-|BEST-P|)÷P 0.459 1.915 0.000 0.000 0.000 0.000 0.049 0.162 0.359 0.847 1.566 5.206 137.493(|AEGV_L-P|-|BEST-P|)÷P 1.992 22.799 0.000 0.000 0.000 0.000 0.000 0.242 0.647 1.796 3.525 28.714 1992.996 The estimation sample consists of 35,473 firm-years. P is the share price at the I/B/E/S database release date. RIV_N, RIV_S, and RIV_L are the estimates of a firm’s share price at time zero based on RIV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. AEGV_N, AEGV_S, and AEGV_L are the estimates of a firm’s share price at time zero based on AEGV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. BEST is the estimate of firm’s share price at time zero based on the model with the lowest unsigned valuation error for each observation. See section 3.1 and 3.2 for detailed definitions.
40
Table Three – Relative Model Rankings (Estimation Sample)
1 2 3 4 5 6RIV_N 19.10% 9.49% 7.53% 8.04% 16.12% 39.73%RIV_S 10.41% 21.92% 18.07% 19.87% 29.73% 0.01%RIV_L 10.50% 17.74% 31.08% 26.83% 9.98% 3.88%AEGV_N 17.06% 11.01% 25.28% 20.22% 8.37% 18.06%AEGV_S 14.41% 32.89% 10.24% 14.81% 26.91% 0.74%AEGV_L 28.53% 6.95% 7.81% 10.24% 8.89% 37.58%
Percentage of Obs. for which Model Ranks:
The estimation sample consists of 35,473 firm-years. Table Three shows the percentage of observations for which each valuation model ranks first, second, third, fourth, fifth, and sixth in terms of relative unsigned valuation errors. RIV_N, RIV_S, and RIV_L are the estimates of a firm’s share price at time zero based on RIV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. AEGV_N, AEGV_S, and AEGV_L are the estimates of a firm’s share price at time zero based on AEGV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. See section 3.1 and 3.2 for detailed definitions.
41
Table Four – Attributes of the Selected Model per the Original Algorithm (Holdout Sample)
Panel A – Percentage of Holdout-sample Observations for which a Particular Model is
Selected
RIV_N RIV_S RIV_L AEGV_N AEGV_S AEGV_L17.82% 1.73% 0.36% 0.39% 28.27% 51.42%
Panel B – Percentage of Holdout-sample Observations for which the Selected Model has
Rank N
1 2 3 4 5 635.80% 16.56% 9.23% 10.37% 13.72% 14.34%
The holdout sample consists of 23,696 firm-years with available data. RIV_N, RIV_S, and RIV_L are the estimates of a firm’s share price at time zero based on RIV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. AEGV_N, AEGV_S, and AEGV_L are the estimates of a firm’s share price at time zero based on AEGV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. See section 3.1 and 3.2 for detailed definitions.
42
Panel C – Descriptive Statistics for Unsigned Valuation Errors (Holdout Sample)
MEAN STD % < 0 % ≤ 0 MIN P1 P5 P10 Q1 P50 Q3 P90 P95 P99 MAX|SM-P|÷P 0.502 0.871 0.000 0.007 0.033 0.067 0.165 0.354 0.605 0.907 1.304 3.135 43.320(|SM-P|-|BEST-P|)÷P 0.307 0.813 0.00% 35.80% 0.000 0.000 0.000 0.000 0.000 0.104 0.355 0.718 1.138 2.867 42.045(|SM-P|-|RAN1-P|)÷P -0.276 6.827 50.89% 67.36% -629.766 -3.619 -0.744 -0.494 -0.233 -0.009 0.085 0.410 0.703 2.037 26.111(|SM-P|-|RAN2-P|)÷P -0.403 7.319 48.16% 70.70% -436.275 -4.551 -0.896 -0.544 -0.240 0.000 0.049 0.352 0.627 1.837 38.942(|SM-P|-|RAN3-P|)÷P -0.792 18.299 38.96% 76.86% -2188.144 -7.857 -1.323 -0.644 -0.186 0.000 0.000 0.240 0.456 1.303 14.447(|SM-P|-|AEGV_L-P|)÷P -1.490 21.917 32.56% 83.98% -2188.144 -24.351 -2.420 -1.073 -0.203 0.000 0.000 0.119 0.259 0.609 12.163
NA
The holdout sample consists of 23,696 firm-years with available data. SM is the estimate of a firm’s share price at time zero obtained assuming that for each observation we use the model selected by the algorithm described in section 4.3. BEST is the estimate of a firm’s share price at time zero based on the model with the lowest unsigned valuation error for each observation. RAN1 is the estimate of a firm’s share price at time zero from the valuation model selected using the algorithm that for each observation, each valuation model has a 1/6th probability of being selected. RAN2 is the estimate of a firm’s share price at time zero from the valuation model selected using the algorithm that for each observation, each valuation model has a probability of being selected that equals the percentage of observations for which that model is best per Table Three. RAN3 is the estimate of a firm’s share price at time zero from the valuation model selected using the algorithm that for each observation, each valuation model has a probability of being selected that equals the percentage of observations for which that model is selected by the algorithm that is based on the results of the ordered logistic regressions. AEGV_L are the estimates of a firm’s share price at time zero based on AEGV Model after assuming a horizon of fifteen years. P is the share price at the I/B/E/S database release date. See section 4.3 for detailed definitions.
43
Table Five – Results of Factor Analysis (Estimation Sample)
INV_OPS LIFECYCLE NOA_TURN INV_OPS LIFECYCLE NOA_TURNLOGSALE 0.157 -0.693 0.011 0.0473 0.7092 0.0102AGE 0.072 -0.594 0.010 -0.0215 0.5977 0.0079VOL -0.111 0.616 0.001 -0.0140 -0.6260 0.0028RET -0.017 0.033 0.022 -0.0118 -0.0352 0.0224ROE 0.023 -0.071 0.008 0.0117 0.0738 0.0078LOSS -0.090 0.428 -0.010 -0.0220 -0.4368 -0.0096NOATO -0.001 0.023 0.831 -0.0060 -0.0204 0.8310∆NOATO 0.000 -0.006 0.831 -0.0094 0.0081 0.8305OPM 0.961 0.085 0.000 0.9621 0.0654 0.0099∆OPM 0.624 0.109 -0.001 0.6337 -0.0102 0.0056SGROW 0.010 0.137 0.004 0.0308 -0.1343 0.0047CASHFLOW 0.683 0.104 0.000 0.6905 0.0038 0.0071INVEST -0.767 -0.071 -0.001 -0.7682 -0.0497 -0.0082PAYER 0.091 -0.661 0.001 -0.0128 0.6670 -0.0004LEV -0.077 -0.302 0.001 -0.1234 0.2859 -0.0013∆LEV 0.054 0.044 -0.027 0.0609 -0.0354 -0.0261CASH -0.072 0.562 0.007 0.0166 -0.5665 0.0085∆CASH 0.013 -0.059 -0.003 0.0037 0.0605 -0.0035ACCRUALS 0.894 0.080 0.000 0.8956 0.0600 0.0087C-SCORE -0.009 0.049 0.006 -0.0009 -0.0497 0.0057REV 0.004 -0.019 0.003 0.0012 0.0199 0.0026LTG -0.024 0.367 0.000 0.0336 -0.3663 0.0011∆FOLLOW -0.016 0.128 0.008 0.0040 -0.1290 0.0080
EIGENVALUE 3.240 2.466 1.382
Raw Factor Pattern Factor Pattern after Varimax Rotation
This subset of the estimation sample consists of 23,799 firm-years with available data. LOGSALE is a natural log of contemporaneous sales. AGE is a year-to-date, duration in months since CRSP initiated coverage of the firm. VOL is a year-to-date, market-adjusted stock return volatility. RET is a year-to-date, market-adjusted, buy-hold stock return. ROE is the contemporaneous return on equity. LOSS equals one if contemporaneous net income is negative, zero otherwise. NOATO is contemporaneous net operating asset turnover. ΔNOATO is a change in net operating asset turnover. OPM is a contemporaneous operating profit margin. ΔOPM is a change in operating profit margin. SGROW is a contemporaneous sales growth. CASHFLOW is a contemporaneous cash flow to sales (cash flow equals the sum of GAAP cash flow from operations and research and development expenditures). INVEST is the contemporaneous investment to sales (investment equals the sum of research and development expenditures and capital expenditures). PAYER equals one if the firm paid dividends in the current year, zero otherwise. LEV is a Contemporaneous ratio of financial obligations to total assets. ∆LEV is a change in LEV. CASH is a contemporaneous ratio of financial assets to total assets. ∆CASH is a change in CASH. ACCRUALS is a contemporaneous ratio of working-capital accruals to sales. C-SCORE is a contemporaneous C-score per Penman and Zhang [2000]. REV is Ratio of most recent forecast revision to equity book value per share. LTG is the contemporaneous analysts' consensus long-term growth rate in earnings per share. ∆FOLLOW is a change in analysts' following during the last three months before the end of year zero.
44
Table Six – Results Obtained from Estimating Ordered Logistic Regressions of Ranks on Factor Scores (Estimation Sample)
Est. p-value Est. p-value Est. p-value Est. p-value Est. p-value Est. p-valueINV_OPS -0.024 0.185 -0.013 0.313 -0.010 0.415 -0.022 0.2770.027 0.074 0.027 0.157
ME_1 0.003 0.185 0.001 0.233 0.001 0.381 0.003 0.192 -0.004 0.101 -0.006 0.098ME_6 -0.006 0.185 0.000 0.381 -0.003 0.192 0.006 0.098
LIFECYCLE 0.692 0.000 0.533 0.000 0.229 0.000 -0.470 0.000 -0.292 0.000 -0.407 0.000ME_1 -0.090 0.000 -0.040 0.000 -0.021 0.000 0.055 0.000 0.038 0.000 0.089 0.000ME_6 0.170 0.000 0.007 0.000 -0.068 0.000 -0.091 0.000
NOA_TURN 0.130 0.004 0.056 0.019 0.008 0.562 -0.015 0.238 -0.025 0.075 -0.023 0.130ME_1 -0.017 0.009 -0.004 0.040 -0.001 0.526 0.002 0.187 0.003 0.059 0.005 0.097ME_6 0.032 0.009 0.000 0.526 -0.002 0.187 -0.005 0.097
INT_6 -0.253 0.000 -8.581 0.000 -3.418 0.000 -1.550 0.000 -4.880 0.000 -0.683 0.000INT_5 0.473 0.000 -0.646 0.000 -2.153 0.000 -1.031 0.000 -1.073 0.000 -0.320 0.000INT_4 0.837 0.000 0.246 0.000 -0.482 0.000 -0.017 0.199 -0.479 0.000 0.081 0.000INT_3 1.185 0.000 0.986 0.000 0.926 0.000 1.188 0.000 -0.0750.000 0.416 0.000INT_2 1.711 0.000 2.421 0.000 2.169 0.000 1.851 0.000 1.704 0.000 0.730 0.000Pseudo R-squared 0.035 0.023 0.004 0.016 0.007 0.013
NA
NA
NA
AEGV_L
NA
NA
NA
RIV_N RIV_S RIV_L AEGV_N AEGV_S
This subset of the estimation ample consists of 23,799 firm-years with available data. INV_OPS, LIFECYCLE, and EARN_QUAL are the factor scores from Factor 1, Factor 2, and Factor 3 shown in Table Five, respectively. ME_1 is the marginal effect on the probability that the related valuation model will rank first assuming a change of one standard deviation in the relevant factor and setting the remaining two factors equal to their mean values. ME_6 is the marginal effect on the probability that the related valuation model will rank sixth assuming a change of one standard deviation in the relevant factor and setting the remaining two factors equal to their mean values. INT_6, INT_5, INT_4, INT_3, and INT_2 are intercepts corresponding to the outcomes ranked sixth, fifth, fourth, third, and second, respectively.
45
Table Seven – Attributes of the Selected Model per the Modified Algorithm Using All Three Factors (Holdout Sample)
Panel A – Percentage of Holdout-sample Observations for which a Particular Model is
Selected
RIV_N RIV_S RIV_L AEGV_N AEGV_S AEGV_L12.15% 0.00% 0.00% 0.00% 39.49% 48.37%
Panel B – Percentage of Holdout-sample Observations for which the Selected Model has
Rank N
1 2 3 4 5 630.32% 18.17% 9.14% 11.24% 17.21% 13.93%
46
Panel C – Descriptive Statistics for Unsigned Valuation Errors
MEAN STD % < 0 % ≤ 0 MIN P1 P5 P10 Q1 P50 Q3 P90 P95 P99 MAX|SM-P|÷P 0.555 1.594 0.000 0.007 0.035 0.071 0.181 0.381 0.647 0.931 1.374 3.375 98.931(|SM-P|-|BEST-P|)÷P 0.359 1.558 0.00% 30.32% 0.000 0.000 0.000 0.000 0.000 0.144 0.400 0.755 1.182 3.029 98.638(|SM-P|-|RAN1-P|)÷P -0.223 6.917 48.28% 64.89% -629.766 -3.315 -0.673 -0.441 -0.200 0.000 0.108 0.423 0.726 2.062 98.050(|SM-P|-|RAN2-P|)÷P -0.351 7.426 46.32% 67.80% -436.275 -4.525 -0.813 -0.489 -0.203 0.000 0.079 0.379 0.659 1.922 98.289(|SM-P|-|RAN3-P|)÷P -0.740 14.224 34.33% 74.93% -1082.932 -7.631 -1.217 -0.550 -0.102 0.000 0.001 0.254 0.451 1.080 88.810(|SM-P|-|AEGV_L-P|)÷P -1.437 21.908 32.22% 80.59% -2188.144 -22.189 -2.218 -1.003 -0.143 0.000 0.000 0.156 0.280 0.620 5.182 The holdout sample consists of 23,696 firm-years with available data. RIV_N, RIV_S, and RIV_L are the estimates of a firm’s share price at time zero based on RIV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. AEGV_N, AEGV_S, and AEGV_L are the estimates of a firm’s share price at time zero based on AEGV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. See section 3.1 and 3.2 for detailed definitions. SM is the estimate of a firm’s share price at time zero obtained assuming that for each observation we use the model selected by the modified algorithm described in section 4.4. BEST is the estimate of a firm’s share price at time zero based on the model with the lowest unsigned valuation error for each observation. RAN1 is the estimate of a firm’s share price at time zero from the valuation model selected using the algorithm that for each observation, each valuation model has a 1/6th probability of being selected. RAN2 is the estimate of a firm’s share price at time zero from the valuation model selected using the algorithm that for each observation, each valuation model has a probability of being selected that equals the percentage of observations for which that model is best per Table Three. RAN3 is the estimate of a firm’s share price at time zero from the valuation model selected using the algorithm that for each observation, each valuation model has a probability of being selected that equals the percentage of observations for which that model is selected by the modified algorithm. AEGV_L are the estimates of a firm’s share price at time zero based on AEGV Model after assuming a horizon of fifteen years. P is the share price at the I/B/E/S database release date.
47
Table Eight – Attributes of the Selected Model per the Modified Algorithm Using only the Life Cycle Factor (Holdout Sample)
Panel A – Percentage of Holdout-sample Observations for which a Particular Model is
Selected
RIV_N RIV_S RIV_L AEGV_N AEGV_S AEGV_L12.12% 0.00% 0.00% 0.00% 39.50% 48.37%
Panel B – Percentage of Holdout-sample Observations for which the Selected Model has
Rank N
1 2 3 4 5 630.32% 18.13% 9.15% 11.24% 17.24% 13.92%
48
Panel C – Descriptive Statistics for Unsigned Valuation Errors
MEAN STD % < 0 % ≤ 0 MIN P1 P5 P10 Q1 P50 Q3 P90 P95 P99 MAX|SM-P|÷P 0.554 1.589 0.000 0.007 0.035 0.071 0.181 0.381 0.647 0.931 1.373 3.375 98.931(|SM-P|-|BEST-P|)÷P 0.359 1.553 0.00% 30.32% 0.000 0.000 0.000 0.000 0.000 0.144 0.400 0.753 1.180 3.029 98.638(|SM-P|-|RAN1-P|)÷P -0.224 6.916 48.27% 64.87% -629.766 -3.323 -0.674 -0.441 -0.200 0.000 0.108 0.423 0.725 2.070 98.050(|SM-P|-|RAN2-P|)÷P -0.351 7.420 46.32% 67.78% -436.275 -4.525 -0.813 -0.490 -0.203 0.000 0.080 0.379 0.658 1.916 98.289(|SM-P|-|RAN3-P|)÷P -0.747 14.308 34.46% 74.99% -1082.932 -8.727 -1.203 -0.546 -0.105 0.000 0.000 0.253 0.451 1.079 98.601(|SM-P|-|AEGV_L-P|)÷P -1.438 21.907 32.22% 80.60% -2188.144 -22.357 -2.221 -1.003 -0.143 0.000 0.000 0.156 0.280 0.620 5.182 The holdout sample consists of 23,696 firm-years with available data. RIV_N, RIV_S, and RIV_L are the estimates of a firm’s share price at time zero based on RIV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. AEGV_N, AEGV_S, and AEGV_L are the estimates of a firm’s share price at time zero based on AEGV Model after making a naïve assumption about the forecast horizon, assuming a horizon of five years, and assuming a horizon of fifteen years, respectively. See section 3.1 and 3.2 for detailed definitions. SM is the estimate of a firm’s share price at time zero obtained assuming that for each observation we use the model selected by the modified algorithm described in section 4.4. BEST is the estimate of a firm’s share price at time zero based on the model with the lowest unsigned valuation error for each observation. RAN1 is the estimate of a firm’s share price at time zero from the valuation model selected using the algorithm that for each observation, each valuation model has a 1/6th probability of being selected. RAN2 is the estimate of a firm’s share price at time zero from the valuation model selected using the algorithm that for each observation, each valuation model has a probability of being selected that equals the percentage of observations for which that model is best per Table Three. RAN3 is the estimate of a firm’s share price at time zero from the valuation model selected using the algorithm that for each observation, each valuation model has a probability of being selected that equals the percentage of observations for which that model is selected by the modified algorithm. AEGV_L are the estimates of a firm’s share price at time zero based on AEGV Model after assuming a horizon of fifteen years. P is the share price at the I/B/E/S database release date.
49
Table A1 – Descriptive Statistics for Firm Attributes (Estimation Sample)
MEAN STD MIN P1 P5 P10 Q1 P50 Q3 P90 P95 P99 MAX NOBSLOGSALE 5.860 2.041 -6.908 0.637 2.707 3.436 4.542 5.811 7.183 8.501 9.260 10.659 12.960 35290AGE 191 190 11 13 19 27 56 127 255 439 642 873 1008 35471VOL 0.086 0.083 -0.051 -0.010 0.007 0.016 0.036 0.067 0.113 0.172 0.222 0.367 3.826 35443RET 0.164 0.735 -1.040 -0.833 -0.604 -0.459 -0.203 0.067 0.359 0.781 1.189 2.677 23.698 35449ROE -0.147 44.670 -8351.000 -1.755 -0.503 -0.209 0.041 0.130 0.202 0.306 0.420 1.103 431.531 35418LOSS 0.201 0.401 0.000 0.000 0.000 0.000 0.000 0.000 0.000 1.000 1.000 1.000 1.000 35470NOATO 5.001 77.796 -0.407 0.154 0.431 0.638 1.232 2.113 3.482 6.109 9.152 25.857 6595.471 32504∆NOATO -0.488 78.331 -6537.678 -16.615 -3.109 -1.451 -0.402 -0.016 0.240 0.834 1.681 8.388 6547.484 31622OPM -1.929 77.378 -8305.500 -8.700 -0.684 -0.144 0.020 0.060 0.112 0.181 0.239 0.494 285.986 35272∆OPM -0.573 70.256 -7865.466 -2.036 -0.259 -0.103 -0.024 0.001 0.023 0.104 0.281 3.042 4725.085 34992SGROW 0.286 3.175 -6.488 -0.537 -0.213 -0.101 0.015 0.113 0.260 0.544 0.874 2.659 385.485 35193CASHFLOW -0.524 41.705 -5428.000 -2.610 -0.128 0.002 0.0700.146 0.253 0.395 0.508 0.870 771.842 31516INVEST 1.592 54.804 -136.960 -0.276 -0.017 0.000 0.016 0.067 0.171 0.403 0.914 6.406 5500.846 35272PAYER 0.481 0.500 0.000 0.000 0.000 0.000 0.000 0.000 1.000 1.000 1.000 1.000 1.000 35473LEV 0.212 0.195 0.000 0.000 0.000 0.000 0.044 0.183 0.326 0.456 0.541 0.794 3.426 35473∆LEV 0.002 0.103 -3.126 -0.273 -0.119 -0.073 -0.028 0.000 0.023 0.087 0.146 0.334 1.953 35425CASH 0.217 0.231 -0.030 0.001 0.006 0.012 0.039 0.129 0.319 0.587 0.732 0.916 1.000 35473∆CASH -0.007 0.099 -0.978 -0.337 -0.166 -0.102 -0.034 -0.001 0.025 0.079 0.131 0.294 0.968 35425ACCRUALS -0.171 16.248 -1152.584 -1.425 -0.328 -0.184 -0.085 -0.035 0.004 0.052 0.097 0.270 1879.000 31516C-SCORE 0.529 11.582 -0.120 0.000 0.000 0.000 0.000 0.016 0.138 0.399 0.856 5.204 1088.263 35473REV -0.034 1.618 -159.136 -0.794 -0.205 -0.109 -0.038 -0.004 0.010 0.039 0.077 0.357 139.791 35473LTG 0.146 0.138 -0.927 0.000 0.000 0.000 0.070 0.130 0.200 0.300 0.350 0.700 1.000 35473∆FOLLOW 0.217 0.787 -0.933 -0.750 -0.500 -0.429 -0.167 0.000 0.333 1.000 1.500 3.167 16.000 28074 LOGSALE is a natural log of contemporaneous sales. AGE is the year-to-date duration in months since CRSP initiated coverage of the firm. VOL is the year-to-date market-adjusted stock return volatility. RET is the year-to-date market-adjusted, buy-hold stock return. ROE is the contemporaneous return on equity. LOSS equals one if contemporaneous net income is negative, zero otherwise. NOATO is contemporaneous net operating asset turnover. ΔNOATO is a change in net operating asset turnover. OPM is a contemporaneous operating profit margin. ΔOPM is a change in operating profit margin. SGROW is a contemporaneous sales
50
growth. CASHFLOW is a contemporaneous cash flow to sales (cash flow equals the sum of GAAP cash flow from operations and research and development expenditures). INVEST is the contemporaneous investment to sales (investment equals the sum of research and development expenditures and capital expenditures). PAYER equals one if the firm paid dividends in the current year, zero otherwise. LEV is a Contemporaneous ratio of financial obligations to total assets. ∆LEV is a change in LEV. CASH is a contemporaneous ratio of financial assets to total assets. ∆CASH is a change in CASH. ACCRUALS is a contemporaneous ratio of working-capital accruals to sales. C-SCORE is a contemporaneous C-score per Penman and Zhang [2000]. REV is Ratio of most recent forecast revision to equity book value per share. LTG is the contemporaneous analysts' consensus long-term growth rate in earnings per share. ∆FOLLOW is a change in analysts' following during the last three months before the end of year zero.
51
Table A2 – Results Obtained from Estimating Ordered Logistic Regressions of Ranks on Firm Attributes (Estimation Sample)
Est. p-value Est. p-value Est. p-value Est. p-value Est. p-value Est. p-valueLOGSALE 0.159 0.000 0.151 0.000 -0.037 0.000 0.011 0.167 -0.067 0.000 -0.126 0.000AGE 0.000 0.427 0.000 0.005 0.000 0.145 0.000 0.076 0.000 0.032 0.000 0.000VOL -2.991 0.000 -2.616 0.000 0.208 0.295 0.281 0.164 2.315 0.000 2.077 0.000RET 0.607 0.000 0.600 0.000 -0.053 0.005 0.092 0.000 -0.472 0.000 -0.566 0.000ROE 0.031 0.007 0.004 0.462 -0.002 0.702 -0.039 0.000 -0.0050.365 0.003 0.605LOSS -1.132 0.000 -0.711 0.000 -0.110 0.001 1.213 0.000 0.879 0.000 0.115 0.001NOATO 0.003 0.004 0.001 0.016 0.000 0.363 -0.001 0.004 -0.001 0.033 0.000 0.538∆NOATO -0.001 0.035 -0.001 0.102 0.000 0.719 0.001 0.021 0.000 0.084 0.000 0.895OPM -0.001 0.302 -0.002 0.111 0.000 0.863 -0.010 0.018 0.0020.080 0.002 0.137∆OPM 0.000 0.698 0.000 0.953 0.000 0.763 0.009 0.007 0.001 0.051 0.000 0.456SGROW 0.000 0.971 0.002 0.847 -0.009 0.316 -0.007 0.508 -0.022 0.043 0.011 0.312CASHFLOW 0.000 0.895 0.001 0.408 0.001 0.572 0.009 0.189 -0.002 0.202 -0.002 0.277INVEST 0.000 0.863 0.000 0.535 0.000 0.860 0.007 0.081 0.0000.917 -0.001 0.383PAYER 0.384 0.000 0.416 0.000 0.253 0.000 -0.003 0.920 -0.253 0.000 -0.464 0.000LEV -0.169 0.023 -0.118 0.099 0.377 0.000 -0.168 0.018 0.0380.600 0.126 0.083∆LEV 0.511 0.000 0.536 0.000 0.277 0.033 0.520 0.000 -0.597 0.000 -0.930 0.000CASH 0.092 0.235 0.800 0.000 -0.563 0.000 1.148 0.000 -0.3100.000 -0.778 0.000∆CASH -0.398 0.004 -0.676 0.000 0.199 0.135 -0.321 0.016 0.523 0.000 0.406 0.003ACCRUALS 0.000 0.939 0.003 0.437 0.001 0.792 0.016 0.117 -0.002 0.572 -0.005 0.209C-SCORE 0.000 0.853 0.001 0.311 0.000 0.622 0.002 0.193 -0.001 0.527 -0.001 0.264REV -0.007 0.381 -0.009 0.233 0.004 0.615 -0.003 0.673 0.0220.010 0.004 0.645LTG 0.836 0.000 0.030 0.763 -1.335 0.000 0.290 0.003 -2.300 0.000 0.941 0.000∆FOLLOW 0.021 0.208 0.002 0.893 -0.068 0.000 -0.008 0.612 -0.081 0.000 0.056 0.000INT_6 -1.187 0.000 -9.675 0.000 -3.046 0.000 -2.144 0.000 -4.486 0.000 0.024 0.705INT_5 -0.397 0.000 -1.617 0.000 -1.777 0.000 -1.593 0.000 -0.534 0.000 0.402 0.000INT_4 0.010 0.884 -0.662 0.000 -0.090 0.145 -0.533 0.000 0.115 0.069 0.820 0.000INT_3 0.399 0.000 0.144 0.022 1.336 0.000 0.698 0.000 0.562 0.000 1.168 0.000INT_2 0.983 0.000 1.658 0.000 2.590 0.000 1.371 0.000 2.451 0.000 1.496 0.000
Pseudo R-squared 0.073 0.053 0.01 0.033 0.04 0.031
AEGV_LRIV_N RIV_S RIV_L AEGV_N AEGV_S
This subset of the estimation sample consists of 23,799 firm-years with available data. LOGSALE is a natural log of contemporaneous sales. AGE is the year-to-date duration in months since CRSP initiated coverage of the firm. VOL is the year-to-date market-adjusted stock return volatility. RET is the year-to-date market-adjusted, buy-hold stock return. ROE is the contemporaneous return on equity. LOSS equals one if contemporaneous net income is negative, zero otherwise. NOATO is contemporaneous net operating asset turnover. ΔNOATO is a change in net operating asset turnover. OPM is a contemporaneous operating profit margin. ΔOPM is a change in operating profit margin. SGROW is a contemporaneous sales growth. CASHFLOW is a contemporaneous cash flow to sales (cash flow equals the sum of GAAP cash flow from operations and research and development expenditures). INVEST is the contemporaneous investment to sales (investment equals the sum of research and development expenditures and capital expenditures). PAYER equals one if the firm paid dividends in the current year, zero otherwise. LEV is a Contemporaneous ratio of financial obligations to total assets. ∆LEV is a change in LEV. CASH is a contemporaneous ratio of financial assets to total assets. ∆CASH is a change in CASH. ACCRUALS is a contemporaneous ratio of working-capital accruals to sales. C-SCORE is a contemporaneous C-score per Penman and Zhang [2000]. REV is Ratio of most recent forecast revision to equity book value per share. LTG is the contemporaneous analysts' consensus long-term growth rate in earnings per share. ∆FOLLOW is a change in
52
analysts' following during the last three months before the end of year zero. INT_6, INT_5, INT_4, INT_3, and INT_2 are intercepts corresponding to the outcomes ranked sixth, fifth, fourth, third, and second, respectively.