extended xbrl taxonomies and financial …€¦ · aaa ais mid-year meeting. ... the work described...
TRANSCRIPT
EXTENDED XBRL TAXONOMIES AND
FINANCIAL ANALYSTS' INFORMATION
Joseph Johnston
Department of Accountancy
City University of Hong Kong
First Draft: May 5th 2015
This Draft: April 14th 2016
Preliminary Draft – Please do not quote
The author thanks workshop participants at the 2015 NTU-SUFE-CityU Research Summer
camp, Airlangga University, Sun Yat-Sen University, Louisiana State University, Nanjing
University, the 2015 Pre-ICIS workshop on Accounting Information Systems, and the 2016
AAA AIS mid-year meeting. I am especially grateful for Brian Lam for his excellent research
assistance. The work described in this paper was fully supported by a grant from the Research
Grants Council of the Hong Kong Special Administrative Region, China (CityU 11502114).
All errors remain my own.
2
Extended XBRL taxonomies and Financial Analysts' Information
ABSTRACT: In this study, I investigate how the use of extended tags in eXtensible Business
Reporting Language (XBRL) filings is associated with the firm’s information environment.
XBRL allows firms to tag data in their financial reports. Prior research suggests that XBRL in
general reduces information processing costs. However, the SEC allows firms to provide
customize tags in their XBRL filings which, on the one hand, may allow firms to customize
financial statements to provide more firms specific information by highlighting financial
statement items that are not included in the standard set of tags. On the other hand, customized
tag may inhibit information search, obfuscate financial statement numbers, and ultimately
reduce the usefulness of XBRL data. Since the use of extended tags is related to the complexity
of the firm, I employ an instrumental variables technique to control for endogeneity. Using
financial analysts’ earnings forecasts error and forecast dispersion as proxies for the firm's
information environment, I find extensions are negatively related to analysts’ forecast error and
dispersion. In additional analysis, I find that the association between extension rate and analysts
information does not vary with earnings management variables but is stronger for firm with
more uncertain information environments and for firms that are more comparable to their
industry peers. Furthermore, extension rate and analysts’ information is insignificant for filings
with technical errors. This study is relevant to regulators creating policies about XBRL
extensions, both in the US and around the world, in suggesting that on average, extensions are
informative to users of the financial statements.
3
I. Introduction
In this study, I examine the association between a firm’s use of extended tags 1 in their
eXtensible Business Reporting Language (XBRL) filings and the firm’s information
environment, as measured by financial analysts’ forecast error and forecast dispersion.
Recently, the SEC has required firms to submit their filings using XBRL with the goal of
decreasing information processing costs and improving investor information by standardizing
financial statement items. The literature generally is consistent with the benefit of decreased
information processing costs but the goal of standardization is less well researched.
Standardization is achieved by applying standard tags to financial statement items that should
be interpreted in a similar manner.2 To allow firms to disclose line items that do not have an
official standardized tag, the SEC permits firms to extend the standard XBRL taxonomy. While
on the one hand, these extended tags could provide investors with important firm-specific
information, there are potential issues in how the firm might use such extended tags. Extended
tags may be used to hide earnings management (Lim, Kim, and Kim 2013), used erroneously
(Debreceny et al 2005, Debreceny, Farewell, Piechocki, Felden, and Graning 2010, Debreceny,
Farewell, Piechocki, Felden, Graning, and d’Eri 2011), or may compromise comparability
(Dhole, Lobo, Mishra, and Pal 2015). I investigate how the use of extended XBRL tags is
associated with financial analysts’ information.
This study is motivated by concerns that company specific extensions to the official SEC
taxonomy reduce the comparability of XBRL filings, eliminating one of the key benefits of
XBRL. A 2011 survey by the CFA institute found that 88 percent of the respondents preferred
to limit or eliminate company-specific tags in favor of having more comparability between
1 An example of the use of XBRL extensions is given in Appendix A. 2 For example, one firm may refer to “operating income” as “operating profits” while another may refer to it as “income from
operations.” Using XBRL, both firms could tag this item with the official tag (“OperatingIncomeLoss”), avoiding any
confusion of semantics. The use of labels in XBRL could facilitate the different word choices while still preserving the
underlying meaning of the words (SEC 2009).
4
firms (CFA Institute 2011). The CFA XBRL Guide for Investors also notes that extended tags
compromise comparability but recognizes that the flexibility of extended tags allows
companies to convey company-specific information in a meaningful way. The SEC, in their
rule mandating XBRL, recognizes this issue as well and noted that the standard taxonomy may
not include all the items needed by companies within particular industries. The SEC also notes
that allowing for firms to create their own tags could improve the interpretation of these line
items by investors and data aggregators. In the SEC Staff Observations from Review of
Interactive Data Financial Statements, the SEC notes the correct way of using extended items,
more recently focusing on use of extended tags when a standard tag or a close equivalent
already exists. (SEC 2009, 2010) Understanding the association between extended tags and
financial analysts’ information allows regulators and practitioners to infer whether extended
tags convey useful information or make XBRL documents more difficult to use.
The results of my study are relevant to regulatory bodies that currently maintain XBRL
regulations as well as jurisdictions interested in implementing XBRL. Jurisdictions vary widely
on their use of extensions in company filings. The US allows firms to use extended tags when
a standard tag for a needed concept does not exist. In contrast, India’s Ministry of Corporate
Affairs requires companies to choose only the standard tag or, if a standard tag doesn’t exist,
the “next-best-fit” element, effectively eliminating the use of extended tags. (Ministry of
Corporate Affairs, Government of India 2012). Examining how the use of extended tags affects
analysts’ information allows these regimes to revise their regulations accordingly. Furthermore,
regimes that may be considering implementing XBRL can make more informed decisions
regarding the use of extended tags. Thus, my study is particularly useful to regulators around
the world.
The use of extensions is very likely affected by the informational demands of the firms.
Firm with more uncertain information ex-ante may have a greater demand for non-standard
5
items. This uncertainty may also be reflected in analysts forecast error and forecast dispersion.
Therefore the use of extension is not exogenous to firm’s information environment. I address
this endogeneity using instrumental variables (IV) regressions where the main instrumental
variable is the number of standard items defined in the US GAAP used by the firm. This allows
me to analyze how the firm’s use of extensions affect analysts’ forecasts error and dispersion
controlling for endogeneity.
I examine the extension rates at several different levels. First, I examine the extension rate
for the complete filing, which is similar to the rate used by other studies. I then make use of a
feature of the XBRL filings that allows me to identify and separate out the different sections of
the quarterly filling. I measure the extension rates for sections identified as financial statements
and as footnote disclosures separately. Finally, I measure extension rates within the balance
sheet, the cash flow statement, and the comprehensive income statement.
Using a sample of 21,800 quarterly filings over the period 2010 to 2014, I find that XBRL
extension rate is positively associated with analysts’ forecast error and analysts’ forecast
dispersion using OLS but negatively associated with analysts’ forecast error and analysts’
forecast dispersion when using IV. I find similar results when I examine the extension rate at
the statement level and at the footnote disclosure level and, by comparing the standardized
coefficients, I find extensions in the financial statements generally have a stronger relation with
analysts’ forecast error and dispersion. When I examine the individual financial statements, I
find a consistently negative relation between analysts’ forecast error and dispersion and
extension rates within the balance sheet, cash flow statement, and income statement: I also
find that the cash flow statement generally has the largest standardized coefficient. Overall, the
evidence suggests that more extensions are associated with a better information environment
and are consistent with the use of extensions being, on average, informative rather than an
obstacle to information collection.
6
In additional analysis, I examine the cross-sectional variability of the association between
analysts’ forecast error and dispersion. Several different firm level factors may influence the
association between extension rate and analysts’ information. First, if extensions capture the
disclosure of firm-specific information, then we would expect firms that operate in more
uncertain environments to disclose more firm-specific information through extensions. Second,
if extensions are used erroneously, we would expect a stronger relationship between extensions
and analysts’ forecasts when there are errors in XBRL filing. Third, if XBRL extensions
compromise comparability, then we would expect a weaker association between extensions
and analysts’ information when firms are more comparable. Finally, if firms are using
extensions to hide earnings management, then we would expect a weaker relationship between
extension rates and analysts’ information when firms have large abnormal accruals. I find a
stronger negative association between XBRL extension rates and analysts’ forecast error and
dispersion when firms operate in more uncertain environments and when firms are less
comparable to their industry peers. I fail to find a significant difference in the association
between extension rates and analysts’ forecast error and dispersion for firms with errors in their
XBRL filing and firms with higher levels of earnings management. This is consistent with the
use of XBRL extension being largely informative in nature about idiosyncratic information.
I contribute to the growing literature on the impacts of XBRL. A number of papers use
XBRL implementation as a proxy for reduced information processing costs. These studies
show that XBRL is related to a better information environment. With particular relevance to
my study, Li, Lin and Ni (2014) and Liu, Wang, and Yao. (2014a) find that firms using XBRL
filings generally have higher analysts’ following and lower analysts’ forecast error. Li et al
(2014) also show that XBRL filings are associated with lower analysts’ forecast dispersion.
However, these studies generally only look at whether the firm files its financial reports in
XBRL and do not consider the use of extended tags within the filings. I find evidence that the
7
use of extended tags does reduce analysts’ forecast error and analysts’ forecast dispersion,
suggesting that extended items do not erode the usefulness of XBRL filing.
Second, I reconcile the mixed evidence in the current literature on the informativeness of
extensions. Hoitash and Hoitash (2014) find that the number of extensions used is associated
with a poorer information environment and suggest that extensions are related to reporting
complexity. On the other hand, Li and Nwaeze (2015) find that the extension rate is negatively
related to the firm’s information environment in the early stages of XBRL implementations but
positively related to the firm’s information environment in later years. My results suggest that,
after correcting for endogeneity, XBRL extensions are robustly positively related to the firm’s
information environment.
The paper is organized as follows. In the next section, I provide a background on XBRL
extensions and a literature review regarding XBRL and the firm’s information environment.
Section 3 describes my methodology and Section 4 provides results. I provide conclusions in
Section 5.
II. Literature Review and Hypothesis Development
Literature Review
Since the SEC started allowing firms to file XBRL documents in its voluntary filing
program, there have been substantial studies investigating the informativeness of XBRL filings,
arguing that XBRL reduces the information processing costs which directly or indirectly
improve the firm’s information environment. Specifically, prior literature finds that XBRL
adoption is related to greater stock returns around filing dates (Efendi, Park, and Subramaniam
2010), lower information risk (Kim, Lim, and No 2012), more quantitative disclosures
(Blankespoor 2012), lower cost of capital (Li et al. 2014), better analysts’ information (Li et al.
8
2014, Liu et al. 2014a),3 more informative stock prices (Dong, Li, Lin, and Ni 2015), more
favorable lending terms (Chen, Kim, and Zhou 2013), and lower earnings management (Lim
et al. 2013). However, these studies only consider the implementation of XBRL, not the use of
extended taxonomies, with the exception of Lim et al. (2013).
Early evidence suggests that XBRL filings contain several errors. These errors include a
validity test (Boritz and No 2008) as well as improper use of extensions (Debreceny et al. 2010,
2011). However, the error rate on average seems to be declining over time (Du, Vasarhelyi,
and Zheng 2013, Bartley, Chen, and Taylor 2011, SEC 2014) with the exception of small firms,
where the improper use of extensions seems to be increasing (SEC 2014).4 Studies also find
that firms with executives who have expertise in information systems tend to have fewer
extended tags, suggesting that extended tags are used out of ignorance (Boritz and No 2013).
While there is ample evidence that there are errors in XBRL filings, it is not clear whether these
errors on average impede analysts’ information collection activities.
Prior literature also suggests that XBRL extension use is related to earnings management.
The basic argument is that firms may strategically choose extended tags for accounts that have
been managed, which makes it more difficult to identify earnings management. Consistent with
this logic, Lim et al. (2013) find that the use of more standard tags is generally negatively
related to earnings management while more extended tag use is positively related to earnings
management. To the extent that firms are using extended tags to mask earnings management,
then XBRL extensions would be detrimental to analysts’ information.
Both practitioners and academic studies have criticized the use of extensions because the
use of extended tags makes it more difficult to compare the performance of firms within an
3 Interestingly, Liu, Yao, Sia, and Wei (2014b) find that early adopters of XBRL in China have significantly higher forecast
errors (lower forecast accuracy) due to the uncertainty regarding the quality of the data. However, their sample only runs to
2006, as XBRL has been mandatory in China since 2004, when XBRL was relatively new to the investing community. 4 The SEC (2014) observed that small firms had an increasing use of improper extensions mainly due to the software commonly
used by these firms. Because my sample requires analysts’ following, many of the small firms are not in my sample.
Specifically, only 6.1% of my sample is classified as non-accelerated filers. Furthermore, in my analysis, I control for software
fixed effects.
9
industry. In a guide to XBRL for investors, the CFA warns of the threat to comparability that
XBRL extensions pose in that it is difficult to compare extended items across firms (CFA 2011).
The counter argument is that having extended tags allows investors to highlight items that are
unique to firms, making it easier to compare common items across firms. For example, in a
2011 survey 45% of respondents who were aware of XBRL indicated that tagging non-
comparable items is useful in valuation in general while another 35% believed tagging non-
comparable items would make it easier for data aggregators to normalize data (CFA 2011).
From an empirical standpoint, Dhole et al. (2015) find that more extended tags reduce
comparability. If XBRL extensions pose a significant threat to analysts’ information, then more
extended tags should increase analysts’ forecast errors and forecast dispersion.
More recently, a few studies have examined the association between the use of extended
items and the firm’s information environment. Hoitash and Hoitash (2014) examine the relation
between document extensions and financial reporting quality and audit fees, arguing that
extensions capture the reporting complexity of the firm. They find consistent with this notion
that firms with more extensions tend to have more restatements, a greater likelihood of internal
control weaknesses, higher abnormal accruals, and higher audit fees. On the other hand, Li and
Nwaeze (2015) examine document extension rates and the firm’s financial information
environment, using market based measures of the firms’ information environment.5 They find
that in the early years, extensions were negatively related to the firm’s information environment
while in later years, extensions were positively related to the firm’s information environment.
I reconcile these results by explicitly considering the endogeneity of XBRL extensions.
5 Specifically, they use event return volatility, information efficiency, change in standard deviation of daily stock returns, and
bid-ask spread to proxy for the firm’s information environment. In unreported test, I find that extension rates are negatively
related to changes in standard deviation of daily stock returns and changes in bid-ask spread using IV methods. However, I do
not find any significant relation between extension rates and event return volatility or information efficiency when I use the
IV technique.
10
Hypothesis Development
While prior literature is largely in agreement about the benefit of XBRL in lowering
information processing costs, there is little that is already known about how the use of extended
taxonomies influences analysts’ information. Many filers contend that extended taxonomies
are needed to allow firms to apply informational context to data items. If firms use extended
tags to convey information imbedded in the data, then analysts would have better information
about the future earnings of the firm, and there would be less divergence of opinion among
analysts. On the other hand, firms may use extended tags out of ignorance (Boritz et al. 2013)
or error (Debreceny et al. 2010), which would likely have no informative content at all.
However, if managers use extended tags opportunistically (Lim et al. 2013) or if extension use
severely compromises comparability (Dhole et al. 2015), then extension use would be
detrimental to analysts’ information and increase forecast dispersion and forecast errors.
Because in the cross section the different uses of extensions likely exist, it is an empirical
question as to which use dominates. This leads to the first hypothesis, stated in the null:
H1) The use of extended tags in XBRL documents is unrelated to analysts’ forecast
dispersion and analysts’ forecast error.
To tease out the different motives in the use of extended taxonomies, I look at the extension
use for particular parts of the XBRL document. I consider three parts of the filing where the
informativeness of the extension is likely to vary. First, I consider extensions in the footnote
disclosures of the XBRL instance document. Items tagged as disclosure notes are unlikely to
influence valuation models used by analysts. Furthermore, disclosure notes are likely the target
of direct search by analysts based on keyword searches. If managers are using extended tags to
convey information to the market, this is the primary place where it would occur. On the other
hand, more complex firms may require more footnote disclosures. Thus I present a null
hypothesis with respect to extensions in the textual disclosures of the firm.
H2) The use of extended tags in footnote disclosure is unrelated to analysts’ forecast
dispersion and forecast error.
11
Next I consider the use of extensions with the financial statements. While financial
statement items may provide additional detail about the financial performance and position of
the firm, it is also likely that the numbers on the financial statements have been manipulated.
The importance of these items is greater because they are more likely the target of direct input
into valuation models, which is the type of interoperability that XBRL has targeted. At this
level, extension could be either informative or opportunistic. Thus I retain a null form
hypothesis with respect to extensions within the financial statements.
H3) The use of extended tags within the financial statements is unrelated to analysts’
forecast dispersion and forecast error.
III. Methodology
In this section I describe the methodology used to test the association between the use of
extended tags in financial reporting and analysts’ information. First, I describe how I measure
the use of extended tags. Second, I discuss the instrumental variables used in my study. Finally,
I describe the tests used to address my hypotheses.
Measuring Extensions
My first task is to identify the use of extended tags for a given filing. I search the SEC XBRL
RSS monthly XBRL feed for 10-K and 10-Q filings to get the filings in each quarter. I then
identify extended tags using the namespace prefix for each item of interest. Specifically,
namespace references that include “sec.gov”, “fasb.org”, “w3.org”, or “xbrl.org” are
considered official taxonomies6and all others are considered extended (unofficial) taxonomies.
To measure extension use, I divide the number of extended items in a particular section by the
number of total items in that section.7 This has the interpretation of examining what happens if
I take one standard tag and make it an extended tag, holding the total number of items constant.
6 I include “w3.org” because this is the namespace for the World Wide Web consortium which specifies some of the
requirements for XML documents in general. 7 In counting, I only consider the item’s name and not the context. Therefore, if an extended item is included in comparative
statements, I only count it once as opposed to counting it once for each comparative statement in which it is included.
12
My next task is to identify the different levels for the rate of extension. First, I consider
the use of extension items within the whole XBRL filing. For this document level extension
rate, I consider each unique item reported in the XBRL filing within each section defined in
the presentation linkbase.8 This is not a straightforward task, as firms have discretion over the
names of the sections of the report. However, many firms follow the convention of naming
their documents using a document identifying number followed by “Document,” “Statement,”
or “Disclosure,” followed by the actual section name as shown in the traditional filing.9 In
counting the tags for the whole filing, I only consider the sections that are indicated as either
“Disclosure” or “Statement”10 using section titles provided in the presentation linkbase of the
XBRL filing.
Second, I consider the extension rates within the disclosure and statement sections of the
filings. I count the number of unique items (disregarding context) within each section of the
XBRL.11 Again, I use the section titles to identify whether a section of the report is a part of
the disclosure or part of the financial statements. Finally, I consider the extension rate within
the individual financial statements. I rely on the statement name to identify the Balance Sheet,
the Cash Flow Statement, and the Income Statement.12 The statement of Changes in Owner’s
Equity has a number of variations and is often not reported as a statement. I therefore cannot
reliably identify this statement. With respect to the Income Statement, firms have latitude to
report either one Comprehensive Statement of Income or two separate statements, the
Statement of Income and the Statement of Other Comprehensive Income.13 To the extent that
8 A presentation linkbase lists the sections of a report and what items should be included in that section. 9 Here “Document” refers to items related to the filing as a whole. These are items such as the filing date, the CIK of the firm
making the filing, the form type, etc. which are sometimes referred to as “Document and Entity Items (DEI).” 10 Including all tags within the whole filing provides qualitatively similar results. 11 I exclude sections indicated as “Parenthetical.” 12 I use a number of keywords and regular expressions to identify the statements by name. The program used to identify the
statements is available upon request. 13 There is also the possibility that other comprehensive income is in the statement of equity, though this option was eliminated
by Topic 220 for fiscal periods from December 15, 2011 onward.
13
I can identify a separate Statement of Other Comprehensive Income, I exclude the related tags
from the count for the Income Statement.14
Instrumental Variables
Because extension use is likely endogenously determined, I use instrumental variables to
estimate the relationship between extension rates and financial analysts’ forecast error and
dispersion. A good instrumental variable should be related to the independent variables of
interest but unrelated to the dependent variables after considering other variables in the model.
The first variable I consider is the number of standard items in the US GAAP taxonomy used
in the filing. This will be related to extension rate because as standard items are added to the
taxonomy, firms will not need to extend their firm-specific taxonomies as much. Furthermore,
there is no reason to believe that the number of standard items in the standard taxonomy will
be related to analysts’ forecast error or dispersion except through extension rates themselves.15
Empirical Model
I use the following model to test the association between extension use and analysts’ forecast
error and analysts’ forecast dispersion.
𝐸𝑟𝑟𝑜𝑟𝑖,𝑡 𝑜𝑟 𝐷𝑖𝑠𝑝𝑖,𝑡 = 𝛼0 + 𝛼1𝐸𝑥𝑡𝑒𝑛𝑑𝑅𝑎𝑡𝑒𝑖,𝑡 + 𝛼2𝑆𝑖𝑧𝑒𝑖,𝑡−1 + 𝛼3𝐴𝑔𝑒𝑖,𝑡−1
+ 𝛼4𝑆𝑡𝑑𝑅𝑂𝐸𝑖,𝑡−1 + 𝛼5𝐼𝑛𝑡𝑎𝑛𝑔𝑖,𝑡−1 + 𝛼6𝑅𝑂𝐴𝑖,𝑡−1
+ 𝛼7𝐶𝑜𝑟𝑟𝑖,𝑡−1 + 𝛼8𝐸𝑃𝑆𝑖,𝑡−1 + 𝛼9𝑄4𝑖,𝑡 + 𝛼10𝐿𝑛𝐶𝑜𝑣𝑖,𝑡
+ 𝛼11𝐿𝑎𝑟𝑔𝑒𝐹𝑖𝑙𝑒𝑟𝑖,𝑡 + 𝛼12𝐴𝑐𝑐𝐹𝑖𝑙𝑒𝑟𝑖,𝑡 + 𝛼13𝐻𝑜𝑟𝑖𝑧𝑜𝑛𝑖,𝑡
+ 𝛼14𝑉𝑖𝑜𝑙𝑎𝑡𝑒𝑖,𝑡 + 𝑆𝑜𝑓𝑡𝑤𝑎𝑟𝑒 𝐸𝑓𝑓𝑒𝑐𝑡𝑠+ 𝐼𝑛𝑑𝑢𝑠𝑡𝑟𝑦 𝐸𝑓𝑓𝑒𝑐𝑡𝑠 + 𝑌𝑒𝑎𝑟 𝐸𝑓𝑓𝑒𝑐𝑡𝑠 + 𝑒𝑖,𝑡
(1)
I model analysts’ forecast error (Error) and analysts’ dispersion (Disp) similar to Li et al.
(2014). Since my focus is on quarterly filings, I measure the analysts’ forecast error as the
magnitude of the median one period ahead quarterly EPS forecast in the 30-day following the
14 In an unreported test, I combine tags identified as either the Income Statement or the Statement of Other Comprehensive
Income and find similar results. 15 While there is no reason to believe that the number of standard tags is related to the firm’s information environment, I cannot
test this assumption because the system is exactly identified and not over-identified.
14
filing minus the respective actual quarterly EPS, scaled by price just prior to the filings date. I
measure dispersion using two variables. First, I measure dispersion using the standard deviation
of analysts’ one period ahead quarterly EPS forecasts (StdDev) issued 30-days following the
firm’s quarterly filing scaled by the most recent prices just prior to the filing. However, this
measure requires at least three forecasts to be issued within a 30-day period and therefore
reduces sample size significantly. As an alternative, I measure dispersion as the range (Range)
of one-period ahead quarterly EPS forecast issued in the 30-day period following the quarterly
filing, scaled by price just prior to the filings date.16 The variable of interest is ExtendRate,
which is the ratio of the extended items to the total number of items in the quarterly filings.
I include a number of control variables consistent with prior literature (Li et al. 2014; Liu
et al. 2014a). Because larger firms have a better information environment as well as more
complex operations which may require more XBRL elements in their filings, I include the log
of total assets at the beginning of the quarter (Size) and the number of years the firm has been
in Compustat (Age). To proxy for the uncertainty in quarterly earnings, I use the standard
deviation of return on equity (StdROE) and the correlation between ROE and stock returns
(Corr) over the previous four quarters. Firms with more intangibles are typically more difficult
to predict future earnings. As such, I include the sum of research and development and
advertising expenses from the previous quarter divided by deflated by market value at the
beginning of the quarter (Intang). I also control for return on assets (ROA) to control for firm
performance and earnings per share for the current year (EPS) to control for earnings surprises.
I include an indicator variable to capture differences in forecast errors for the fourth quarter as
well as differences in the extension rates between quarterly filings and annual fillings.17 I
16 In unreported tests, I use the inter-quartile range and find similar results. 17 While an indicator variable would control for the differences in the average extension rate between quarterly and annual
filings. However, this does not control for the possibility that the relationship between extension rate and analysts’ information
may be different between quarterly and annual filings. In unreported test, I use an interaction between the fourth quarter
indicator and the extension rates and do not find any such difference. I also partition the sample by annual vs quarterly filings
and do not find any significant difference for the coefficient on extension rates for these two groups.
15
control for analysts’ attention using the log of analysts following (LnCov) in the 30 days prior
to the filing. The mandatory filing of XBRL was implemented in three stages. Furthermore,
firms in these categories are more likely to be stable firms with better information environments.
Therefore, I include two dummy variables to capture filing status, one for firms classified as
Large Accelerated Filers (LargeFiler) and one for firms classified Accelerated Filers (AccFiler)
as these firms were required to adopt XBRL in 2009 and 2010 respectively and therefore have
more experience with XBRL Filings. Because estimates issued closer to the fiscal quarter end
dates are generally less uncertain, I include the log of the average number of days between the
forecast issue date and the relevant fiscal period end dates (Horizon). Some firms may have
errors in their XBRL filings that would result in problems parsing the XBRL data properly. I
control for this possibility by including an indicator variable (Violate) for firms that violate any
technical rules in their filings as reported by XBRLCloud.com. The software used to create
XBRL filings may encourage the use of extensions, and it may also be more compatible with
analysts’ internal software. To control for this, I use fixed effect for the software used to create
the XBRL filings. I also have fixed effects for Fama-French 48 industry and year.
IV. Empirical Analysis
Sample Statistics
My sample starts with the 103,202 10K and 10Q XBRL filings for fiscal years 2010 through
2014.18 Because I obtain analysts’ information from I/B/E/S, financial statement numbers from
Compustat, and stock prices and returns from CRSP, I lose 47,451 observations from which I
cannot match firm identifiers within the databases. Of the remaining 55,751 observations, 19
filings cannot be properly parsed to collect document extension rates, and 455 filings do not
clearly indicate the sections of the report. I cannot correctly identify the Balance Sheet, Cash
Flow Statement, or Income Statement for an additional 1,749 filings. I further lose 21,198
18 I start in 2010 because only 76 filings are available for 2009 filings primarily due to the restriction of identifiable sections.
16
filings because I lack the data for my dependant variables, forecast error and forecast
dispersion. For 10,530 of the remaining observations, I lack data for my control variables which
results in a final sample of 22,800 firm-quarter observations from 3,051 firms. Table 1 presents
a summary of the sample selection.
Table 2 presents the distribution of the standard tags used across different fiscal years
as well as the number of tags defined in each taxonomy. Given that the data is quarterly and
the SEC generally allows firms to use the most recent two taxonomies, the distribution is to be
expected. For example, in 2013, 3,732 observations use the 2012 taxonomy and 3,696
observations use the 2013 taxonomy while only 54 and 5 observations use the 2011 and 2014
taxonomies, respectively. Table 2 also shows that the number of standard tags has increased
over time from 13,452 in 2009 to 17,433 in 2014. Thus, there is substantial variation in the
number of tags cross-sectionally so as not to be absorbed by any year fixed effects.
Table 3 presents simple statistic from the sample. The average (median) forecast error is
0.006 (0.002) and the average (median) forecast range is 0.005 (0.001). The average (median)
forecast inter-quartile range is 0.003 (0.001). For my sample, the average (median) extension
rate is 17.3% (16.2%) which is slightly lower than the 20.55% average extension rate reported
by Li and Nwaeze (2015). This may be explained by the declining use of extensions over time.
When I look at the extension rate for the separate sections of the report, I find that extension
are more frequent in the disclosure section, with an average of 21.6% of disclosure items being
extended (median of 20.4%) compared to about only 6.9% of financial statement items being
extended (median of 5.6%). Examining the individual financial statements, I note that the cash
flow statement tends to have the most extensions, with an average 8.7% and a median of 7.1%
of the items being extended. The balance sheet has the least extensions with an average of 4.5%
and a median of 3.1%. The average extension rate for the Income Statement is 6.8% with a
median of 5.0%.
17
Table 4 presents the correlation coefficients for the sample. I find that the document,
disclosure, and overall statement extension rates have Pearson’s correlation coefficients with
analysts’ forecast error of 0.033, 0.031, and 0.037 (See Table 4 column 1) and are significant
at least at the 10% level. The correlation coefficient between these rates and the standard
deviation of analysts’ forecast are 0.080, 0.068, and 0.058 respectively and are significant at
least at the 10% level. The respective correlations are 0.081, 0.068, and 0.058 for the range of
analysts’ forecasts. For the individual financial statements, I find the correlation coefficients
for the balance sheet, cash flow statement, and income statement with analysts’ forecast error
are 0.007, 0.058, and -0.002, although only the extension rate for the cash flow statement is
significant at the 10% level. The correlation coefficients for these statements and forecast
dispersion are 0.020, 0.087, and 0.006 respectively, but again only the correlation for the cash
flow statement is significant. Turning to the correlation between these statement level extension
rates and the range of analysts’ forecasts, the coefficients are 0.025, 0.074, and 0.016
respectively, with only the extension rate on the cash flow statement and the income statement
being significant.
First Stage Regression
Table 5 presents the first stage regression of our various extension rates on the number of tags
in the standard taxonomy. In the first column, we use the document extension rate and in the
second and third columns, we break that extension rate down into the Disclosure and Statement
level extension rate. The fourth through sixth columns present the results for the Balance Sheet,
Cash Flow Statement, and Income Statement extension rates respectively. Consistent with our
conjecture that use of more standard tags is associated with lower extension rates, we find a
negative coefficient on the number of standard tags for all our models. In general the coefficient
is highly significant and passes the under-identification test, suggesting that the total number
of standard tags in the relevant taxonomy is highly correlated with the extension rates.
18
Specifically, the Kleibergen-Paap Rank LM statistic ranges from 11.660 to 85.375 and is
significant at the 1% level for all of our models.19 This suggests that the total number of tags
in the standard taxonomy used by the firm is strongly associated with the extension rate.
Analysts’ Information and XBRL Extension Rate
In Table 6, I present the results for the regression of analysts’ forecast error and forecast
dispersion using both OLS regression and the IV (2SLS) regressions on the section extension
rates for the whole document.2021 The first three columns present the results for the OLS
estimates of the regression of Error, StdDev, and Range on the overall extension rate. We find
significant coefficients of 0.010, 0.004, and 0.009, respectively. This positive relation would
lead one to conclude that extensions degrade the firm’s information environment either because
of complexity, lack of comparability, or erroneous or misleading use. However, using 2SLS,
we find significant coefficients of -0.055, -0.034, and -0.065, respectively, suggesting that
extensions generally improve the analysts’ information. The positive relation found in using
OLS is likely driven by the complexity of the firm that both requires firms to use extended
items and increases financial analysts’ forecast error and dispersion.
Table 7 presents the 2SLS results of the regression of analysts’ forecast dispersion on
extension rates for the footnote disclosures and the financial statement separately. The first
three columns contain the results for extensions within the footnote disclosures. Consistent
with the results for the overall extension rate, we find significant coefficients of -0.043, -0.027,
and -0.051 for Error, StdDev, and Range, respectively. In the last three columns, the variable
of interest is the extension rate within the financial statements. Again, the results are negative
and significant with coefficients of -0.130, -0.078, and -0.155 for Error, StdDev, and Range,
19 These statistics are for the full sample. When I use the sub-sample that has valid standard deviation observations, I come to
a similar conclusion. When examining the standard deviation of estimates as a measure of dispersion, I only use the available
observations in both the first and second stage. However, for brevity, I only report the first stage for the full sample. 20 All of my analysis uses standard errors clustered by firm as suggested by Petersen (2009). 21 Throughout the analysis, I do not report the adjusted R squared because it is not interpretable in the context of 2SLS.
19
respectively. While I cannot statistically test the difference between these extensions in the
disclosure and the statement sections, an examination of the standardized coefficients gives a
sense of their importance in determining analysts’ forecast error and dispersion. These
standardized coefficients indicate that extensions within the financial statements have a
stronger association with analysts’ forecast error (-0.530 compared to -0.311), analysts’
forecast standard deviations (-0.884 compared to -0.557), and forecast range (-0.896 compared
to -0.526). The weight of the evidence is consistent with XBRL extensions improving analysts’
information, with extensions in the financial statements being especially helpful.
Next I turn to the individual financial statements. In Table 8, I find that extensions within
the balance sheet, cash flow statement, and the income statement are negatively related to both
analysts’ forecast error and the standard deviation of forecasts.22 The standard coefficient on
cash flow is the largest in magnitude for both the analysts’ forecast error (-0.870) and forecast
standard deviation (-1.376). In comparison, the standard coefficient for analysts’ forecast error
is -0.445 and -0.597 for extensions on the balance sheet and income statement. The
corresponding coefficients for the standard deviation of analysts’ forecasts is -0.820 and -1.075
for the balance sheet and income statement. The larger weight on extensions in the cash flow
statement may be a manifestation of the degree to which idiosyncratic items are reported in the
cash flow statement while items on the balance sheet and income statement, because they have
been around longer, tend to reflect more common items. I leave it to future research to further
examine the role of extended items on the cash flow statement vis-à-vis the balance sheet and
income statement.
22 I exclude Range in Table 8 for brevity. Untabulated results produce similar results (i.e. extensions on the cash flow statement
has the strongest negative association while extensions on the balance sheet has the weakest).
20
V. Additional Analysis
Cross-sectional determinants of the relationship between analysts’ information and
extension rate.
In my main analysis, I find that on average XBRL extension use is negatively related to
analysts’ forecast and dispersion, implying that manager’s decisions improve the firm’s
information environment. In this section, I examine some cross-sectional variables that may
determine the association between analysts’ information and XBRL extension use.23 These
cross-sectional variables fall into four categories based on the characterization extension use
in the literature. I begin by examining how the relation between extension rate and analysts’
information varies with information uncertainty. If firms use extensions to convey information,
then the relationship between extensions and analysts’ information should be stronger when
information is more uncertain. The second characterization is that XBRL extensions are
commonly used erroneously (Debreceny et al. 2010, Boritz and No 2008, Bartley et al. 2011,
SEC 2014) and therefore compromise the usefulness of XBRL. A similar criticism is that
extensions compromise the comparability of financial statements across firms (Dhole et al.
2015). I therefore examine how comparability moderates the relation between extension rates
and analysts’ information. The last category is related to earnings management, since firms that
are managing earnings may try to mask make their reporting opaquer through the use of
extensions (Lim et al. 2014).
Table 9 presents the results for the IV regression of analysts’ forecast error and forecast
dispersion where the observations are split into groups based on the yearly median of the cross-
sectional variable in question. For brevity, I only report the results for the overall document
extension rate, and I do not report the coefficients for the control variables, although they are
included in the model. I also only report the results for forecast error and standard deviation.
23 To avoid any potential reserves causality, I measure all cross-sectional variables as of the last fiscal year prior to the quarterly
filling.
21
Information Uncertainty
I include two information uncertainty measures to examine the cross-sectional
relationship between XBRL extension use and analysts’ forecast error and dispersion. The first
is the standard deviation of daily stock returns (StdRet) over the 12-month period ending at the
beginning the current fiscal year end. The second is the standard deviation of return on equity
(StdROE) over the four quarters ending at the beginning of the current fiscal year end. Panels
A and B of Table 8 report the results for the cross-sectional variables of StdRet and StdROE. If
managers are using their discretion to be informative, then the correlation between XBRL and
analysts’ forecast error and dispersion should be greater when the information is generally more
uncertain. Using the standard deviation of returns as a measure of information uncertainty, I
find that firms with uncertainty have a negative and significant correlation between extension
rate and analysts’ forecast (coefficient of -0.108) and forecast dispersion (coefficient of -0.057).
For firms with low uncertainty, I do not find a significant correlation between extension rate
and forecast error (coefficient of -0.000), but I do find a negative and significant relation
between extension rate and forecast dispersion. The difference between the high and low
groups is significant for both forecast error (coefficient of -0.108) and forecast dispersion
(coefficient of -0.049). Thus firms with returns volatility generally have a stronger relation
between extension use and analysts’ information.
I find a similar story when I examine the standard deviation of ROE, as reported in Panel
B. Specifically, firms with higher volatility in ROE have a negative and significant correlation
between extension use and forecast error and forecast dispersion, with coefficients of -0.064
and -0.050, respectively. Firms with lower volatility in ROE still have a negative association
between extension use and forecast error and dispersion, with coefficients of -0.020 and -0.007
respectively. Furthermore, the difference between these two groups is negative and significant
(coefficients and -0.044 and -0.043 respectively), suggesting that extension rate is more
strongly associated with analysts’ information when overall information uncertainty is high.
22
XBRL Errors in filings
To test the extent to which errors in XBRL compromise the usefulness of XBRL
extensions, I divide the sample based on whether there are observable errors in the filing. I use
the data from XBRLCloud.com to determine if a filing has any type of error. While erroneous
extension use may not directly trigger a technical error, filers with technical errors are more
likely to also make mistakes in utilizing extended tags. For my sample, there are 4,539 filings
with some sort of technical violation. I report the results for firms with and without technical
errors in Panel C of Table 9.
For firms with a technical error, we find an insignificant coefficient on extension rate for
both analysts’ forecast error and forecast dispersion (coefficients of -0.046 and 0.024). Firms
without a technical error have a negative and significant association between extension rates
and analysts’ forecast error and dispersion (coefficients of -0.050 and -0.034). However, the
differences in the coefficients between the two groups are insignificant. While this is consistent
with errors in XBRL filings eroding the usefulness of XBRL extensions, this analysis comes
with one important caveat. For the firms that have technical errors, the Kleibergen-Paap Rank
LM statistic is insignificant. It may be that firms with technical errors are indeed selecting
extension rates erroneously and therefore the number of standard tags defined in the taxonomy
has no relation to the extension rate for these firms. Therefore, the IV method may not be valid
for these firms and I therefore, caution the reader against making a strong conclusion based on
these results.
Comparability
One of the criticisms against allowing extensions is that they may compromise
comparability between two firms (CFA Institute 2011, Dhole et al 2015). In this section I
examine how differences in comparability mediate the relationship between extension rates
and analysts’ forecast error and dispersion. When firms are generally comparable to their peers,
the use of extensions may actually erode analysts’ information because it is more difficult to
23
assess the comparability of individual line items. On the other hand, when firms are less
comparable, the use of extensions highlights this and makes it easier for analysts to identify the
line items that are not comparable to their peers.
Following Dhole et al. (2015), I use two measures of accounting-based comparability.
Both measures start by estimating the firm-specific accounting function that map economic
events, as measured by stock returns, into earnings. Following DeFranco. Kothari, and Verdi
(2011),24 I estimate the following equation by firm over the past 16 quarters:
𝐸𝑎𝑟𝑛𝑖𝑛𝑔𝑠𝑖,𝑡 = 𝛼𝑖 + 𝛽𝑖𝑅𝐸𝑇𝑖,𝑡 + 𝑒𝑖,𝑡 (4)
Where Earnings is quarterly income before extraordinary items scaled by market value at the
beginning of the quarter and RET is the stock return for the quarter. The estimates of 𝛼�̂�and 𝛽�̂�
are firm specific parameters that map returns to earnings for firm i. Next, we estimate what
firm i’s earnings would have been under firm i’s accounting function (𝛼�̂� and 𝛽�̂�) given the
economic events of firm i (Returni,t). Next, I estimate what firm i’s earnings would have been
under firm j’s accounting function (𝛼�̂� and 𝛽�̂�) given the economic events of firm i. Formally, I
calculate:
𝐸[𝐸𝑎𝑟𝑛𝑖𝑛𝑔𝑠]𝑖,𝑖,𝑡 = �̂�𝑖 + �̂�𝑖𝑅𝐸𝑇𝑖,𝑡 (5)
𝐸[𝐸𝑎𝑟𝑛𝑖𝑛𝑔𝑠]𝑖,𝑗,𝑡 = �̂�𝑗 + �̂�𝑗𝑅𝐸𝑇𝑖,𝑡 (6)
for all pairs i,j within the same 2-digit SIC industry. The difference between 𝐸[𝐸𝑎𝑟𝑛𝑖𝑛𝑔𝑠]𝑖,𝑖,𝑡
and 𝐸[𝐸𝑎𝑟𝑛𝑖𝑛𝑔𝑠]𝑖,𝑗,𝑡 comes about because firm i and firm j use different accounting methods
to capture economic events. In order to mitigate measurement error, to get a measure of
24 I am grateful to Rodrigo Verdi for providing the SAS code here: http://www.mit.edu/~rverdi/acctcomp_2013.sas.
24
comparability between firm i and firm j (CompAccti,j,t), I calculate the negative mean absolute
deviation between 𝐸[𝐸𝑎𝑟𝑛𝑖𝑛𝑔𝑠]𝑖,𝑖,𝑡 and 𝐸[𝐸𝑎𝑟𝑛𝑖𝑛𝑔𝑠]𝑖,𝑗,𝑡 for the 16 prior quarters:
𝐶𝑜𝑚𝑝𝐴𝑐𝑐𝑡𝑖,𝑗,𝑡 = −
1
16∗ ∑ |𝐸[𝐸𝑎𝑟𝑛𝑖𝑛𝑔𝑠]𝑖,𝑖,𝑡 − 𝐸[𝐸𝑎𝑟𝑛𝑖𝑛𝑔𝑠]𝑖,𝑗,𝑡|
𝑡
𝑡−15
(7)
This provides a measure of comparability for each i,j pair within the same 2-digit SIC industry.
To come up with a firm-specific measure of comparability, I use two aggregation methods
suggested by DeFranco et al. (2011). First, I rank CompAccti,j,t for each firm i and quarter t. I
then take the average of the top four firms to come up with comparability for firm i for its four
most comparable firms (CompAcct4). The second measure of comparability (CompAcctInd) is
the mean CompAccti,j,t for all firms within the same 2-digit SIC industry as firm i. The results
for CompAcct4 and CompAcctInd are presented in Panels D and E, respectively, of Table 9.
When I use CompAcct4 as my measure of comparability, I find that firms’ high
comparability have an insignificant coefficient -0.012 for the relation between extension rate
and forecast error, while firms with low comparability have a significant negative coefficient
of -0.066. The difference of 0.053 is significant at the 10% level. We find a similar pattern for
the relation between extension rate and the standard deviation of analysts’ forecasts.
Specifically, we find that the coefficient for the low comparability group is -0.012 (significant
at the 10% level) while the coefficient is -0.045 for the high comparability group and the
difference of 0.034 is significant at the 5% level.
The results are even more striking when I use the average comparability of all of the
firms within the industry (AccCompInd) as shown in Panel E of Table 9. For the low
comparability firms, the coefficient on extension rate is 0.005 (-0.007) for analysts’ forecast
error (standard deviation) and is insignificantly different from zero. For firms that are highly
comparable to their industry peers, the coefficient on extension rate is -0.086 (-0.049) for
analysts’ forecast (standard deviation) and is significant at the 1% level. The difference of 0.091
25
(0.042) is also significantly different from zero at the 1% level. Taken together, these results
suggest that when a firm is highly comparable to its peers, there is little or no relationship
between extension rate and analysts’ information. However, when the firm is not comparable
to its peers, then XBRL extensions are related to a better information environment, highlighting
the idiosyncratic nature of the extensions.
Earnings Management
I use two different variables related to earnings management. The first is the magnitude of
abnormal accrual from the modified-Jones Model (Dechow, Sloan, and Sweeney 1995).
Specifically, I estimate the following model:25
𝑇𝑜𝑡𝐴𝑐𝑐𝑖,𝑡 = 𝛼0 + 𝛼1∆𝑅𝐸𝑉𝑖,𝑡 + 𝛼2𝑃𝑃𝐸𝑖,𝑡 + 𝑒𝑖,𝑡 (8)
Where TotAcc is total accruals defined as income before extraordinary items less cash flow
from operations before extraordinary items. ∆𝑅𝐸𝑉 is the annual change in revenue and PPE is
gross property, plan, and equipment. Equation (4) is run annually by Fama-French 48 industry,
and the coefficients are used to estimate abnormal accruals using the following equation.
𝐴𝑏𝐴𝑐𝑐𝑖,𝑡 = 𝑇𝑜𝑡𝐴𝑐𝑐𝑖,𝑡 − (�̂�0 + �̂�1(∆𝑅𝐸𝑉𝑖,𝑡 − ∆𝑅𝐸𝐶𝑖,𝑡) + �̂�2𝑃𝑃𝐸𝑖,𝑡) (9)
Where ∆𝑅𝐸𝐶 is the annual change in accounts receivable. My first measure of earnings
management is the absolute value of AbAcc, which I denote in the table as Abs(AbAcc).
My second measure of earnings management is the Dechow and Dichev (2002) accruals
quality measured following Francis, LaFond, Olsson, and Schipper (2005). For each year and
Fama-French 48 industry, I run the following regression:26
𝑇𝐶𝐴𝑐𝑐𝑖,𝑡 = 𝛼0 + 𝛼1𝐶𝐹𝑂𝑡−1 + 𝛼2𝐶𝐹𝑂𝑡 + 𝛼3𝐶𝐹𝑂𝑡+1 + 𝛼4∆𝑅𝐸𝑉𝑖,𝑡 + 𝛼5𝑃𝑃𝐸𝑖,𝑡 + 𝑒𝑖,𝑡 (10)
Where TCAcc is measured as income before extraordinary items less operating cash flows
before extraordinary items less PPE, CFO is cash flow from operations before extraordinary
items, and all other variables are defined as in equation 4. I then take the standard deviation of
25 All variables, including the intercept, are scaled by lagged total assets. 26 Again all variables including the intercept are scaled by lagged assets.
26
the residual from equation 5 over the years t-1 to t-5.27 I denote my second measure of earnings
management as AQ.
If managers use extensions to mask earnings management, then the correlation between
extensions and analysts’ forecast error and forecast dispersion should be less negative for firms
with high values Abs(AbAcc) and AQ. In panel F of Table 9, I use Abs(AbAcc) to partition my
sample. Firms with high magnitudes of abnormal accruals have negative and significant
coefficients on extension rate of -0.069 and -0.046 for analysts’ forecast error and standard
deviation respectively, while firms with low magnitudes of abnormal accruals have significant
coefficients of -0.028 and -0.023 for forecast error and dispersion, respectively. However, the
differences between the high and low magnitude of abnormal accruals is insignificant
(coefficients of -0.041 and -0.023, respectively).
When I examine AQ as my partitioning variable, I find similar results, as reported in
panel G of table 9. Specifically, for firms with high AQ, the coefficients on extension rate are
-0.086 and -0.047 on analysts’ forecast error and dispersion while the corresponding
coefficients with low AQ are -0.007, which is insignificantly different from zero, and -0.021.
For forecast error, I find that the difference of the coefficients for the two groups is significant
at the 5% level (with a coefficient of -0.079) and for standard deviation, I find an insignificant
difference of -0.26. Taken together with the results from panel F, there is no evidence to suggest
that the informativeness of extensions is compromised for firms managing earnings.28
Number of Total and Extended Tags
Hoitash and Hoitash (2014) find that both the total number of tags and the number of
extended tags is negatively related to the firm’s information environment. In my main analysis,
27 This lags the measure by one more year to take into account the potential look ahead bias in using CFO in t+1 in equation
5. 28 In reported analysis, I separate AQ into its innate and discretionary components following Francis et al. (2005) and find that extension rate is more negatively related to analysts’ forecast error and dispersion for high levels of both components, though not significantly so.
27
I use the extension rate as my main variable of interest, where the denominator is the total
number of tags in the filing. My findings may be a driven by total number of tags being in the
denominator. I therefore examine the natural log of the total number of tags and of the extended
tags in a filing.
Table 10 presents the first stage regression of total tags and extended tags on the number
of standard tags and other control variables. In the first column, I find a positive and significant
relation between the number of standard tags in the taxonomy and the total tags in the filings.
This is consistent with the notion that when firms have more choices on tags, they increase the
tags they use. Similar to the results reported in Table 5, I find a negative association between
the number of standard tags defined in the taxonomy and the number of extended tags defined
by the firm. Kleibergen-Paap Rank LM statistic is 45.394 for the total number of tags and
142.137 for the number of extended tags indicating that the number of standard tags defined in
a taxonomy is strongly associated with both the total number of tags in a filing and the extended
tags in a filing.
Table 11 presents the OLS and 2SLS results for the total number of tags used in the filing.
Under OLS, we find positive coefficient the total 0.003, 0.001, and 0.003 for Error, StdDev,
and Range, respectively. When I use 2SLS, I continue to find a significant positive association
between the total tags in an XBRL filing and Error, StdDev, and Range, with coefficients of
0.020, 0.012, and 0.024. This is consistent with Hoitash and Hoitash (2014) and confirms their
suggestion of using the total number of tags in a filing as a measure or reporting complexity.
I report the results for the number of extended tags in Table 12. Consistent with Hoitash
and Hoitash (2014) and with the OLS analysis in Table 6, I find a positive and significant
association between the number of extensions within the filing and Error, StdDev, and Range
(coefficients of 0.001, 0.000, and 0.001). However, when I use 2SLS, I find a negative and
significant association between the number of extensions within a filing Error, StdDev, and
28
Range with coefficients of -0.007, -0.005, and -0.009, which is consistent with the 2SLS
analysis in Table 6 and the notion that firms use extended items to try to convey idiosyncratic
information. Together, these results also suggest that the total tags in a filing might be a more
preferred measure of complexity as opposed to the number of extended tags.
VI. Conclusion
In this paper, I examine the association between XBRL extensions and analysts’ information
environment. The SEC requires firms to present their financial reports in XBRL format which
electronically tags financial report data. However, the SEC allows firm to use either a standard
tag or a firm specific (extended) tag for any given data item. The intent of allowing the
extensions is to enable firms to convey information about items that might not be standard.
However, there has been skepticism related to the use of these tags such that extensions may
make the firm’s disclosures more opaque. To provide evidence on the use of extensions in
XBRL, I analyze the association between the extension rate and analysts’ forecast error and
analysts forecast dispersion.
I also suggest that more complex firms may have a greater demand for extended tags.
Therefore, an examination of the relation between XBRL extensions and characteristics that
may be related to complexity indicates that the extension rate may be endogenous to the system.
I suggest using the total number of tags to define the standard taxonomy used by the firm as an
instrument for the extension rate. The extension rate is related to the number of standard tags
within a taxonomy in that the more standard tags available, the less a firm would need to create
its own extension. It is hard to conceive of a situation where the number of standard tags defined
in a taxonomy would be related to the firm’s information environment except through the
extension rate after controlling for fiscal year.
Using a sample of 21,800 firm-quarter observations for the fiscal years running from 2010
to 2014, I find that the association between XBRL extension rate and analysts’ forecast error
29
and analysts’ forecast dispersion is positive when using OLS but is negative when using 2SLS.
This suggests that the use of XBRL extensions is positively associated with the firm’s
information environment and that XBRL are generally used to convey information about the
firm. I also find that there is a stronger association for extensions in the financial statements
relative to extensions in footnote disclosures. In examining extensions within the individual
financial statements, I find that extensions within the cash flow statement have a stronger
association with both analysts’ forecast error and dispersion, whereas extensions within the
balance sheet have the weakest.
In additional analysis, I fail to find evidence that the association between XBRL extension
use and analysts’ information varies with accrual management measures, which is inconsistent
with managers using XBRL extensions to mask earnings management. On the other hand, I do
find that the association between XBRL extension use and analysts’ information does vary with
stock returns volatility and with earnings volatility. This is consistent with managers using
extensions when their information environment is more uncertain.
The results of this study have implications for regulators considering the use of extended
taxonomies for XBRL filings. I find evidence that the extension rate is significantly associated
with lower forecast error and forecast dispersion. This is consistent with managers using their
discretion to try to convey information about the firm to investors. Regulators around the world
should not be overly concerned with allowing managers to use extended tags, but should rather
focus on improving the standard tags to ensure that common line items, especially at the
industry level, are available for firms to use.
30
Appendix A
Example of XBRL Extension Use
As an example of the use of an extended item in an XBRL filing, consider the June 30th 2013
quarterly filing for Seaworld Entertainment. In that filing, there is a piece of XML code that
includes ‘xmlns:seas=”http://www.seaworld.com/20130630”’ which is the namespace
declaration allowing the document to contain unique elements as defined by Seaworld. Within
the XBRL instance document, there is an item with the tag
“seas:FoodMerchandiseAndOtherRevenue.” The “seas:” is the namespace prefix that refers to
the earlier “xmlns:seas=” declaration.29 The namespace prefix can be used to identify whether
an item in the XBRL instance document is extended or not.
Below is an example of the use of an extended tag from the June 30th 2013 Quarterly
filing for Seaworld Entertainment Inc and Six Flags Entertainment. For simplicity, I only show
the components of revenue for each company. The extended items are indicated with asterisks.
Seaworld Six Flags
Admissions $255,001 Theme park admissions $199,666
Food, merchandise and other* 156,291
Theme park food, merchandise
and other* 149,669
Sponsorship, licensing and
other fees* 10,582
Accommodations revenue 3,784
Total revenues 411,292 Total revenues 363,701
* Extended Items
Both Seaworld and Six Flags use a standard tag for their main source of income, namely
“AdmissionsRevenues.” The only difference is the label (specified in a “Label linkbase”) that
each company uses. Both companies also create a new (extended) tag for Food and
Merchandise, which is a large component of revenue. Seaworld uses the tag
“FoodMerchandiseAndOtherRevenue” whereas Six Flags uses the tag
29 The current main schema for US GAAP is declared with something similar to ‘xmlns:us-gaap=”http://xbrl.fasb.org/us-
gaap/2015-01-31”’. While “us-gaap” is the preferred namespace reference, there is nothing keeping firms from declaring a
namespace of “‘xmlns:usgaapofficial=”http://xbrl.fasb.org/us-gaap/2015-01-31” and then using the prefix “usgaapofficial:”
throughout the instance document.
31
“FoodAndBeverageMerchandiseAndOtherRevenue.” Six Flags also uses a standard tag for
Accommodation Revenue and creates an extended tag for Sponsorship, Licensing, and Other
Fees, despite there already being a standard Licensing Revenue tag. Also, both companies use
standard tags for their Total Revenues items (“Revenues”). Depending on how the program
reading the XBRL is designed, the standard tags may be more accessible, since their tag names
are known from the standard taxonomy. While Food and Merchandise probably provides
additional information about the sources of revenue, this revenue may not be easily found by
investors. Since it is not easily found, managers may try to use these items to mask manipulated
revenues. Ex ante, it is not clear whether firms’ use of extended tags convey new information
or make it more difficult to identify the source of particular line items. I elaborate on this in the
hypothesis development.
32
Appendix B
Variable Definition
Error = the magnitude of the median one period ahead quarterly EPS in the 30-
day following the filing minus the respective actual quarterly EPS,
scaled by the stock price just prior to the filing;
StdDev = the standard deviation of one-period ahead quarterly EPS forecast in the
30-day period following the XBRL filing scaled by the stock price just
prior to the filing;
Range = the range of one-period ahead quarterly EPS forecast in the 30-day period
following the XBRL filing scaled by the stock price just prior to the
filing;
ExtendRate = the number of extended items in the XBRL filing divided by the
number of total items in the XBRL filing;
Disc. ExtendRate = the number of extended items in all sections identified as footnote
disclosures divided by the number of total items in all sections
identified as footnote disclosures;
Stm. ExtendRate = the number of extended items in all sections identified as financial
statements divided by the number of total items in all sections
identified as financial statements;
BS ExtendRate = the number of extended items in the section identified as the balance
sheet divided by the number of total items in the section identified as
the balance sheet;
CF ExtendRate = the number of extended items in the section identified as the cash flow
statement divided by the number of total items in the sections identified
as the cash flow statement;
IS ExtendRate = the number of extended items in all sections identified as an income
statement (either the statement of income or the comprehensive
statement of income or the statement of other comprehensive income)
divided by the number of total items in all sections identified as an
income statement;
Size = the natural log of total assets at the beginning of the quarter;
Age = the natural log of the number of years the firm has been in Compustat;
StdROE = the standard deviation of return on equity over the previous four quarters;
Intang = the sum of research and development and advertising expenses from the
previous quarter divided by deflated by market value at the beginning of
the quarter;
ROA = the return on assets for the quarter related to the XBRL filing;
Corr = the correlation between ROE and stock returns over the previous eight
quarters;
EPS = earnings per share for the quarter related to the XBRL filing;
Q4 = an indicator variable equal to one if the filing is for the fourth quarter;
33
LnCov = the log of analysts’ following in the quarter prior to the filing;
Horizon = The natural log of the average number of days between the forecast issue
date and the report date for the one period ahead EPS forecasts within 30
days of the quarterly filing;
LargeFiler = an indicator variable equal to one if the filer is a large accelerated filer;
AccFiler = an indicator variable equal to one if the filer is an accelerated filer;
Violate = an indicator variable equal to one if the XBRL filing violates any of the
filings rules as indicated by XBRLcloud.com.
34
References
Bartley, J., A. Y. S. Chen, and E. Taylor. 2011. A Comparison of XBRL Filings to Corporate
10-Ks – Evidence from the Voluntary Filing Program. Accounting Horizons. 25(2): 227-
245.
Blankespoor, E. 2012. The impact of investor information processing costs on firm disclosure
choice: evidence from the XBRL mandate. Stanford University, working paper.
Boritz, J. E., and W. G. No. 2008. SEC’s XBRL voluntary program on EDGAR: The case for
quality assurance. Current Issues in Auditing 2(2): A36–A50.
Boritz, J. E., and W.G. No. 2013. The quality of interactive data: XBRL versus Compustat,
Yahoo Finance, and Google Finance. University of Waterloo, working paper.
CFA Institute (CFA). 2011. XBRL Member survey report. Retrieved last on January 26th,
2016 from https://www.cfainstitute.org/Survey/cfa_institute_member_survey_xbrl.pdf
Chen G., J.-B. Kim, and J. Zhou, 2013. XBRL adoption and bank loan contracting: early
evidence. Northwestern University, working paper.
Debreceny, R. S., A. Chandra, J. J. Cheh, D. Guithues-Amrhein, N. J. Hannon, P. D.
Hutchison, D. Janvrin, R. A. Jones, B. Lamberton, A. Lymer, M. Mascha, R. Nehmer, S.
Roohani, R. P. Srivastava, S. Trabelsi, T. Tribunella, G. Trites, and M. A. Vasarhelyi.
2005. Financial reporting in XBRL on the SEC’s EDGAR system: A critique and
evaluation. Journal of Information Systems 19 (2): 191–210.
Debreceny, R. S., S. Farewell, M. Piechocki, C. Felden, and A. Graning. 2010. Does it add
up? Early evidence on the data quality of XBRL filings to the SEC. Journal of
Accounting and Public Policy 29(3): 296–306.
Debreceny, R. S., S. Farewell, M. Piechocki, C. Felden, A. Graning, and A. d’Eri. 2011. Flex
or break? Extensions in XBRL disclosures to the SEC. Accounting Horizons 25 (4): 631–
657.
Dechow P. M., and I. D. Dichev. 2002. The Quality of Accruals and Earnings: The Role of
Accrual Estimation Errors. The Accounting Review 77: 35.
Dechow P. M., R. Sloan, and A.P. Sweeney. 1995. Detecting Earnings Management.
Accounting Review 70(2): 193-225.
DeFranco, G., S. P. Kothari, and R. Verdi. 2011. The Benefits of Financial Statement
Comparability. The Journal of Accounting Research. 49(4): 895-931.
Dhole, S., G. J. Lobo, S. Mishra, and A. M. Pal. 2015. Effects of the SEC’s XBRL Mandate
on Financial reporting comparability. International Journal of Accounting Information
Systems. 19:29-44.
Dong Yi, O. Z. Li, Y. Lin, and C. K. Ni 2015. Information processing cost and stock return
synchronicity: evidence from XBRL adoption. Journal of Finance and Quantitative
Analysis Forthcoming.
35
Du, H., M. A. Vasarhelyi, and X. Zheng. 2013. XBRL mandate: Thousands of filing errors
and so what? Journal of Information Systems 27(1):61-78.
Efendi, J., J. D. Park, and C. Subramaniam. 2010. Do XBRL Reports Have Incremental
Information Content? – An Empirical Analysis. Working Paper, University of Texas –
Arlington and Towson University.
Francis J., R. LaFond, P. M. Olsson, K. Schipper. 2005. The market pricing of accruals
quality. Journal of Accounting & Economics 39, 295-327.
Hoitash, R. and U. Hoitash. 2014. Measuring Accounting Complexity with XBRL. Working
Paper, Bentley University and Northeastern University.
Kim, J. W., J. H. Lim, and W. G. No. 2012. The effect of mandatory XBRL reporting across
the financial information environment: evidence in the first waves of mandated U.S.
filers. Journal of Information Systems 26(1): 127-153.
Li, O. Z., Y. Lin, and C. Ni. 2014. Does XBRL adoption reduce the cost of equity capital?
Working paper, National University of Singapore and City University of Hong Kong.
Li, S. and E. Nwaeze. 2015. The Association between Extensions in XBRL Disclosures and
Financial Information Environment. Journal of Accounting Information Systems
Forthcoming.
Lim J. H., J. B. Kim, and J. Kim, 2013. Does XRBL adoption constrain managerial
opportunism in financial reporting? Evidence from mandated U.S. filers. University of
Waterloo, working paper.
Liu, C., T. Wang, and L. J. Yao. 2014a. XBRL’s impact on analyst forecast behavior: An
empirical study. Journal of Accounting and Public Policy. 33(1):69-82.
Liu, C., L. J. Yao., C. L. Sia, and K. K. Wei 2014b. The Impact of Early XBRL Adoption on
Analysts’ Forecast Accuracy: Empirical Evidence from China. Electronic Markets. 24(1):
47-55.
Ministry of Corporate Affairs (India). 2012. XBRL Filing Manual, Version 2.0. Available
from: http://www.mca.gov.in/Ministry/pdf/XBRL_Filing_Manual_v2.0_14thOct2012.pdf
Accessed last: Jan 26, 2016.
Petersen, M. A. 2009. Estimating Standard Errors in Finance Panel Data Sets: Comparing
Approaches. Review of Financial Studies 22(1): 435-480.
Securities and Exchange Commission (SEC). 2009. Staff Observations from Review of
Interactive Data Financial Statements (October 6). Available at:
http://www.sec.gov/spotlight/xbrl/staff-review-observations-100609.shtml
Securities and Exchange Commission (SEC). 2010. Staff Observations from Review of
Interactive Data Financial Statements (November 1). Available at:
http://www.sec.gov/spotlight/xbrl/staff-review-observations-110110.shtml
36
Securities and Exchange Commission (SEC). 2014. Staff Observations of Custom Tag Rates
(July 7). Available at: https://www.sec.gov/dera/reportspubs/assessment-custom-tag-
rates-xbrl.html
37
Table 1
Sample Selection
Attrition Sample
Size
10K and 10Q XBRL filings between 2010 and 2014 103,202
Matched with Compustat, IBES, and CRSP (47,451) 55,751
Missing extension data (2,223) 53,528
Missing analysts’ forecast data (21,198) 32,330
Missing controls (10,530) 21,800
Table 2
Sample Frequency by Schema and Fiscal Year
Schema
2010
2011
2012
2013
2014
Total
Obs.
Total
Tags
us-gaap - 2009 544 492 16 0 0 1,052 13,452
us-gaap - 2011 2 1,941 1,798 54 0 3,795 15,725
us-gaap - 2012 0 7 3,555 3,732 172 7,466 17,087
us-gaap - 2013 0 0 10 3,696 3,912 7,618 17,736
us-gaap - 2014 0 0 0 5 1,864 1,869 17,433
Total 546 2440 5379 7487 5,948 21,800
38
Table 3
Summary Statistics
Mean Std Q1 Median Q4
Error 0.006 0.014 0.001 0.002 0.005
StdDev 0.003 0.005 0.001 0.001 0.003
Range 0.005 0.010 0.000 0.001 0.005
ExtendRate 0.173 0.083 0.113 0.162 0.221
Disc. ExtendRate 0.216 0.099 0.145 0.204 0.275
Stm. ExtendRate 0.069 0.055 0.028 0.056 0.096
BS ExtendRate 0.045 0.053 0.000 0.031 0.069
CF ExtendRate 0.087 0.077 0.031 0.071 0.125
IS ExtendRate 0.068 0.075 0.000 0.050 0.105
Size 7.789 1.984 6.496 7.863 9.101
Age 3.055 0.670 2.565 2.996 3.638
StdROE 0.107 0.473 0.007 0.015 0.038
Intang 0.005 0.013 0.000 0.000 0.005
ROA 0.002 0.047 0.001 0.009 0.020
CORR 0.063 0.406 -0.235 0.079 0.373
EPS 0.463 0.807 0.034 0.343 0.760
Q4 0.226 0.418 0.000 0.000 0.000
LnCov 2.114 0.871 1.609 2.197 2.773
LargeFiler 0.689 0.463 0.000 1.000 1.000
AccFiler 0.250 0.433 0.000 0.000 1.000
Horizon 3.670 0.679 3.555 3.892 4.025
Violate 0.208 0.406 0.000 0.000 0.000 This table presents summary statistics for my sample of 21,800 firm-quarter observations for the period from
2010 to 2014, with the exception of StdDev, which is only available for 12,473 firm-quarter observations. All
variables are defined in Appendix B.
Table 4
Pearson Correlations Coefficients
(1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11)
(1) Error 1.000
(2) StdDev 0.579 1.000
(3) Range 0.385 0.961 1.000
(4) ExtendRate 0.033 0.080 0.081 1.000
(5) Disc. ExtendRate 0.031 0.068 0.074 0.961 1.000
(6) Stm. ExtendRate 0.037 0.058 0.055 0.656 0.512 1.000
(7) BS ExtendRate 0.007 0.020 0.025 0.502 0.408 0.730 1.000
(8) CF ExtendRate 0.058 0.087 0.074 0.557 0.435 0.849 0.463 1.000
(9) IS ExtendRate -0.002 0.006 0.016 0.469 0.363 0.716 0.380 0.393 1.000
(10) Size -0.209 -0.183 -0.064 0.356 0.312 0.231 0.217 0.146 0.224 1.000
(11) Age -0.127 -0.124 -0.097 0.034 0.043 -0.046 0.027 -0.080 -0.004 0.388 1.000
(12) StdROE 0.124 0.178 0.145 -0.001 0.006 -0.004 -0.037 0.019 -0.006 -0.168 -0.061
(13) Intang 0.188 0.239 0.138 -0.074 -0.061 -0.085 -0.107 -0.033 -0.090 -0.348 -0.103
(14) ROA -0.288 -0.363 -0.249 -0.037 -0.045 -0.034 0.024 -0.073 0.018 0.364 0.181
(15) CORR 0.002 -0.008 -0.005 -0.048 -0.045 -0.033 -0.035 -0.024 -0.027 -0.062 -0.035
(16) EPS -0.218 -0.262 -0.187 0.058 0.052 0.019 0.052 -0.020 0.044 0.374 0.210
(17) Q4 -0.026 0.025 -0.018 0.144 0.030 0.081 0.026 0.086 0.043 0.008 -0.013
(18) LnCov -0.253 -0.192 0.008 0.102 0.082 0.035 0.045 -0.022 0.102 0.592 0.145
(19) LargeFiler -0.254 -0.249 -0.103 0.127 0.103 0.058 0.070 0.008 0.101 0.617 0.271
(20) AccFiler 0.165 0.223 0.101 -0.134 -0.117 -0.062 -0.077 -0.019 -0.084 -0.496 -0.245
(21) Horizon -0.058 -0.104 0.020 -0.106 -0.029 -0.083 -0.034 -0.086 -0.040 0.090 0.074
(22) Violate -0.007 -0.011 -0.025 -0.024 -0.053 0.008 -0.004 0.024 -0.023 0.006 -0.023
40
Table 4 (cont.)
(12) (13) (14) (15) (16) (17) (18) (19) (20) (21)
(12) StdROE 1.000
(13) Intang 0.187 1.000
(14) ROA -0.295 -0.468 1.000
(15) CORR -0.023 -0.013 0.015 1.000
(16) EPS -0.100 -0.189 0.456 -0.007 1.000
(17) Q4 -0.013 -0.016 0.022 0.002 0.023 1.000
(18) LnCov -0.072 -0.146 0.237 -0.044 0.255 0.035 1.000
(19) LargeFiler -0.118 -0.223 0.324 -0.041 0.305 0.025 0.605 1.000
(20) AccFiler 0.080 0.141 -0.222 0.033 -0.259 -0.025 -0.463 -0.860 1.000
(21) Horizon -0.006 -0.025 0.054 -0.014 0.053 -0.656 0.090 0.132 -0.086 1.000
(22) Violate 0.002 -0.038 -0.012 0.005 -0.001 0.118 -0.076 -0.041 0.020 -0.104 This table presents Pearson’s correlation coefficients for my sample of 21,800 firm-quarter observations for the period from 2010 to 2014, with the exception of StdDev, which is only
available for 12,473 firm-quarter observations. Correlations in bold are significant at least at the 10% level. All variables are defined in Appendix B.
41
Table 5
First Stage Regression of Extension Rates Number of Standard Elements
ExtendRate
Disc. ExtendRate
Stm.
ExtendRate BS
ExtendRate CF
ExtendRate IS
ExtendRate
Total Taxonomy Tags -0.013 -0.017 -0.006 -0.006 -0.005 -0.007 (-9.78)*** (-10.10)*** (-6.02)*** (-6.77)*** (-3.47)*** (-4.86)***
Size 0.015 0.016 0.007 0.004 0.009 0.007 (13.23)*** (11.21)*** (7.86)*** (4.69)*** (7.04)*** (5.32)***
Age -0.008 -0.007 -0.008 -0.003 -0.012 -0.005 (-3.72)*** (-2.51)** (-4.97)*** (-1.86)* (-5.25)*** (-2.19)**
StdROE 0.005 0.005 0.002 -0.001 0.004 0.003 (2.25)** (2.06)** (1.28) (-0.66) (1.97)** (1.60)
Intang 0.231 0.226 0.101 0.006 0.151 0.172 (2.51)** (2.02)** (1.51) (0.11) (1.56) (1.92)*
ROA -0.179 -0.198 -0.090 -0.032 -0.151 -0.045 (-6.09)*** (-5.61)*** (-4.43)*** (-1.71)* (-4.98)*** (-1.76)*
CORR -0.003 -0.003 -0.001 -0.002 -0.001 -0.001 (-1.19) (-1.16) (-0.86) (-1.20) (-0.64) (-0.33)
EPS -0.002 -0.001 -0.001 0.000 -0.002 -0.002 (-1.46) (-0.74) (-0.90) (0.17) (-1.17) (-0.95)
Q4 0.021 -0.000 0.005 -0.001 0.010 0.003 (11.70)*** (-0.15) (4.08)*** (-0.63) (5.53)*** (1.43)
LnCov -0.006 -0.007 -0.003 -0.002 -0.006 -0.000 (-3.29)*** (-2.84)*** (-2.30)** (-0.96) (-3.11)*** (-0.14)
LargeFiler -0.022 -0.029 -0.008 -0.008 -0.009 -0.001 (-3.90)*** (-4.50)*** (-1.89)* (-2.00)** (-1.67)* (-0.22)
AccFiler -0.015 -0.021 -0.002 -0.004 -0.002 0.001 (-3.19)*** (-3.82)*** (-0.65) (-1.25) (-0.34) (0.20)
Horizon -0.001 0.000 -0.003 -0.002 -0.003 -0.002 (-0.60) (0.02) (-2.58)** (-2.19)** (-1.99)** (-1.72)*
Violate 0.003 0.001 0.002 0.001 0.003 -0.002 (1.50) (0.45) (1.22) (1.06) (1.58) (-1.25)
Adjusted R2 0.372 0.317 0.248 0.170 0.181 0.160
KP Rank LM Statistic 80.868 85.375 33.894 42.638 11.660 22.859
P-value <0.001 <0.001 <0.001 <0.001 0.001 <0.001
This table presents first stage regression of XBRL extension rates on the total taxonomy tags for my sample of 21,800 firm-quarter for the period
42
from 2010 to 2014. The dependent variables are indicated in the column heading. ExtendRate is the number of extended items in the XBRL filing
divided by the number of total items in the XBRL filing; Disc. ExtendRate is the number of extended items in all sections identified as footnote
disclosures divided by the number of total items in all sections identified as footnote disclosures; Stm. ExtendRate is the number of extended items
in all sections identified as financial statements divided by the number of total items in all sections identified as financial statements; BS
ExtendRate is the number of extended items in the section identified as the balance sheet divided by the number of total items in the section
identified as the balance sheet; CF ExtendRate is the number of extended items in the section identified as the cash flow statement divided by the
number of total items in the sections identified as the cash flow statement; IS ExtendRate is the number of extended items in all sections identified
as an income statement (either the statement of income or the comprehensive statement of income) divided by the number of total items in all
sections identified as an income statement; The instrumental variable is the total number of standard tags defined by the official US GAAP
taxonomy used in the filing. All other variables are defined in Appendix B. ***, **, * indicate significance at the 1%, 5%, and 10% levels of
significance. T-statistics are presented in parentheses and are adjusted for arbitrary heteroskedasticity and standard errors are clustered by firm.
Industry, Year, and Software fixed effects are included in the model but are not reported for brevity.
43
Table 6
Regression of Analysts’ Forecast Properties on Extension Rate
OLS 2SLS
Error StdDev Range Error StdDev Range
ExtendRate 0.010 0.004 0.009 -0.055 -0.034 -0.065 (4.23)*** (4.52)*** (5.58)*** (-4.03)*** (-5.01)*** (-5.00)***
Size 0.001 0.000 0.000 0.002 0.001 0.001 (3.62)*** (2.32)** (1.68)* (5.59)*** (5.25)*** (5.20)***
Age -0.001 -0.000 -0.001 -0.001 -0.001 -0.001 (-3.47)*** (-1.56) (-2.77)*** (-4.56)*** (-3.10)*** (-4.09)***
StdROE 0.001 0.001 0.001 0.001 0.001 0.002 (2.28)** (4.02)*** (4.50)*** (2.74)*** (4.62)*** (4.59)***
Intang 0.104 0.058 0.054 0.119 0.066 0.072 (4.67)*** (5.73)*** (3.62)*** (5.14)*** (5.77)*** (4.17)***
ROA -0.041 -0.020 -0.028 -0.053 -0.026 -0.042 (-7.37)*** (-7.15)*** (-7.48)*** (-8.53)*** (-7.18)*** (-8.06)***
CORR -0.000 -0.000 0.000 -0.000 -0.000 -0.000 (-0.75) (-0.47) (0.31) (-1.19) (-0.75) (-0.43)
EPS -0.001 -0.000 -0.001 -0.001 -0.001 -0.001 (-5.49)*** (-4.30)*** (-6.13)*** (-5.45)*** (-4.32)*** (-5.76)***
Q4 -0.002 -0.000 -0.000 -0.001 0.000 0.001 (-4.85)*** (-2.85)*** (-0.06) (-1.75)* (0.60) (3.68)***
LnCov -0.003 -0.001 0.001 -0.003 -0.001 0.000 (-9.13)*** (-5.65)*** (3.55)*** (-9.04)*** (-5.66)*** (1.04)
LargeFiler -0.006 -0.001 -0.001 -0.008 -0.002 -0.002 (-6.47)*** (-3.10)*** (-1.41) (-7.24)*** (-4.30)*** (-3.31)***
AccFiler -0.004 -0.000 0.001 -0.005 -0.001 -0.000 (-4.40)*** (-0.72) (2.04)** (-5.07)*** (-1.49) (-0.03)
Horizon -0.001 -0.001 0.001 -0.001 -0.001 0.001 (-1.94)* (-3.43)*** (5.91)*** (-2.16)** (-3.72)*** (4.53)***
Violate -0.000 0.000 0.000 0.000 0.000 0.000 (-0.01) (0.53) (1.61) (0.53) (1.46) (2.14)**
This table presents regression of analysts’ forecast properties on XBRL extension rates for my sample of 21,800 firm-quarter observations (12,473
firm-quarter observations for models that include StdDev) for the period from 2010 to 2014. The first three columns present the OLS estimates
whereas the last three columns present the 2SLS estimates. The dependent variables are indicated in the column heading. Error is the magnitude of
the median one period ahead quarterly EPS in the 30-day following the filing minus the respective actual quarterly EPS, scaled by the stock price
44
just prior to the filing; StdDev is the standard deviation of one-period ahead quarterly EPS forecast in the 30-day period following the XBRL
filing scaled by the stock price just prior to the filing; and Range is the range of one-period ahead quarterly EPS forecast in the 30-day period
following the XBRL filing scaled by the stock price just prior to the filing. The independent variables of interest is the document extension rate
(ExtendRate) defined as the number of extended items in the XBRL filing divided by the number of total items in the XBRL filing; All other
variables are defined in Appendix B. ***, **, * indicate significance at the 1%, 5%, and 10% levels of significance. T-statistics are presented in
parentheses and are adjusted for arbitrary heteroscedasticity, and standard errors are clustered by firm. Industry, Year, and Software fixed effects
are included in the model but are not reported for brevity.
45
Table 7
Regression of Analysts’ Forecast Properties on Extension Rates by Section
Disclosures Statements
Error StdDev Range Error StdDev Range
Extension Rate -0.043 -0.027 -0.051 -0.130 -0.078 -0.155 (-4.05)*** (-5.06)*** (-5.07)*** (-3.57)*** (-4.18)*** (-4.22)***
Size 0.001 0.001 0.001 0.002 0.001 0.001 (5.73)*** (5.17)*** (5.16)*** (4.81)*** (4.24)*** (4.31)***
Age -0.001 -0.000 -0.001 -0.002 -0.001 -0.002 (-4.22)*** (-2.60)*** (-3.66)*** (-4.73)*** (-3.53)*** (-4.37)***
StdROE 0.001 0.001 0.002 0.001 0.001 0.002 (2.71)*** (4.60)*** (4.59)*** (2.51)** (4.21)*** (4.09)***
Intang 0.116 0.063 0.068 0.120 0.067 0.072 (5.04)*** (5.63)*** (4.03)*** (4.93)*** (5.52)*** (3.73)***
ROA -0.051 -0.025 -0.040 -0.055 -0.026 -0.044 (-8.51)*** (-7.20)*** (-8.12)*** (-7.94)*** (-6.71)*** (-7.13)***
CORR -0.000 -0.000 -0.000 -0.000 -0.000 -0.000 (-1.18) (-0.84) (-0.39) (-1.15) (-0.85) (-0.47)
EPS -0.001 -0.001 -0.001 -0.001 -0.001 -0.001 (-5.41)*** (-4.24)*** (-5.73)*** (-4.74)*** (-3.51)*** (-4.60)***
Q4 -0.002 -0.001 -0.000 -0.001 -0.000 0.001 (-5.10)*** (-3.25)*** (-1.05) (-2.96)*** (-1.52) (1.88)*
LnCov -0.003 -0.001 0.000 -0.004 -0.001 0.000 (-9.11)*** (-5.62)*** (1.37) (-8.24)*** (-4.61)*** (0.51)
LargeFiler -0.008 -0.002 -0.002 -0.007 -0.002 -0.002 (-7.34)*** (-4.54)*** (-3.41)*** (-6.56)*** (-3.39)*** (-2.49)**
AccFiler -0.005 -0.001 -0.000 -0.005 -0.000 0.001 (-5.16)*** (-1.76)* (-0.18) (-4.41)*** (-0.77) (0.81)
Horizon -0.001 -0.001 0.001 -0.001 -0.001 0.000 (-2.03)** (-3.49)*** (4.98)*** (-2.84)*** (-4.05)*** (1.70)*
Violate 0.000 0.000 0.000 0.000 0.000 0.001 (0.22) (1.07) (1.67)* (0.65) (1.43) (1.95)*
Standardized
Coefficient -0.311 -0.557 -0.526 -0.530 -0.884 -0.896
This table presents the 2SLS regression estimates of analysts’ forecast properties on XBRL extension rates for my sample of 21,800 firm-quarter
observations (12,473 firm-quarter observations for models that include StdDev) for the period from 2010 to 2014. The first three columns use the
46
extension rate within footnote disclosures as the independent variable of interest. The last three columns use the extension rate within financial
statement as the independent variable of interest. The dependent variables are indicated in the column heading. Error is the magnitude of the median
one-period ahead quarterly EPS in the 30-day following the filing minus the respective actual quarterly EPS, scaled by the stock price just prior to
the filing; StdDev is the standard deviation of one-period ahead quarterly EPS forecast in the 30-day period following the XBRL filing scaled
by the stock price just prior to the filing; and Range is the range of one-period ahead quarterly EPS forecast in the 30-day period following the
XBRL filing scaled by the stock price just prior to the filing. All other variables are defined in Appendix B. The last row reports the standardized
(beta) coefficients for the extension rate variable. ***, **, * indicate significance at the 1%, 5%, and 10% levels of significance. T-statistics are
presented in parentheses and are adjusted for arbitrary heteroscedasticity, and standard errors are clustered by firm. Industry, Year, and Software
fixed effects are included in the model but are not reported for brevity.
47
Table 8
Regression of Analysts’ Forecast Properties on Extension Rates by Statement
Balance Sheet Cash Flow Statement Income Statement
Error StdDev Error StdDev Error StdDev
Extension Rate -0.115 -0.074 -0.154 -0.088 -0.108 -0.067 (-3.72)*** (-4.22)*** (-2.72)*** (-2.95)*** (-3.30)*** (-3.54)***
Size 0.001 0.001 0.002 0.001 0.001 0.001 (5.10)*** (3.79)*** (3.54)*** (3.23)*** (4.56)*** (3.67)***
Age -0.001 -0.001 -0.003 -0.001 -0.002 -0.001 (-4.04)*** (-2.83)*** (-3.58)*** (-2.77)*** (-3.95)*** (-2.41)**
StdROE 0.001 0.001 0.002 0.001 0.001 0.001 (2.03)** (3.35)*** (2.65)*** (4.00)*** (2.62)*** (3.83)***
Intang 0.107 0.055 0.130 0.077 0.125 0.074 (4.62)*** (4.70)*** (4.52)*** (4.79)*** (5.07)*** (5.44)***
ROA -0.047 -0.022 -0.066 -0.032 -0.048 -0.022 (-7.77)*** (-5.96)*** (-6.06)*** (-5.51)*** (-7.42)*** (-5.84)***
CORR -0.000 -0.000 -0.000 -0.000 -0.000 -0.000 (-1.32) (-1.45) (-1.00) (-0.67) (-0.85) (-0.32)
EPS -0.001 -0.001 -0.002 -0.001 -0.002 -0.001 (-4.45)*** (-3.22)*** (-3.91)*** (-2.94)*** (-4.86)*** (-3.19)***
Q4 -0.002 -0.001 -0.000 0.000 -0.002 -0.001 (-4.98)*** (-3.46)*** (-0.44) (0.71) (-3.93)*** (-2.52)**
LnCov -0.003 -0.001 -0.004 -0.002 -0.003 -0.001 (-8.18)*** (-4.06)*** (-6.56)*** (-3.93)*** (-7.53)*** (-3.74)***
LargeFiler -0.007 -0.002 -0.008 -0.003 -0.006 -0.001 (-6.84)*** (-3.72)*** (-5.54)*** (-2.95)*** (-5.90)*** (-1.41)
AccFiler -0.005 -0.001 -0.005 -0.001 -0.004 0.000 (-4.71)*** (-1.23) (-3.77)*** (-1.03) (-4.11)*** (0.45)
Horizon -0.001 -0.001 -0.001 -0.001 -0.001 -0.001 (-2.67)*** (-4.02)*** (-2.59)*** (-3.04)*** (-2.62)*** (-3.68)***
Violate 0.000 0.000 0.000 0.000 -0.000 -0.000 (0.54) (1.18) (1.05) (1.57) (-0.58) (-0.12)
Standardized
Coefficient -0.445 -0.820 -0.870 -1.376 -0.597 -1.075
This table presents the 2SLS regression estimates of analysts forecast properties on XBRL extension rates for my sample of 21,800 firm-quarter
observations(12,473 firm-quarter observations for models that include StdDev) for the period from 2010 to 2014. The first two columns use the extension
48
rate within the balance sheet as the independent variable of interest. The next two columns use the extension rate within cash flow statement as the
independent variable of interest. The last two columns use the extension rate within income statement as the independent variable of interest. The
dependent variables are indicated in the column heading. Error is the magnitude of the median one period ahead quarterly EPS in the 30-day
following the filing minus the respective actual quarterly EPS, scaled by the stock price just prior to the filing and StdDev is the standard deviation
of one-period ahead quarterly EPS forecast in the 30-day period following the XBRL filing scaled by the stock price just prior to the filing. All
other variables are defined in Appendix B. The last row reports the standardized (beta) coefficients for the extension rate variable. ***, **, * indicate
significance at the 1%, 5%, and 10% levels of significance. T-statistics are presented in parentheses and are adjusted for arbitrary heteroscedasticity,
and standard errors are clustered by firm. Industry, Year, and Software fixed effects are included in the model but are not reported for brevity.
49
Table 9
Cross-sectional differences in the association between XBRL Extensions and Analysts’ Forecast Error and Forecast Dispersion
Panel A: StdRet
Error StdDev
High Low Diff High Low Diff
Extension Rate -0.108 -0.000 -0.108 -0.057 -0.008 -0.049
(-3.11)*** (-0.04) (-3.02)*** (-3.73)*** (-1.96)** (-3.13)***
Observations 10,397 10,404 5,700 6,121
Kleibergen-Paap Rank LM 23.695 62.322 22.778 49.289
P-value <0.001 <0.001 <0.001 <0.001
Panel B: StdROE
Error StdDev
High Low Diff High Low Diff
Extension Rate -0.064 -0.020 -0.044 -0.050 -0.007 -0.043
(-2.72)*** (-1.94)* (-1.72)* (-3.76)*** (-2.06)** (-3.16)***
Observations 10,653 10,654 6,157 6,013
Kleibergen-Paap Rank LM 40.814 45.798 28.384 37.830
P-value <0.001 <0.001 <0.001 <0.001
Panel C: XBRL Technical Errors
Error StdDev
Tech. Error No Tech Error Diff Tech. Error No Tech Error Diff
Extension Rate -0.046 -0.050 0.004 0.024 -0.034 0.057
(-0.32) (-3.82)*** (0.02) (0.08) (-5.07)*** (0.19)
Observations 4,539 17,261 2,505 9,968
Kleibergen-Paap Rank LM 1.191 76.328 0.032 57.925
P-value 0.275 <0.001 0.857 <0.001 Continued on the next page
50
Table 9 (Continued)
Panel D: AcctComp4
Error StdDev
High Low Diff High Low Diff
Extension Rate -0.012 -0.066 0.053 -0.012 -0.045 0.034
(-1.14) (-2.46)** (1.86)* (-1.89)* (-3.05)*** (2.07)**
Observations 6,775 6,787 3,850 3,819
Kleibergen-Paap Rank LM 31.135 21.295 33.241 17.114
P-value <0.001 <0.001 <0.001 <0.001
Panel E: AcctCompInd
Error StdDev
High Low Diff High Low Diff
Extension Rate 0.005 -0.086 0.091 -0.007 -0.049 0.042
(0.42) (-2.95)*** (2.88)*** (-1.23) (-3.45)*** (2.75)***
Observations 6,791 6,771 3,789 3,880
Kleibergen-Paap Rank LM 36.519 22.204 34.698 20.470
P-value <0.001 <0.001 <0.001 <0.001
Panel F: Abs(AbAcc)
Error StdDev
High Low Diff High Low Diff
Extension Rate -0.069 -0.028 -0.041 -0.046 -0.023 -0.023
(-1.93)* (-1.97)** (-1.05) (-2.44)** (-2.64)*** (-1.08)
Observations 7,203 7,202 4,005 4,467
Kleibergen-Paap Rank LM 21.208 44.139 16.991 29.441
P-value <0.001 <0.001 <0.001 <0.001 Continued on the next page
51
Table 9 (Continued)
Panel G: AQ
Error StdDev
High Low Diff High Low Diff
Extension Rate -0.086 -0.007 -0.079 -0.047 -0.021 -0.026
(-2.72)*** (-0.52) (-2.22)** (-2.74)*** (-2.36)** (-1.36)
Observations 7,330 7,323 4,139 4,618
Kleibergen-Paap Rank LM 24.060 34.677 17.456 27.988
P-value <0.001 <0.001 <0.001 <0.001
This table presents regression of analysts’ forecast properties on XBRL over all extension rates for my sample for the period from 2010 to 2014.
Subsamples are formed based on the annual median value of the respective partitioning variable. Firm-quarter observations vary with the different
sub-samples and are reported in the table. Panel A uses Abs(AbAcc), the magnitude of the Modified-Jones Model abnormal accruals for the previous
fiscal year, as the partitioning variable. The partitioning variable in Panel B is the AQ which is the accruals estimation error measured following
Francis et al. (2005). The partitioning variables in Panel C and D are the standard deviation of returns (StdRet) measured over the previous fiscal
year and the standard deviation of ROE (StdROE) measured over the four quarter period ending at the beginning of the current fiscal year. The
dependent variables are indicated in the column heading. Error is the magnitude of the median one-period ahead quarterly EPS in the 30-day
following the filing minus the respective actual quarterly EPS, scaled by the stock price just prior to the filing; and StdDev is the standard deviation
of one-period ahead quarterly EPS forecast in the 30-day period following the XBRL filing scaled by the stock price just prior to the filing. The
independent variables of interest is the document extension rate (ExtendRate) defined as the number of extended items in the XBRL filing divided
by the number of total items in the XBRL filing; All other variables are defined in Appendix B. ***, **, * indicate significance at the 1%, 5%, and
10% levels of significance. T-statistics are presented in parentheses and are adjusted for arbitrary heteroscedasticity, and standard errors are clustered
by firm. Industry, Year, and Software fixed effects are included in the model but are not reported for brevity. Control variables indicated in equations
1 and 2 are included in the model but not reported.
Table 10
First Stage Regression of Number of Tags on the Number of Standard Tags
Total Tags Extended Tags
Total Taxonomy Tags 0.037 -0.028 (6.93)*** (-2.53)**
Size 0.110 0.193 (22.34)*** (19.25)***
Age -0.030 -0.081 (-3.40)*** (-4.51)***
StdROE 0.019 0.048 (2.55)** (2.48)**
Intang 0.182 1.496 (0.51) (1.79)*
ROA -0.266 -1.323 (-2.50)** (-5.11)***
CORR -0.003 -0.020 (-0.32) (-1.08)
EPS -0.029 -0.045 (-4.72)*** (-3.62)***
Q4 0.613 0.758 (74.95)*** (46.64)***
LnCov -0.063 -0.093 (-7.76)*** (-5.67)***
LargeFiler -0.027 -0.116 (-1.26) (-2.69)***
AccFiler -0.040 -0.102 (-2.20)** (-2.67)***
Horizon -0.016 -0.017 (-3.19)*** (-1.57)
Violate 0.081 0.104 (11.72)*** (6.54)***
Adjusted R2 0.641 0.503
KP Rank LM Statistic 45.394 142.137
P-value <0.001 <0.001
This table presents first stage regression of XBRL extension rates on the total taxonomy tags for my
sample of 21,800 firm-quarter for the period from 2010 to 2014. The dependent variables are indicated
in the column heading. Total Tags is the total number of unique tags defined within the filings, and
Extended Tags is the number of unique extended items used in a filing. The instrumental variable (Total
Taxonomy Tags) is the total number of standard tags defined by the official US GAAP taxonomy used
in the filing. All other variables are defined in Appendix B. ***, **, * indicate significance at the 1%,
5%, and 10% levels of significance. T-statistics are presented in parentheses and are adjusted for
arbitrary heteroscedasticity, and standard errors are clustered by firm. Industry, Year, and Software
fixed effects are included in the model but are not reported for brevity
Table 11
Regression of Analysts’ Forecast Properties on the Total Number of Tags
OLS 2SLS
Error StdDev Range Error StdDev Range
Total Tags 0.003 0.001 0.003 0.020 0.012 0.024
(5.80)*** (4.54)*** (6.84)*** (3.95)*** (4.51)*** (4.79)***
Size 0.000 0.000 0.000 -0.002 -0.001 -0.002
(2.37)** (1.58) (0.39) (-2.65)*** (-3.62)*** (-4.10)***
Age -0.001 -0.000 -0.001 -0.000 0.000 -0.000
(-3.42)*** (-1.59) (-2.74)*** (-1.18) (0.88) (-0.01)
StdROE 0.001 0.001 0.001 0.001 0.000 0.001
(2.26)** (4.09)*** (4.50)*** (1.54) (2.57)** (3.14)***
Intang 0.106 0.059 0.056 0.103 0.056 0.052
(4.77)*** (5.83)*** (3.75)*** (4.59)*** (5.34)*** (3.31)***
ROA -0.042 -0.021 -0.029 -0.038 -0.019 -0.024
(-7.63)*** (-7.24)*** (-7.67)*** (-6.34)*** (-6.31)*** (-5.41)***
CORR -0.000 -0.000 0.000 -0.000 -0.000 0.000
(-0.82) (-0.52) (0.23) (-0.58) (-0.25) (0.40)
EPS -0.001 -0.000 -0.001 -0.001 -0.000 -0.001
(-5.27)*** (-4.14)*** (-5.90)*** (-2.74)*** (-0.73) (-2.17)**
Q4 -0.003 -0.001 -0.001 -0.014 -0.008 -0.015
(-6.83)*** (-4.94)*** (-4.74)*** (-4.37)*** (-4.65)*** (-4.72)***
LnCov -0.003 -0.001 0.001 -0.002 -0.000 0.002
(-8.69)*** (-5.20)*** (3.98)*** (-3.88)*** (-0.04) (5.34)***
LargeFiler -0.006 -0.001 -0.001 -0.006 -0.001 -0.000
(-6.60)*** (-3.19)*** (-1.64) (-5.48)*** (-1.30) (-0.44)
AccFiler -0.004 -0.000 0.001 -0.004 0.000 0.002
(-4.43)*** (-0.70) (1.97)** (-3.48)*** (0.50) (2.67)***
Horizon -0.001 -0.001 0.001 -0.000 -0.000 0.001
(-1.86)* (-3.27)*** (6.08)*** (-0.95) (-0.57) (6.01)***
Violate -0.000 -0.000 0.000 -0.002 -0.001 -0.002
(-0.69) (-0.03) (0.59) (-3.23)*** (-3.66)*** (-3.61)***
54
This table presents regression of analysts’ forecast properties on XBRL extension rates for my sample of 21,800 firm-quarter observations (12,473 firm-quarter
observations for models that include StdDev) for the period from 2010 to 2014. The first three columns present the OLS estimates whereas the last three columns
present the 2SLS estimates. The dependent variables are indicated in the column heading. Error is the magnitude of the median one-period ahead quarterly EPS
in the 30-day following the filing minus the respective actual quarterly EPS, scaled by the stock price just prior to the filing; StdDev is the standard deviation
of one-period ahead quarterly EPS forecast in the 30-day period following the XBRL filing scaled by the stock price just prior to the filing; and Range is
the range of one-period ahead quarterly EPS forecast in the 30-day period following the XBRL filing scaled by the stock price just prior to the filing. The
independent variables of interest is the total number of unique tags defined within the filings. All other variables are defined in Appendix B. ***, **, * indicate
significance at the 1%, 5%, and 10% levels of significance. T-statistics are presented in parentheses and are adjusted for arbitrary heteroscedasticity, and standard
errors are clustered by firm. Industry, Year, and Software fixed effects are included in the model but are not reported for brevity.
55
Table 12
Regression of Analysts’ Forecast Properties on the Number of Extended Tags
OLS 2SLS
Error StdDev Range Error StdDev Range
Extended Tags 0.001 0.000 0.001 -0.007 -0.005 -0.009
(2.18)** (3.04)*** (3.02)*** (-3.73)*** (-5.17)*** (-4.97)***
Size 0.000 0.000 0.000 0.000 0.000 0.000
(2.30)** (1.50) (0.30) (2.78)*** (2.11)** (1.21)
Age -0.001 -0.000 -0.001 -0.001 -0.000 -0.001
(-3.34)*** (-1.48) (-2.64)*** (-4.00)*** (-2.45)** (-3.50)***
StdROE 0.001 0.001 0.001 0.001 0.001 0.002
(2.24)** (4.01)*** (4.48)*** (2.41)** (4.38)*** (4.41)***
Intang 0.105 0.058 0.055 0.115 0.063 0.067
(4.73)*** (5.79)*** (3.69)*** (5.07)*** (5.80)*** (4.14)***
ROA -0.041 -0.020 -0.029 -0.048 -0.024 -0.037
(-7.46)*** (-7.18)*** (-7.55)*** (-8.31)*** (-7.46)*** (-8.17)***
CORR -0.000 -0.000 0.000 -0.000 -0.000 -0.000
(-0.77) (-0.49) (0.29) (-1.14) (-0.70) (-0.32)
EPS -0.001 -0.000 -0.001 -0.001 -0.000 -0.001
(-5.28)*** (-4.14)*** (-5.91)*** (-5.06)*** (-3.85)*** (-5.48)***
Q4 -0.003 -0.001 -0.001 -0.005 -0.002 -0.004
(-6.21)*** (-4.29)*** (-3.89)*** (-7.14)*** (-6.65)*** (-6.66)***
LnCov -0.003 -0.001 0.001 -0.003 -0.001 0.001
(-8.71)*** (-5.27)*** (3.94)*** (-8.19)*** (-4.08)*** (4.00)***
LargeFiler -0.006 -0.001 -0.001 -0.007 -0.002 -0.002
(-6.55)*** (-3.10)*** (-1.54) (-6.85)*** (-3.83)*** (-2.38)**
AccFiler -0.004 -0.000 0.001 -0.005 -0.001 0.001
(-4.41)*** (-0.67) (2.03)** (-4.58)*** (-1.01) (1.21)
Horizon -0.001 -0.001 0.001 -0.001 -0.001 0.001
(-1.87)* (-3.31)*** (6.05)*** (-1.68)* (-2.60)*** (5.86)***
Violate -0.000 0.000 0.000 -0.000 -0.000 -0.000
(-0.62) (0.04) (0.72) (-1.30) (-0.68) (-0.67)
56
This table presents regression of analysts’ forecast properties on XBRL extension rates for my sample of 21,800 firm-quarter observations (12,473 firm-quarter
observations for models that include StdDev) for the period from 2010 to 2014. The first three columns present the OLS estimates whereas the last three columns
present the 2SLS estimates. The dependent variables are indicated in the column heading. Error is the magnitude of the median one-period ahead quarterly EPS
in the 30-day following the filing minus the respective actual quarterly EPS, scaled by the stock price just prior to the filing; StdDev is the standard deviation
of one-period ahead quarterly EPS forecast in the 30-day period following the XBRL filing scaled by the stock price just prior to the filing; and Range is
the range of one-period ahead quarterly EPS forecast in the 30-day period following the XBRL filing scaled by the stock price just prior to the filing. The
independent variables of interest are the natural log of the number of unique extended tags defined in the filing, All other variables are defined in Appendix B.
***, **, * indicate significance at the 1%, 5%, and 10% levels of significance. T-statistics are presented in parentheses and are adjusted for arbitrary
heteroscedasticity, and standard errors are clustered by firm. Industry, Year, and Software fixed effects are included in the model but are not reported for brevity.