a model of the optimal tax mix including capital...
TRANSCRIPT
-
1
Draft Paper
A model of the optimal tax mix including capital
taxation
Abstract
This paper uses a three-tax linear model to study numerically the optimal tax mix. The
three aggregate taxes relate to labour, capital and commodities respectively. The most striking
result is that with a linear expenditure system (LES) based on econometric estimates, indirect
(i.e. commodity) taxation dominates the optimal tax mix, with over 80% of tax revenue coming
from indirect taxes. Although some analytical justification is presented for this surprising result,
it should be tested numerically using other econometrically estimated utility-demand systems.
Comprehensive computational modelling of the optimal tax mix has been so far missing from
the literature. Considering the results presented here, there seems to be considerable scope for
further research in this area. From a theoretical perspective, a major innovation in this paper is
the introduction of capital taxation into a static tax mix model. The analytical part of the paper
provides approximate predictions for optimal linear tax rates (including capital tax) called the
modified inverse elasticity rule. Using these formulas, it is possible to provide some
explanations for the dominance of indirect taxation in the optimal tax mix.
keywords: Optimal taxation, capital taxation, income taxation, indirect taxation
JEL Classification: H21, H23, H26, C63
Introduction
This paper uses a three-tax linear model to study numerically the optimal tax mix. The
three aggregate linear taxes relate to labour, capital and commodities respectively. The most
striking result is that with a linear expenditure system (LES) based on econometric estimates,
indirect (i.e. commodity) taxation dominates the optimal tax mix, with over 80% of tax revenue
coming from indirect taxes. From a theoretical perspective, a major innovation in this paper is
the introduction of capital taxation into a static tax mix model. The analytical part of the paper
provides approximate predictions for optimal linear tax rates (including capital tax), called the
modified inverse elasticity rules. Using these formulas, it is possible to examine the reasons for
the dominance of optimal indirect taxation.
The three-tax model studied here is a fairly comprehensive one, incorporating labour,
capital and indirect taxes. It is assumed that there is only one linear labour income tax rate,
denoted 𝜏𝐿 , and a single capital income tax rate, denoted 𝜏𝑘. Labour income is denoted zi = wiLi, where wi is the fixed wage rate for household i and Li is the corresponding labour supply.
Capital income is capital (k) multiplied by a fixed average pre-tax real rate of return (r), that is
kr. r and wi are assumed to be fixed. The net after-tax rate of return is 𝑟(1 − 𝜏𝑘). ̀ r´ is measured after taking away depreciation and the remuneration of self-employed.
-
2
It is assumed that capital income is taxed separately from labour income. 𝜏𝑐 is a uniform proportional indirect tax rate on all commodities, 𝑝𝑗 are constant producer prices and 𝑞𝑗 are
quantities. Total consumption is: C= ∑ 𝑝𝑗𝑞𝑗𝑗 .
This model treats capital as merely an income generating tool, which means that unlike
commodities and leisure, it does not enter directly into the social welfare function (U). It is
assumed that there is only one type of capital (k), with a uniform rate of return (r). However,
wages and commodity prices can be heterogeneous. Then the linear model is given as:
maximize U = ∑ 𝑢𝑖(𝐪𝒊, 𝐿𝑖)𝑖 (1)
s.t. the budget constraint ∑ (𝜏𝐾𝑖 𝑘𝑖𝑟 + 𝜏𝐿𝑤𝑖𝐿𝑖 + 𝜏𝐶 ∑ 𝑝𝑗𝑞𝑗𝑖 𝑗 ) ≥ 𝐻𝑏 + 𝐺 (1a)
Income redistribution is carried out by providing a uniform demogrant (b) to every member of
the population. G is fixed public goods expenditure requirement. H is total population. In the
numerical work a linear expenditure (LES) utility function was used.1 Note that capital is
missing from the social welfare function in (1) but enters constraints (1a).
Aggregate capital supply is an increasing function of the after-tax rate of return to
capital, 𝑟(1 − 𝜏𝐾), and total real income M.
Symbolically: 𝐾 = 𝜑(𝑟(1 − 𝜏𝐾), 𝑀) (2)
A specific form of this function will be presented in eq. (18).
By definition:
rK + wL = 𝑀 = ∑ 𝑝𝑗𝑞𝑗𝑗 + 𝐺 = 𝐶 + 𝐺 (3)
In these equations: K is the aggregator of capital stock; L is the aggregator of labour supply. C
is total consumption. 𝑀 is price-indexed total income.
Defining lump-sum income as y = rK, then from (3):
𝜕𝐶
𝜕𝑦 = ∑ 𝑝
𝑗
𝜕𝑞𝑗
𝜕𝑦𝑗= 1 + 𝑤
𝜕𝐿
𝜕𝑦+ 𝑟
𝜕𝐾
𝜕𝑦 (4)
The social marginal utility of income is: 𝑔𝑖 = 𝜕𝑢𝑖
𝜕𝑚𝑖 =
𝜕𝑢𝑖
𝜕𝑦 (5)
Essentially, 𝑔𝑖 reflects political value judgments concerning income distribution. The relevant utility transformation parameter is referred to in the literature as the inequality aversion rate.
In the present model an inequality aversion rate of one corresponds to LES utility. A rate of
zero corresponds to equal marginal utilities for everyone, i.e. the absence of distributional
objectives. A rate between zero and one is a convex combination of the two.
1 This model is not zero homogeneous due to the presence of an endogenously determined lump-sum grant
(b). The indeterminacy of the optimum with zero homogeneous utility is discussed in Mirrlees (1976, pp. 330-
332)
-
3
The model described above, excluding capital taxation, is close to the standard income-
commodity tax models in the optimal taxation literature. Like other optimal taxation models, it
can highlight some aspects of the equity-efficiency trade-off. However, it has severe limitations
due to its high level of abstraction. First, population is heterogenous not only due to differences
in wage rates and capital incomes, but because of other features as well. Perhaps the most
important among these is household composition. Also, political preferences are only partly
captured by the single inequality aversion rate and the uniform demogrant. Distributional
preferences cover complex political considerations in relation to incomes, household
composition, place of residence, source of income, and with support grants (b), also in respect
to the provision of education and health services. Altogether, optimal taxation models, like the
present one, can offer some insights, but only a limited guide to policy.
The paper is structured as follows: The next section presents a literature review on the
subject. It is followed by approximate formulas for optimal linear capital, labour and
commodity tax rates, based on the specifications described in eq. (1). Details of the
mathematical analysis are presented in the Online Supplemental Appendix. The next sections
examine the computational model. It starts by explaining the optimisation methodology. This
is followed by numerical results. The interior point numerical optima will be compared with
the predictions from the modified inverse elasticity formulas. Finally, some explanations will
be provided to account for the dominance of indirect taxation in the optimal tax mix.
The final section presents summary and conclusions. The main conclusion is that there
is a need for further computational research, to test the validity of the conjecture about the
dominance of indirect taxation, using other empirically based utility-demand functions.
The location of the computer program used to produce the numerical results appears in
the references under Revesz (2019). The same website also contains the user’s guide. The
interested reader can experiment with this program for checking the numerical results presented
here and for testing results under different parameters and specifications.
Related literature
It is impossible to review in this short paper the large literature pertaining to the tax mix
issue. The discussion here starts with optimal taxation theory. The optimal tax mix model
involving labour and commodity taxes has been introduced into the literature by Atkinson and
Stiglitz (1976) and Mirrlees (1976). An important result is the Atkinson-Stiglitz (1976)
theorem. This theorem states that provided income tax is the control theoretic non-linear
solution and utility is weakly separable between commodities and leisure, there is no need to
apply commodity taxation in conjunction with income tax. Income tax will be sufficient to
generate all tax revenue. This is exactly the opposite to what is coming out from the present
linear model, which suggests that under linear taxation, the bulk of tax revenue should be
generated from indirect taxation rather than income tax.
-
4
The Atkinson-Stiglitz theorem is based on a number of not too realistic assumptions,
such as the absence of tax evasion, the absence of administration and compliance costs, weakly
separable utility between commodities and leisure, no externalities, no paternalistic concerns
and the assumption that the control theoretic continuous and differentiable income tax function
can be implemented in practice. Because of these assumptions, the Atkinson-Stiglitz (1976)
theorem had been criticised in the literature, particularly in connection with the debate about
indirect tax uniformity. The interested reader can find comprehensive reviews in Tuomala
(2016, Chapter 12) and Revesz (2018 pp. 3-28).
Some of the above-mentioned omissions also apply in the present model, however if tax
evasion, administrative and compliance costs, leisure complements-substitutes, externalities
and paternalistic concerns would have been included, they would have further reinforced the
conclusion in favour of indirect taxation.
Another subject from the optimal taxation literature that has been taken up in the present
model is capital taxation. There is a large and disparate literature on this subject. A recent
literature review on optimal capital taxation and related topics is presented by Bastani and
Waldenström (2018). A shorter literature review is contained in Piketty and Saez (2012, pp. 1-
3). Studies that show some similarities in their mathematical framework to the present model
on capital taxation include Saez and Stantcheva (2018) and Piketty and Saez (2012, 2013).
Note, the present model is largely irrelevant to several issues raised in the capital taxation
literature. These include the treatment of contributions to retirement benefit funds (Diamond,
1999), and the role of capital taxation in alleviating the uncertainties of life (Conesa et al., 2008;
Golosov et al., 2006; Krueger and Ludwig, 2018).
Apart from the Atkinson-Stiglitz theorem and optimal capital taxation, there is little else
in the optimal taxation literature that is closely related to the subject matter of this paper.
A more relevant strand in the literature is what could be described as policy-related
empirical studies on the tax mix. These studies examine empirical observations on the direct-
indirect tax mix combined with economic reasoning and econometric estimates, particularly on
the relationship between the tax mix and economic growth. These reports include Fitoussi
(2005), Martinez-Vazquez et. al (2009, pp. 1-27), Arnold et al. (2011) and European
Commission (2013, pp. 42-51). Generally, these policy-related reports are in favour of indirect
taxation, advocating greater reliance on commodity and property taxation at the expense of
direct taxes. IMF (2014, pp. 36-43) also deals with this subject, but sees a limited role for
indirect taxation in the advisable tax mix, because of its regressivity.
The main arguments advanced against income taxation (both labour and capital) are:
• The labour and saving disincentives related to progressive income taxation, which are absent with indirect taxation.
• The lower evasion propensities of indirect taxes on goods and services produced and/or marketed through large organisations.
• Simpler administration and lower administrative and compliance costs of indirect taxes compared with direct taxes.
-
5
One may add to these several other factors in favour of indirect taxation, such as the
need for higher taxes to offset negative externalities, more scope for taxing/subsidising leisure
complements and substitutes, as well as more scope for accommodating paternalistic concerns.
None of these factors appear in the present model. Effectively, the argument in favour of
indirect taxation presented here is separate and additional to arguments that appeared elsewhere
in the literature. Generally, the above-mentioned empirical studies are in line with the main
conclusion of this paper, but unlike the present study, they are not based on comprehensive
mathematical modelling in the optimal taxation tradition.
The modified inverse elasticity rules
This section concentrates mainly on capital taxation, but labour and commodity taxation
will be covered as well. The present optimal capital and labour income taxation model is in
some respects a static version of the linear steady-state model developed by Saez and
Stantcheva (2018).
Regarding capital taxation some definitions and explanations are needed. Suppose we
have an international panel data spanning more than 20 years, which links capital income per
person to the average capital income tax rate. From that longitudinal and cross-country
database, it is possible to obtain through linear regression a parameter estimate for the derivative
of real (inflation adjusted) capital income per person (kr) over the average capital income tax
rate (𝜏𝑘), as well as its derivative with respect to real per capita income (m). To the best of my kmowledge, no extensive cross-country research has been carried out so far to estimate the tax
elasticity of capital, although the required data are available in many countries. Such data are
also available for more disaggregated estimates of the tax elasticity of capital supply, according
to type of investment and source of investment. For example: domestic savings, capital inflow
and outflow, foreign direct investment (FDI), investment in machinery and equipment,
bequests, corporate income, etc.
Denoting total capital as K, the following discussion will use 𝜕𝐾
𝜕(1−𝜏𝐾) instead of
𝜕𝐾
𝜏𝐾 ,
in order to produce comparable expressions to Saez and Stantcheva (2018). Assuming that the
central parameter estimates in the model are based on observations over a long period of time,
the static model may be regarded to represent a long-run model. The tax elasticity of capital
supply is defined in respect to the after-tax return to capital as:
𝑒𝐾 = (1−𝜏𝐾 )
𝐾𝑟
𝜕𝐾𝑟
𝜕(1−𝜏𝐾) (6)
Saez and Stantcheva (2018) used three estimates for 𝑒𝐾 in their numerical work. These are 0.25, 0.5 and 1. Perhaps 𝑒𝐾 = 1 is the most realistic, bearing in mind the strong international mobility of capital. In any case, in the present computational work all three estimates are used.
In equation (1a), ∑ 𝜏𝐾𝑖 𝑘𝑖𝑟 represents “real” capital tax revenue and not nominal revenue. The difference between the two is the dead-weight loss associated with raising the
capital tax rate from 0 to 𝜏𝐾. From the theory of excess burden (Hoy et al (2011), given a linear
supply curve, this amounts to the triangle 𝑟𝜕𝐾
𝜕𝜏𝐾 𝜏𝐾
2
2 . Hence, in nominal terms equation eq. (1a)
should appear as:
∑ (𝜏𝐾𝑖 𝑘𝑖𝑟 −𝑟𝜕𝐾
𝜕𝜏𝐾 𝜏𝐾
2
2 + 𝜏𝐿𝑤𝑖𝐿𝑖 + 𝜏𝐶 ∑ 𝑝𝑗𝑞𝑗𝑖 𝑗 ) ≥ 𝐻𝑏 + 𝐺 (1b)
-
6
It is applied to reduce “real” capital income in the numerical simulations. It is also used to work
out the inverse elasticity rule for capital in the Online Supplemental Appendix.
The specifications described above yield an approximate formula for the optimal
capital income tax rate. This formula can be expanded also to optimal linear labour income tax.
The approximation to the optimal linear capital and labour income tax rates follows the same
procedure described in respect to approximate optimal indirect tax rates in Revesz (2014, pp.
47-51).
The solution for the optimal capital tax formula is based on deriving the welfare function
in (1) in respect to 𝜏𝐾 and then equalizing the sum of resulting components to zero. There are three welfare effects to consider following a change in 𝜏𝐾: (i) the income effect on capital owners, (ii) the effect on social utility of variations in tax revenue and consequently in the
uniform lump-sum grant (b), (iii) the variations in dead-weight costs that arise due to tax related
distortions in the allocation of resources (based on definition (1b)).
Details of the derivations for the approximately optimal capital tax rate are presented in
the Online Supplemental Appendix. The end result, titled the modified inverse elasticity rule of
capital taxation (from A.15 in the Appendix) is:
𝜏𝐾
(1−𝜏𝐾)=
− �̅�𝑘�̅�𝐻
+ �̅�𝑏�̅�𝐻
( 1 − 𝜏𝐶𝜕𝑐
𝜕𝑦 − 𝜏𝐿
𝑤𝜕𝐿
𝜕𝑦)
𝑒𝐾{1
(1− 𝜏𝐾) −
𝜏𝐾2(1− 𝜏𝐾)
(1+ 𝑒𝐾)} (7)
The average social marginal utility of income of capital income recipients is defined as �̅�𝑘, the average for demogrant recipients is �̅�𝑏 , while the average for the entire population is �̅�𝐻. Since
everybody is receiving the demogrant �̅�𝑏 = ∑ 𝑔𝑖𝑖
𝐻⁄ , while �̅�𝐻 is the income-weighted
average: �̅�𝐻 = ∑ 𝑔𝑖𝑖 𝑚𝑖
𝑀⁄
The 𝑤𝜕𝐿𝑖
𝜕𝑦𝑖 term represents the change in labour income in response to variation in lump-
sum income. 𝜕𝑚𝑖
𝜕𝑦𝑖= 𝑤
𝜕𝐿𝑖
𝜕𝑦𝑖 is known in the labour supply literature as the income effect (see
Hum and Simpson (1994, pp. 72-74)). Its estimated value usually ranges between -0.1 to -0.4.
It is not difficult to see that the denominator on the right-hand side is usually close to
𝑒𝐾. When 𝑒𝐾 = 1 then the denominator will be 𝑒𝐾. When 𝑒𝐾 = 2 then the denominator will
be 𝑒𝐾{1 - 𝜏𝐾
2(1− 𝜏𝐾) }. When 𝑒𝐾 = 0.5 then the denominator will be: 𝑒𝐾{1 +
𝜏𝐾
4(1− 𝜏𝐾) }.
Using the methodology described in the Online Supplemental Appendix, it is possible
to generalise the inverse elasticity rule of capital taxation to linear labour income tax as well.
In this case the (compensated) elasticity of labour supply with respect to the net after-tax
return, 𝑤(1 − 𝜏𝐿), is defined as:
𝑒𝐿(u0) = (1−𝜏𝐿 )𝑤
𝑤𝐿
𝜕𝐿(u0)
𝜕(1−𝜏𝐿) . (8)
-
7
Econometric estimates of eL(u0) usually range between 0.15 and 0.5 (Hum and Simpson, 1994,
pp. 72-74). Higher estimates can be obtained when considering not only changes in working
hours, but also long-term responses related to occupational choice and human capital
accumulation. In the present numerical work, the mean value of eL(u0) is given between 0.40
and 0.48, depending on the scenario.
The average social marginal utility of wage earners is defined as: 𝑔𝐿 =∑ 𝑤𝑖𝐿𝑖𝑔𝑖𝑖
∑ 𝑤𝑖𝐿𝑖𝑖. The
marginal capital-income ratio is 𝜕𝐾
𝜕𝑚 , where total income m = Kr + wL. In this paper
𝜕𝐾
𝜕𝑚 =
𝜕𝐾
𝜕𝑦
defines the net change in capital stock related to a change in static income, after taking away
depreciation. Based on empirical evidence, it is around 3. Using these new variables, and
following the same steps outlined in the Online Supplemental Appendix, will lead to the
following modified inverse elasticity rule for the optimal linear labour income tax:
τL
(1-τL)=
- g̅Lg̅H
+ g̅bg̅H
( 1 - τC∂c
∂y - 𝜏𝐾𝑟
𝜕𝐾
𝜕𝑚)
𝑒𝐿(𝑢0){1
(1− 𝜏𝐿) −
𝜏𝐿
2(1− 𝜏𝐿)(1+ 𝑒𝐿(𝑢0))}
(9)
Deriving the modified inverse elasticity rule in respect to commodity taxation follows
the methodology described in Revesz (2014, pp. 47-51). Eq. (9) there gives the predicted linear
tax rate for a single composite good as:
𝜏𝐶 ≈ 1 + 𝜏𝐶 −
g̅bg̅C
(1 + 𝜏𝐶 + 𝜀�̅�𝐿𝜏𝐶)
𝜀�̅�(𝑢0) (10)
Here g̅C
represents the weighted average social marginal utility of consumers and g̅b that of
demogrant recipients. �̅�𝑐𝐿 =𝜕𝐿
𝜕(1+𝜏𝐶)
(1+𝜏𝐶)
𝐿 is the elasticity of labour supply over the cost of
consumption. Taking zero homogeneous labour supply, where b is an exogeneous variable
the following condition must be satisfied:
𝜕𝐿
𝜕(1+𝜏𝐶)𝑐 +
𝜕𝐿
𝜕(1+𝜏𝐿) 𝐿 +
𝜕𝐿
𝜕(1+𝜏𝐾) 𝐾 +
𝜕𝐿
𝜕𝑦 𝑏 = 0 (11a)
Assuming that the last two terms are relatively small, leads to the following approximation:
𝜀�̅�𝐿 = −𝜖�̅�(1+𝜏𝐶)
(1+𝜏𝐿) , (11b)
where 𝜖𝐿 = 𝜕𝐿
𝜕(1+𝜏𝐿)
(1+𝜏𝐿)
𝐿 is the uncompensated elasticity of labour supply. This approximation
opens the way for estimating the unknown 𝜀�̅�𝐿 using estimates for ∈̅𝐿 . According to empirical estimates (Hum and Simpson, 1994, pp. 72-74), ∈̅𝐿 lies in the range between +0.4 and -0.15, hence 𝜀�̅�𝐿 will be oppositely signed in a similar range. The relatively small value of 𝜀�̅�𝐿 will be of considerable importance when explaining the dominance of indirect taxation.
When τL and τK are also included in the model, they should be added to the effect of
change in labour supply on tax revenue. Consequently, in the present model a better
approximation to (10) will be:
𝜏𝐶 ≈ 1+𝜏𝐶 −
g̅b
g̅H(1+ 𝜏𝐶+ �̅�𝐶𝐿 (𝜏𝐶 + 𝜏𝐿𝑊
𝜕𝐿
𝜕𝑦 + 𝜏𝐾𝑟
𝜕𝐾
𝜕𝑦)
�̅�𝑐(𝑢0) (12)
This is the predictive formula for 𝜏𝐶 used in the numerical analysis.
-
8
Finally, a few words about the �̅�𝐿(𝑢0) and 𝜀�̅�(𝑢0) terms appearing in the denominators of (9) and (12). According to demand theory, the dead-weight loss of taxation is represented by
the triangle defined by the difference between the post-tax and pre-tax price multiplied by the
change in the quantity demanded. In its turn, the change in quantity is given by the change in
price multiplied by the derivative of compensated demand (not ordinary demand) (see Hines,
1998). In symbolic terms the dead-weight loss (D) following an increase in the labour tax from
zero to 𝜏𝐿 will be:
𝐷(𝐿) = 𝜏𝐿
2(
𝜕𝑊𝐿
𝜕𝜏𝐿(𝑢0)𝜏𝐿) =
𝜏𝐿2
2
𝜕𝑊𝐿
𝜕𝜏𝐿(𝑢0) ≈ −
𝜏𝐿2
2𝑒𝐿(𝑢0)
𝑊𝐿
(1−𝜏𝐿) (13)
Similarly, for consumption tax: 𝐷(𝐶) ≈ − 𝜏𝐶
2
2𝑒𝐶(𝑢0)
𝐶
(1−𝜏𝑐) (14)
where 𝑢0 designates compensated labour supply and compensated commodity demand respectively. In the case of capital supply, the dead-weight loss is defined not by a utility
function, but by constraint (1b). In this case the dead-weight loss is defined as: −𝑟𝜕𝐾
𝜕𝜏𝐾 𝜏𝐾
2
2 , and
there was no need to apply a compensated elasticity approach.
The framework and specifications of the numerical model
There are several utility functions, including the LES, CES, AIDS, NLPS and direct and
indirect addilog, which define explicitly not only utility, but also the corresponding demand
and labour supply functions (see Thomas, 1987) for a given set of post-tax prices and incomes.
In the present computational work Linear Expenditure System (LES) utility-demand
was used. The reason for this choice was that already much earlier, we developed a numerical
tax model dealing with the question whether indirect taxation should be uniform or otherwise
(see Revesz 1997, 2014). The original idea was to extend this model to include also the
optimisation of labour and capital income tax.
Later, it was discovered that the original optimisation method, based on “gradient”
iterations, was not well-suited to this task, hence we turned to the three-tax “grid” search
optimisation method. Nonetheless, the basic model has been retained, including the LES utility
function, 15 taxpayers and 9 or 18 commodities.
LES appears in many publications. For one, see Thomas (1987, pp. 71-74). The present
version uses instead of income (m) the earning parameters w and y, as well as total time (Z)
and leisure (L). The LES utility function is defined as:
𝑢 = ∑ 𝛽𝑖 log (𝑞𝑖𝑖 − 𝛼𝑖) (15)
where qi represents the consumption of good i. 𝛼𝑖 and 𝛽𝑖 are function parameters. The corresponding demand functions for commodities are:
𝑞𝑖 = 𝛼𝑖 + 𝛽𝑖
𝑝𝑖 (𝑍𝑤 + 𝑦 − ∑ 𝑝𝑗 𝛼𝑗 𝑗 ) (16)
In the present model w = W(1 - 𝜏𝐿) and y = kr(1 - 𝜏𝐾) + b
Labour supply (L) is given as:
𝐿 = 𝑍 − 𝑞𝐿 = 𝑍(1 − 𝛽𝐿 ) − 𝛼𝐿 − 𝛽𝐿
𝑤(𝑦 − ∑ 𝑝𝑗𝑗 𝛼𝑗 ) ` (17)
-
9
where 𝑞𝐿 stands for leisure. Z is the upper limit of time available to work, therefore 𝑞𝐿 = 𝑍 − 𝐿
In the computational model it is assumed that constant elasticity capital supply from (3)
is given in the form: 𝑟𝐾 = 𝑟(𝐾0 +𝜕𝐾
𝜕𝑚(𝑚 𝑃⁄ − 𝑚0) )(1 − 𝜏𝐾)
𝑒𝐾 (18)
where 𝜕𝐾
𝜕𝑚 is the change in net capital (after depreciation) and m/P is price-indexed total income,
given as: m = (wL + rK)/P. From (18) we can obtain a more precise definition of dead-weight
loss than in (1b). The change in dead-weight costs when moving from 0 to 𝜏𝐾 will be then:
∆𝐷(𝐾) = − 𝑟𝑒𝐾(𝐾0 +𝜕𝐾
𝜕𝑚(𝑚 𝑃⁄ − 𝑚0))
(1− 𝜏𝐾)(𝑒𝐾−1)𝜏𝐾
2
2 (19)
The numerical model that produced the results presented in Table 1 has the following
features: Initially an ordinary LES function was used, covering 9 commodities and labour. Later
a two-part LES covering 18 goods was introduced. There are 15 taxpayers in the model. The
LES utility function satisfies quasi-concavity. Combined with a convex budget constraint that
ensures a unique global optimum. The inequality aversion rate was set to 0.7, in order to obtain
an overall tax burden in the range between 40% and 50%, as observed in several advanced
economies. The demand parameter estimates for the 9 goods model were taken from Deaton
and Muellbauer (1980, p 71). The labour and capital income distributions are blended from
several empirical distributions. Usually, expenditure on public goods is fixed, set to 10% of
initial tax-free total income. The computer program related to this paper (Revesz (2019))
displays in the DATA lines all the parameters and income values adopted in the numerical
work.
The search for the optimum was carried out using a “grid” search method. This method
involves checking all tax combinations in order to find the optimum. In the present model
discrete tax rates were separated by intervals of 0.01. With indirect taxation the calculations
were carried out from 0.1 to 1.1, implying 100 discrete taxes. With labour and capital taxation
the range was from 0.1 to 0.7, implying 60 discrete taxes. Altogether, 100*60*60 = 360,000
combinations were examined. At each round, the program compared the utility level of the
present combination with the highest utility recorded earlier. If the utility of the present
combination turned out to be higher than the earlier one, then it replaced the top utility
combination. Otherwise, no change was made.
The model was restricted to only three taxes, because this way the program could be run
in less than an hour on a personal computer. A four-tax model (say with two indirect tax rates,
one for luxuries and one for necessities, or with two income tax brackets) could run on more
powerful computers. Beyond that, the number of combinations will run into billions. It is
possible that less computation-intensive methods could be used to locate the global optimum,
but for the time being this subject has not been taken up.
The specifications in Table 1 are restricted to the choice of taxes, the capital supply
elasticity and the percentage of public goods from total income. These choices can be made
through prompts in the Revesz (2019) program. Other choices can be made in the program by
changing DATA lines, as explained in the user’s guide. Overall, the percentage share results
indicate that indirect taxes dominate the optimal tax mix, usually accounting for over 80% of
total tax revenue.
-
10
Table1
Numerical results with ordinary LES
No. of
taxes
Elasticity
of capital
% public
goods
Indirect
tax
Labour
tax
Capital
tax
change
welfare
2 - 0 Actual 0.61 0.1
Predicted 0.55 -
% share 86 14
Compared with 40% uniform tax % change 10 10 2
2 - 10 Actual 0.68 0.1
Predicted 0.61 -
% share 86 14
Compared with 40% uniform tax % change 8 8 2
2 1 10 Actual 0.42 0.10
Predicted 0.59 -
% Share 91 9
Compared with 40% uniform tax % change 9 -7 59 5
3 1 10 Actual 0.60 0.10 0.10
Predicted 0.61 - -
% Share 84 11 5
Compared with 40% uniform tax % change 22 4 69 14
3 0.25 10 Actual 0.59 0.10 0.12
Predicted 0.60 - 0.17
% Share 83 11 6
Compared with 40% uniform tax % change 15 10 21 6
3 0.25 0 Actual 0.48 0.10 0.12
Predicted 0.51 - 0.20
% Share 82 11 7
Compared with 40% uniform tax % change 17 14 22 6
Source: Own calculations using the program in Revesz (2019).
Predictions are only presented for cases where the actual tax rate is an interior point,
located above the 0.1 lower boundary line. For optimum values located at the boundaries, it is
not possible to apply predictive formulas, because these only apply at the interior. In all cases
the indirect tax rate is an interior point, but only in a few cases does this apply to labour or
capital tax.
The numbers in the fourth row require further explanations. The first three numbers
relate to quantities corresponding to tax rates, e.g. with indirect tax the corresponding quantity
will be total consumption. Each quantity in the optimal solution is compared with the
corresponding quantity obtained when all taxes are set to 40%. This percentage was chosen for
comparison because it is close the average tax burden in most scenarios. The purpose of the
exercise is to provide the reader a comparative view about the changes involved in moving from
a uniform solution to the highly differentiated optimal solution.
The change in welfare is more complicated. It is intended to reflect the change in utility
in monetary terms. It is based on dividing the change in utility by the Lagrangian using the
following formula: ∆𝑈∗ = 𝑈𝑜𝑝𝑡− 𝑈40%
𝜆𝑀𝑜𝑝𝑡 (20)
where 𝜆 = 𝑈(𝜏, 𝐺 + 1) − 𝑈(𝜏, 𝐺) and 𝑀𝑜𝑝𝑡 is total income in the optimal situation.
-
11
Equation (20) provides some tentative indication about the proportional change in welfare in
monetary terms.
The first scenario in Table 1 is a two-tax model involving only indirect and labour
taxation and no expenditure on public goods. The second scenario is the same, but with 10%
expenditure on public goods included. The results from both scenarios illustrate the clear
dominance of indirect taxation in the optimal tax mix.
The third scenario shows a situation where indirect taxation is missing. In this case
optimal labour income tax is much higher than the capital tax rate, which does not even move
above the boundary line, i.e. 0.1. This is in line with the inverse elasticity rule. In this model
the compensated elasticity of labour supply is 0.4, whereas that of capital is 1. Therefore, a
much higher optimal tax rate on labour than on capital is to be expected.
It should be noted that the optimums do not represent sharp peaks. They are top solutions
among many other solutions with similar high utilities. These high-utility solutions sometimes
involve tax combinations that are markedly different from the combination at the optimum, but
to a lesser extent, even in these cases the dominance of indirect taxation is maintained.
Notice that predicted tax rates are usually not far away from the actuals. This fairly close
concordance between actual and predicted tax rates, will help to explain later the puzzling
dominance of indirect taxation in the optimal tax mix. The next three scenarios, covering three-
tax models, further reinforce the conjecture about the dominance of indirect taxation.
Table 2
Numerical results with two-part LES
No of
taxes
Elasticity
of capital
% public
goods
Indirect
tax
Labour
tax
Capital
tax
Change
welfare
2 0 Actual 0.64, 1.28 0.10
Predicted 0.74 -
% Share 87 13
Compared with 40% uniform tax % change 5 6 2
2 1 10 Actual - 0.48 0.10
Predicted - 0.60 -
% Share - 92 8
Compared with 40% uniform tax % change 4 -11 54 3
3 1 0 Actual 0.55, 1.10 0.10 0.11
Predicted 0.0.81 -
% Share 84 11 5
Compared with 40% uniform tax % change 18 1 66 13
Source: Own calculations using the program in Revesz (2019).
So far, the discussion was concerned with ordinary LES covering 9 goods. In earlier
work (see Revesz, 2014, p. 18; 2018, p. 39) LES was divided into two functions with different
parameters, one for low wage earners the other for high wage earners. The two-part LES models
cover 18 goods, 9 for necessities and 9 for luxuries. This arrangement was needed in order to
open the way to non-linear Engel curves with LES. Notice that because of the introduction of
9 additional goods, the utility function of two-part LES is not the same as that of ordinary LES.
-
12
Some simulations were done with two-part LES. In was assumed in these simulations
that the tax rate on the nine luxuries is twice the tax rate on the nine necessities. This way, still
only one consumption tax had to be optimised, the second one was just consequential. As shown
in Table 2, the pattern of results with two-part LES turned out to be very similar to ordinary
LES. The introduction of the luxury tax made indirect taxation even more attractive, hence the
dominance of indirect taxation has increased slightly.
An explanation for the dominance of indirect taxation
In all models that included indirect taxation, it emerged as the dominant source of
revenue. This section will try to explain this phenomenon using the modified inverse elasticity
rules, described in eqs. (7), (9) and (12).
As noted earlier, there is reasonably good agreement between predicted values and
actual values, hence the inverse elasticity rules may help to gain a better understanding of the
mechanics driving the puzzling outcomes. These explanations will look not only at variables,
but also at some of the estimated numbers behind them.
Looking at the modified inverse elasticity rules in (7), (9) and (12) perhaps the most
striking feature is the large difference in the structure of predictive equation for 𝜏𝐶 in (12)
compared with the predictive equations for 𝜏𝐾 in (7) and for 𝜏𝐿 in (9). Not surprisingly, this difference will provide a major clue for the explanation.
In eqs. (7) and (9) the 𝜏𝐶 term is multiplied by 𝜕𝑐
𝜕𝑦. From (4):
𝜕𝑐
𝜕𝑦 = ∑ 𝑝𝑗
𝜕𝑞𝑗
𝜕𝑦𝑗
= 1 + 𝑤𝜕𝐿
𝜕𝑦+ 𝑟
𝜕𝐾
𝜕𝑦
The “income term”, 𝑤𝜕𝐿
𝜕𝑦 , in the program stays constant at -0.24. The interest rate, r, stays
constant at 0.07 and the capital-income ratio 𝜕𝐾
𝜕𝑦 is given as 3. Thus, in numerical terms
𝜕𝑐
𝜕𝑦 = 1
- 0.24 + 0.21 = 0.97. Hence, in the capital and labour formulas negative 𝜏𝐶 is multiplied by nearly 1. Assuming a 20% tax on labour and capital, the other terms inside the brackets are
small: 𝜏𝐿𝑤𝜕𝐿
𝜕𝑦= 0.2 ∗ (−0.24) = −0.048 and 𝜏𝐾𝑟
𝜕𝐾
𝜕𝑚 = 0.2*0.07*3 = 0.042
The situation is markedly different in the consumption tax equation (12). Here negative
𝜏𝐶 is multiplied by the labour cross-elasticity term, 𝜀�̅�𝐿, which is close to the negative of ∈̅𝐿 , according to (11b). In the present model 𝜀�̅�𝐿 ranges between -0.1 and -0.3, depending on the scenario. Hence, there is a situation where 𝜏𝐶 is multiplied by nearly 1 in the labour and capital tax equations, and by a negative number close to zero in the consumption tax equation.
That means that in the labour and capital equations 𝜏𝐶 has a strong negative effect in reducing optimal tax rates, whereas in the consumption tax equation 𝜏𝐶 has a small positive effect. This appears to be the main reason for high optimal tax rates on consumption and low tax rates on
capital and labour.
-
13
This conjecture is supported by another observation that contradicts a narrow
interpretation of the modified inverse elasticity rules. The compensated consumption elasticity
𝜀�̅�(𝑢0) lies in the range -0.8 to -0.85, depending on the scenario. By contrast, the compensated elasticity of labour 𝜀�̅�(𝑢0) lies in the range of 0.4 to 0.48. From a straight interpretation of the inverse elasticity rules, that implies that consumption should be taxed at a lower rate than labour
income. But in practice exactly the opposite happens, apparently because of the different roles
of 𝜏𝐶 in these predictive equations. This explanation implies that in tax mix models, optimal tax rates might be substantially different from those in non-mix models.
Some policy implications
At this stage the question arises: how do tax mixes look like in the real world? According
to statistics presented by Martinez-Vazquez et al. (2009, p. 5), in 2005 the average worldwide
direct-indirect tax ratio stood at around 0.95. However, there are large variations between
countries. The average for developed countries was around 2.05 while for developing countries
it was 0.7. Developing countries rely much less on direct taxation, because they lack adequate
accounting capabilities. Also, social security contributions, which are classified as direct tax,
are much less important in the developing world.
It appears that little change has occurred in recent years in Organisation for Economic
Cooperation and Development (OECD) countries. OECD (2019, Figure 3.20) reports that the
average standard VAT rate in OECD countries has increased from 18% in 2000 to 19.3% in
2019.
Fitoussi (2005) nominates two main reasons against shifting from income tax to
consumption tax. First such a move may increase, at least temporarily, inflationary pressures
and related disequilibrium problems. Second, such a move is likely to reduce tax progressivity.
The first point seems realistic, even if the tax reform is designed to be revenue neutral. The
second point is more debatable. Even if the current indirect taxation system is regressive, a
reform could be highly progressive, particularly if it involves taxing at higher rates luxury goods
and imposing higher property rates on luxury housing. Such a reform appears administratively
feasible in the computer age, though its political acceptability may be in doubt. All in all, apart
from possible inflationary pressures and adjustment problems, there seems to be no strong
economic reason against replacing part of income tax by progressive indirect taxation.
Summary and conclusions
This paper presented analytical and computational results in relation to a static three-
tax model of labour, capital and indirect taxation. Perhaps, the most significant finding in this
paper, is the dominant role of indirect taxation in the optimal tax mix. It is possible that this
result was partly due to some unique characteristics of the LES utility function.
Although the numerical findings are also supported by some analytical results, there is
a need for further research, to test the validity of the conjecture about the dominance of indirect
taxation in other models, under different utility function and capital supply specifications. The
explicit utility-demand functions that could be tested apart from LES include: CES, AIDS,
NLPS and direct and indirect addilog.
-
14
Another interesting topic in future numerical research could be, the estimated welfare
losses involved in adopting a “politically acceptable” solution, rather than the mathematically
optimal solution. The presence of many sub-optimal solutions with a similar utility level to the
optimum, suggests that it may be unwise to concentrate exclusively on the mathematical
optimum.
Some attention was given to approximate formulas for optimal linear capital and labour
income taxes, titled the modified inverse elasticity rule. These formulas were used to examine
the optimal linear taxation of capital and labour. They were helpful in explaining the puzzling
numerical results on the dominance of indirect taxation.
The incorporation of capital taxation does not appear to make much difference in the
results. Even in a two-tax model without capital taxation, there is a strong predilection in favour
of indirect taxation. But in this context, let us not forget that low capital taxation is politically
acceptable only in societies that are not strongly averse to inherited wealth or some other socio-
political aspects of high capital concentration, as implicitly assumed in the present model.
It is interesting to note that the modified inverse elasticity rule is an important element
of optimal taxation not only in respect to capital and labour, as indicated earlier, but also in the
field commodity taxation. Originally, the inverse elasticity rule was proposed by Ramsey
(1927) as an efficient method to generate a given amount of tax revenue through commodity
taxation. Apparently, the modified version of it, which takes into account also equity
considerations and the presence of other taxes, is applicable to all major forms of taxation.
However, as indicated in the explanation about the dominance of indirect taxation, the inverse
elasticity rule is not always the most important factor in determining optimal taxes. In a tax mix
model, the size and configuration of other taxes is sometimes more important.
So far, comprehensive numerical modelling of the tax mix has been missing from the
optimal taxation literature. Hopefully, the tantalising results from the present model, will
stimulate further analytical and computational research.
References
Arnold, J. M., Brys, B., Heady, C., Johansson, A., Schwellnus, C. and Vartia, L. (2011), “Tax
policy for economic recovery and growth”, The Economic Journal, 121 (550): F59-F80.
Atkinson, A. B. and J. E. Stiglitz (1976), “The design of tax structure: direct versus indirect
taxation”, Journal of Public Economics, 6: 55-75.
Bastani, S. and D. Waldenström (2018), How Should Capital be Taxed? Theory and Evidence
from Sweden, CESifo Working Paper No. 7004,
https://cepr.org/sites/default/files/news/FreeDP_Jan31.pdf
Conesa, J., Sagiri K. and D. Krueger (2009), ̋ Taxing capital? Not a bad idea after all˝, American
Economic Review, 99(1): 25-48.
Deaton, A. and Muellbauer, J. (1980), Economics and Consumer Behavior, Cambridge
University Press, New-York
Diamond, P. E. (1999), Taxation, Incomplete Markets, and Social Security, Munich Lectures
in Economics, The MIT Press, Cambridge, MA.
https://cepr.org/sites/default/files/news/FreeDP_Jan31.pdf
-
15
European Commission (2013) “Tax reforms in EU member states 2013”, European Economy
https://ec.europa.eu/economy_finance/publications/european_economy/2013/ee5_en.h
tm
Fitoussi, J.P. (2005), Shifting The Tax Burden: From Direct To Indirect Tax, HAL Archives-
Ouverters, France, https://hal-sciencespo.archives-ouvertes.fr/hal-00972763/document
Golosov, M. A., Tsyvinski, A. and I. Werning (2006), ˝New dynamic public finance: a user´s
guide˝, in NBER Macroeconomics Annual 2006, vol. 21: 317-388.
Hines, J. R. (1998), Three Sides of Harberger Triangles, NBER Working Paper 6852,
Cambridge, MA.
Hoy, M., Livernois, J., McKenna, C., Rees, R. and T. Stengos (2011), Mathematics for
Economics, The MIT Press, Cambridge, MA.
Hum D. and W. Simpson (1994), ˝Labour supply estimation and public policy˝, Journal of
Economic Surveys, 8(1): 59-81.
International Monetary Fund (IMF) (2014), Fiscal Policy and Income Inequality, IMF Policy
Paper 23. Washington DC.
https://www.imf.org/external/np/pp/eng/2014/012314.pdf
Krueger, D. and A. Ludwig (2018), Optimal Taxes on Capital in the OLG Model with
Uninsurable Idiosyncratic Income Risks, NBER Working Paper No. 24335, Cambridge,
MA
Martinez-Vazquez, J., Vulovic, V. and Y. Liu (2009), Direct versus Indirect Taxation: Trends,
Theory and Economic Significance, Andrew Young School of Policy Studies, Georgia
State University, https://ideas.repec.org/p/ays/ispwps/paper1014.html
Mirrlees, J. A. (1976), “Optimal tax theory: a synthesis”, Journal of Public Economics, 7: 327-
358.
Organisation for Economic Co-operation and Development (OECD) (2019), Tax Policy
Reforms 2019, www.oecd.org/tax/tax-policy-reforms-26173433.htm
Piketty, T. and E. Saez (2012), The Theory of Optimal Capital Taxation, NBER Working Paper
17989, Cambridge, MA.
Piketty, T. and E. Saez (2013), ˝The theory of optimal inheritance taxation˝, Econometrica,
82(4): 1241-1272.
Ramsey F. P. (1927), ˝A contribution to the theory of taxation˝, Economic Journal, 37(145):
47- 61.
Revesz, J. T. (1997) “Uniform versus non-uniform indirect taxation – some numerical results”,
Public Finance/Finances Publiques, 52: 210-234.
Revesz, J. T. (2014), “A numerical model of optimal differentiated indirect taxation”, Hacienda
Pública Española/Review of Public Economics, 211– (4/2014): 9–66.
Revesz, J. T. (2018), Some contributions to the theory of optimal indirect taxation, Editorial
Academica Espanola, Spain. Also available from Amazon.com
Revesz, J. T. (2019), ‘grid1’ - A Program to Find the Optimum in a Tax Mix Model, This
computer program and associated user’s guide are located in the website:
http://www.john1revesz.com
https://ec.europa.eu/economy_finance/publications/european_economy/2013/ee5_en.htmhttps://ec.europa.eu/economy_finance/publications/european_economy/2013/ee5_en.htmhttps://hal-sciencespo.archives-ouvertes.fr/hal-00972763/documenthttps://www.imf.org/external/np/pp/eng/2014/012314.pdfhttps://ideas.repec.org/p/ays/ispwps/paper1014.htmlhttp://www.oecd.org/tax/tax-policy-reforms-26173433.htmhttp://www.john1revesz.com/
-
16
Saez, E. and S. Stantcheva (2018), ˝A simpler theory of optimal capital taxation˝, Journal of
Public Economics, 162, pp. 120-142. Available online:
https://doi.org/10.1016/j.jpubeco.2017.10.004
Thomas, R. L. (1987), Applied Demand Analysis, UK., Longman Group, London, United
Kingdom.
Tuomala, M. (2016), Optimal Redistributive Taxation, Oxford University Press, Oxford,
United Kingdom.
Appendix
Proof for the optimal capital income tax formula The solution for the optimal capital tax formula is based on deriving the welfare function
in (1) in respect to 𝜏𝐾 and then equalizing the sum total of resulting components to zero. There are three welfare effects to consider following a change in 𝜏𝐾: (i) the income effect on capital owners, (ii) the effect on social utility of variations in the uniform lump-sum grant due to
changes in total tax revenue, (iii) the variations in dead-weight costs that arise due to tax related
distortions in the allocation of resources. The formula developed using this approach will be
examined here in more detail.
First comes the utility effect on capital owners. Assuming 𝜏𝐾 is increased by a small amount ∆𝜏𝐾. Then the change in welfare will be:
− ∑𝜕𝑢𝑖
𝜕𝑦𝑖𝑖 𝑘𝑖𝑟∆𝜏𝐾 = − ∑
𝜕𝑢𝑖
𝜕𝑦𝑖𝑖
𝑘𝑖 ∑ 𝑘𝑖𝑖 𝑟
∑ 𝑘𝑖𝑖 ∆𝜏𝐾 = − ∑ �̅�𝐾𝐾𝑖 𝑟∆𝜏𝐾 (A.1)
where �̅�𝐾 is the capital income weighted average social marginal utility of income of capital owners.
The change in welfare due to a variation in tax revenue is much more complicated than
that. Assuming fixed public goods expenditure requirements (G), all the tax generated above G
will be redistributed, by providing a uniform lump-sum grant (b) to everyone (Hb). The average
social marginal utility of income of this uniform grant will be: ∑𝑔𝑖
𝐻𝑖= �̅�𝑏, (A.2)
This is the unweighted average of the social marginal utilities of everyone.
Total tax revenue is given as:
𝑅 = ∑ (𝜏𝐾𝑘𝑖𝑟 +𝑖 𝜏𝐿𝑤𝐿𝑖 + 𝜏𝐶𝑐𝑖) (A.3)
where ci represents total consumption (in producer prices) of taxpayer i. The variation in total
tax revenue following ∆𝜏𝐾 change in the tax rate can be divided into two parts. First the direct
variation in capital income tax revenue: 𝜕𝑅1
𝜕𝜏𝐾∆𝜏𝐾 = ∑ (𝑘𝑖𝑟𝑖 − 𝜏𝐾
𝜕𝑘𝑖
𝜕(1−𝜏𝐾)𝑟)∆𝜏𝐾 (A.4)
Note, 𝜕𝑘𝑖
𝜕𝜏𝐾= −
𝜕𝑘𝑖
𝜕(1−𝜏𝐾)
https://doi.org/10.1016/j.jpubeco.2017.10.004
-
17
Second, the change in the tax rate will cause an oppositely signed change in consumer
income, amounting approximately to − ∑ 𝑘𝑖𝑟∆𝜏𝐾𝑖 . This change in household income will cause a change in consumption tax and labour income tax revenue, as shown in (A.5).
𝜕𝑅2
𝜕𝜏𝐾∆𝜏𝐾 = ∑ 𝑘𝑖𝑟𝑖 (− 𝜏𝐶
𝜕𝑐
𝜕𝑦− 𝜏𝐿
𝑤𝜕𝐿𝑖
𝜕𝑦𝑖)∆𝜏𝐾 (A.5)
Combining the two revenue effects leads to:
𝜕𝑅
𝜕𝜏𝐾∆𝜏𝐾 = 𝐾𝑟(1 − 𝜏𝐶
𝜕𝑐
𝜕𝑦− 𝜏𝐿
𝑤𝜕𝐿
𝜕𝑦)∆𝜏𝐾 − 𝜏𝐾
𝜕𝐾
𝜕(1−𝜏𝐾)𝑟∆𝜏𝐾 (A.6)
𝜕𝑐
𝜕𝑦 is defined from (4).
From definition (6), the derivative 𝜕𝐾𝑟
𝜕(1−𝜏𝐾) = 𝑒𝐾
𝐾𝑟
(1−𝜏𝐾) (A.7)
Replacing the derivative by the elasticity term in (A.6), we obtain:
∂R
∂τK∆τK = Kr (1- τC
∂c
∂y- τL
w∂L
∂y) ∆τK - τK𝑒𝐾
Kr
(1-τK)∆τK (A.8)
Combining (A.2) and (A.8), the change in social welfare due to the change in total tax
revenue will be: ∆𝑅 = �̅�𝑏[Kr (1- τC∂c
∂y- τL
w∂L
∂y) ∆τK - τK𝑒𝐾
Kr
(1-τK)∆τK] (A.9)
The third component of welfare change is related to dead-weight loss (D), defined in
eq. (1b). This is redefined in (A.10).
𝐷(𝐾) = − 𝜏𝐾
2(
𝜕𝐾𝑟
𝜕𝜏𝐾𝜏𝐾) = −
𝜏𝐾2
2
𝜕𝐾𝑟
𝜕𝜏𝐾≈
𝜏𝐾2
2𝑒𝐾
𝐾𝑟
(1−𝜏𝐾) (A.10)
Note, the third expression is only valid if the elasticity of capital supply stays at the
constant rate, 𝑒𝐾, along the domain from zero to 𝜏𝐾. If this condition is not satisfied, one can only talk about an approximation. The approximate change in the social welfare value of dead-
weight costs following ∆𝜏𝐾 variation in the tax rate will be then:
∆𝐷(𝐾) = �̅�𝐻𝜕𝐷
𝜕(1−𝜏𝐾)∆𝜏𝐾 = −
�̅�𝐻
2𝑒𝑘𝑟
𝜕(𝜏𝐾
2 𝐾(1−𝜏𝐾)
⁄ )
𝜕𝜏𝐾∆𝜏𝐾 =
= - �̅�𝐻Kr𝑒𝐾𝜏𝐾
(1− 𝜏𝐾){
1
(1− 𝜏𝐾)−
𝜏𝐾
2(1− 𝜏𝐾)(1 + 𝑒𝐾)} ∆𝜏𝐾 (A.11)
𝑔𝐻 represents the income weighted average social marginal utility of income of the entire population. In the absence of more detailed information, we assume that variations in dead-
weight costs are distributed in proportion to the marginal utilities of incomes of consumers.
We can now combine the three welfare variations from (A.1), (A.9) and (A.11) to yield
𝜕𝑈
𝜕𝜏𝐾∆𝜏𝐾 = −�̅�𝑘𝐾𝑟∆𝜏𝐾 + �̅�𝑏 [Kr
1
(1− 𝜏𝐾)( 1 – τC
∂c
∂y – τL
w∂L
∂y) ∆τK] −
− �̅�𝐻𝑒𝐾𝐾𝑟𝜏𝐾
(1− 𝜏𝐾){
1
(1− 𝜏𝐾)−
𝜏𝐾
2(1− 𝜏𝐾)(1 + 𝑒𝐾)} ∆𝜏𝐾 (A.12)
In order to be at the optimum, the terms on the right-hand side must add up to zero.
Eliminating ∆𝜏𝐾 from both sides, dividing all terms by Kr and setting the left-hand side to zero, we have:
0 = −�̅�𝑘 + �̅�𝑏 ( 1 – τC∂c
∂y – τL
w∂L
∂y) − �̅�𝐻𝑒𝐾
𝜏𝐾
(1− 𝜏𝐾){
1
(1− 𝜏𝐾)−
𝜏𝐾
2(1− 𝜏𝐾)(1 + 𝑒𝐾)} (A.13)
-
18
Transferring the last term to the left-hand side and dividing all terms by �̅�𝐻𝑒𝐾, we get:
𝜏𝐾
(1−𝜏𝐾){
1
(1− 𝜏𝐾)−
𝜏𝐾
2(1− 𝜏𝐾)(1 + 𝑒𝐾)} = −
�̅�𝑘
�̅�𝐻𝑒𝐾+
�̅�𝑏
�̅�𝐻𝑒𝐾 ( 1 - τC
∂c
∂y - τL
w∂L
∂y) (A.14)
Or in a more explicit form:
𝜏𝐾
(1−𝜏𝐾)=
− �̅�𝑘�̅�𝐻
+ �̅�𝑏�̅�𝐻
( 1 − 𝜏𝐶𝜕𝑐
𝜕𝑦 − 𝜏𝐿
𝑤𝜕𝐿
𝜕𝑦)
𝑒𝐾{1
(1− 𝜏𝐾) −
𝜏𝐾2(1− 𝜏𝐾)
(1+ 𝑒𝐾)} (A.15)
This is the approximation for the optimal linear capital income tax rate. It will be
referred to it as the modified inverse elasticity rule for capital taxation.
It is of some interest to compare formula (A.15) with the one presented by Saez and
Stantcheva (2018). The steady-state tax formulas presented in their work are derived from the
following dynamic social welfare function for consumer i:
Vi = 𝛿𝑖 ∫ [𝑐𝑖(𝑡) + 𝑎𝑖(𝑘𝑖(𝑡)) − ℎ𝑖(𝑧𝑖(𝑡))]∞
0𝑒−𝑡𝛿𝑖𝑑𝑡 (A.16)
where 𝛿𝑖 is the time discount rate. In the steady-state solution time discounting disappears, and the problem becomes one of maximizing ∑ 𝑔𝑖[𝑐𝑖 + 𝑎𝑖(𝑘𝑖) − ℎ𝑖(𝑧𝑖)]𝑖 subject to the budget constraint 𝑐𝑖 = 𝑟𝑘𝑖 + 𝑧𝑖 − 𝑇(𝑘𝑖𝑟, 𝑧𝑖), where 𝑇(𝑘𝑖𝑟, 𝑧𝑖) is the combined tax (or support grant) to person i on capital and labour income.
Notice that the social welfare term is strongly separable between consumption and labour
income, is linear in consumption and contains a separate term, 𝑎𝑖(𝑘𝑖(𝑡)), to represent the
intrinsic utility derived from holding wealth. By contrast the utility function defined in (1) does
not aggregate consumption into a single variable, does not assume strong separability between
consumption and labour income, is not linear in consumption, and there is no intrinsic utility
from holding wealth. Wealth only contributes to utility by virtue of generating income.
Moreover, the present model is a static model that does not assume steady-state conditions, as
defined in dynamic growth models. The model of Saez and Stantcheva (2018) excludes ’income
effect’ by virtue of the utility function chosen. By contrast, in the numerical work discussed
here, 𝑤𝜕𝐿𝑖
𝜕𝑦𝑖 is set to -0.24.
From the definition of utility in (A.16), they obtain the following steady-state linear
capital income tax equation: 𝜏𝐾 = 1−�̅�𝑘
1− �̅�𝑘+ 𝑒𝐾 (A.17)
The definitions of �̅�𝑘 and 𝑒𝐾 in (A.17) are exactly the same as in (A.15). The differences between the formulas are mainly due to the dissimilarities between the utility
functions in (1) and (A.16). The presence of indirect taxes in the present model is another
reason. Despite differences in appearance, in polar cases the two formulas lead to similar
results.
When 𝑒𝐾 is very high, both (A.15) and (A.17) suggest that optimal 𝜏𝐾 will approach zero. When 𝑒𝐾 approaches zero, then 𝜏𝐾 in both (A.15) and (A.17) will converge to one, which is the highest permissible value.
When the social marginal utility of all income groups is the same (absence of
distributional objectives) then according to (A.17) �̅�𝑘 equals to 1 and optimal 𝜏𝐾 will be zero. In this case, (A.15) is reduced to the equation:
-
19
𝜏𝐾
(1−𝜏𝐾)=
− 1 + ( 1 − 𝜏𝐶𝜕𝑐
𝜕𝑦 − 𝜏𝐿
𝑤𝜕𝐿𝑖𝜕𝑦𝑖
)
𝑒𝐾{1
(1− 𝜏𝐾) −
𝜏𝐾2(1− 𝜏𝐾)
(1+ 𝑒𝐾)} (A.18)
Provided G = 0, and 𝜏𝑐 and 𝜏𝐿 are zero, then 𝜏𝐾 = 0, in line with (A.17).
When the social marginal utility of capital owners approaches zero, then according to
(A.17) 𝜏𝐾 = 1
(1 + 𝑒𝐾)⁄ . This is the revenue maximizing rate. Eq. (A.15) is reduced to a
simpler form presented in (A.19):
τK
(1-τK)≈
g̅bg̅H
( 1 - τC∂c
∂y - τL
w∂Li∂yi
)
eK (A.19)
Denoting �̅�𝑏
�̅�𝐻 𝑎𝑠 �̅�𝑏𝐻, when 𝜏𝐶 and 𝜏𝐿 are both zero, then (A.19) is reduced to:
𝜏𝐾
(1−𝜏𝐾)≈
𝑔𝑏ℎ
𝑒𝐾 (A.20)
�̅�𝑏𝐻 is the ratio between the average social marginal utility of demogrant recipients and that of the entire population. The relationship between the two is discussed in Revesz (2014, pp. 21-
22). Adopting an estimated value between 1.1 and 1.6 appears realistic. Numerical
comparisons, using plausible values for �̅�𝑏𝐻 between 1.1 and 1.6 and for 𝑒𝐾 between 0.5 and 2, indicate that optimal 𝜏𝐾 from (A.17) and from (A.20) tend to be fairly close. The similarity of polar outcomes from the present model and the static steady-state model of Saez and
Stantcheva (2018), suggests that the two models are not far apart.
User’s guide to the computer program ‘grid1’
This program carries out optimisation of linear commodity, labour and capital income
tax rates. We used the “grid” method to find the optimum, which means that it examined all
possible combinations of linear taxes. It yielded the numerical results discussed in the paper
titled “A model of the optimal tax mix including capital taxation”. The user can specify values
such as utility function parameters, wage distribution and the inequality aversion of society.
The original program was developed more than 20 years ago and was used in publications
dealing with optimal commodity taxation. (Revesz, 1997, 2014, 2018). The current version of
the program focuses only on the optimal mix of three linear taxes. It contains in addition to
commodity taxation, the optimisation of labour and capital income taxes. Prompts enable the
user to choose any combination of these three broad categories of taxes.
-
20
Program installation and running
This program, located on the john1revesz.com website, is written in a programming
language called QBasic, which two decades ago was part of Microsoft Windows. The
program is written as a “source code” in a Word file, without being converted into machine
language, as are most computer programs sold on the market. A special compiler is needed to
convert it into machine code. A QBasic compiler for 64 bit machines is available free of
charge from the following website: http://www.qb64.net/ . Note, the QBasic compiler will
accept the source code only in Word format.
Having a QBasic compiler, the installation is simple. Open the file named qb64 and
then Edit and Paste the entire content of the ‘grid’ Word file into the QBasic screen. Next,
click Run and then Start. Less than two minutes later the first prompt will appear on the
screen. The prompts present a menu of options. This menu provides access to various options
such as type of utility function, inequality aversion rate, the selection of taxes to be optimised,
the choice of ordinary or two-part LES, the parameter of constant elasticity capital, as well as
the size of expenditure on public goods as a percentage of total income.
After you finished with the prompts, processing will start automatically and will finish
in less than an hour. The output file, called ‘tax2.txt’, will be located in the same folder as the
QBasic compiler. In case you encounter any problem when responding to prompts, the
quickest way to overcome it is to exit and start again. Click on the cross at the upper right
hand corner of the QBasic screen. After the screen disappears, start running the program
again.
Finally, all the library files associated with the compiler should be kept in the same
folder as the compiler file. Because the program is a source code in Word format, it could be
corrupted accidentally or during data entry. For that reason, it is advisable to keep a couple of
copies in a separate folder.
DATA lines
Virtually all parameters related to consumer demand and labour supply can be
changed by the user, as well as data related to income distribution. All the numbers in the
DATA lines close to the beginning of the source code can be altered. Above the DATA lines
are text lines (marked at the left by asterisk to indicate that it is a comment and not a program
line) explaining what the DATA lines refer to. These include initial wage rates, initial capital
incomes and LES utility-demand parameters. After reading the paper, most of these
parameters are self-explanatory.
Usually there are a number of DATA lines for each item. Those marked with an
asterisk at the beginning of the line are ignored by the program. Only the unmarked DATA
line is currently active. Considerable care must be exercised when entering new numbers. If
the number of entries in the DATA line does not correspond to the number of READs
specified underneath, then the program will be corrupted without warning. If there are too few
entries, the READ instruction will assume that the missing numbers are zero. If there are too
many entries, the surplus numbers will be picked up by the following READ instruction
relating to a different variable or parameter. As a result, the output from the program will be
meaningless.
http://www.john1revesz.com/http://www.qb64.net/
-
21
Please check the parameter summary tables appearing at the beginning of the printout,
to ensure that correct numbers are used by the program. Always make sure that there is only
one DATA line without an asterisk for each variable.
Note, every time you change DATA lines on the ‘grid1’ Word file, you have to copy
and paste the file again into the QBasic screen
Utility and demand parameters
The current version of the program enables the user to specify LES utility-demand
parameters according to his/her choice, for the two wage groups in the model. The relevant
DATA lines for 9 goods and 18 goods utility-demand appear close to the beginning of the
source code. The 9 goods utility function is allocated to the 8 lower wage earners, the 18
goods utility function is allocated to the 7 higher wage earners. The leisure parameters are
defined under them, through equations with numbers on the right hand side. The program
checks that the betas of commodities and leisure add up to one. If this condition is not
satisfied, a warning will appear at the very start of the printout and program execution will
terminate. To ensure the global quasi-concavity of LES utility, all betas must be positive.
Labour and capital income tax schedules
The constant elasticity of capital supply is entered through a prompt. No such prompt
appears for labour supply, because the compensated elasticity of labour supply is determined
by the program for each taxpayer, according to his/her wage rate, lump-sum income and
relevant LES parameters, specified in DATA lines.
Inequality aversion
The program contains two social welfare functions to describe the inequality aversion
of society, one is LES utility (a logarithmic function) the other is maximin. Maximin
considers only the utility level of the lowest wage group. It leads to very high optimal tax
rates. Even LES leads to fairly high tax rates. To overcome this problem, an inequality
aversion rate (z) was introduced, as explained in Revesz (2018, Appendix 3.3 section C.4).
The inequality aversion rate ranges between zero and one. Zero represents the absence of
distributional objectives, one represents LES utility. Any number between zero and one is a
convex combination of the two. The user can choose this rate in the second prompt.
Sometimes when z is low, the search for the optimum does not converge properly. In these
cases an error message is issued, and you should try a somewhat higher value of z.