a model of the optimal tax mix including capital...

1

Draft Paper

A model of the optimal tax mix including capital

taxation

Abstract

This paper uses a three-tax linear model to study numerically the optimal tax mix. The

three aggregate taxes relate to labour, capital and commodities respectively. The most striking

result is that with a linear expenditure system (LES) based on econometric estimates, indirect

(i.e. commodity) taxation dominates the optimal tax mix, with over 80% of tax revenue coming

from indirect taxes. Although some analytical justification is presented for this surprising result,

it should be tested numerically using other econometrically estimated utility-demand systems.

Comprehensive computational modelling of the optimal tax mix has been so far missing from

the literature. Considering the results presented here, there seems to be considerable scope for

further research in this area. From a theoretical perspective, a major innovation in this paper is

the introduction of capital taxation into a static tax mix model. The analytical part of the paper

provides approximate predictions for optimal linear tax rates (including capital tax) called the

modified inverse elasticity rule. Using these formulas, it is possible to provide some

explanations for the dominance of indirect taxation in the optimal tax mix.

keywords: Optimal taxation, capital taxation, income taxation, indirect taxation

JEL Classification: H21, H23, H26, C63

Introduction

This paper uses a three-tax linear model to study numerically the optimal tax mix. The

three aggregate linear taxes relate to labour, capital and commodities respectively. The most

striking result is that with a linear expenditure system (LES) based on econometric estimates,

indirect (i.e. commodity) taxation dominates the optimal tax mix, with over 80% of tax revenue

coming from indirect taxes. From a theoretical perspective, a major innovation in this paper is

the introduction of capital taxation into a static tax mix model. The analytical part of the paper

provides approximate predictions for optimal linear tax rates (including capital tax), called the

modified inverse elasticity rules. Using these formulas, it is possible to examine the reasons for

the dominance of optimal indirect taxation.

The three-tax model studied here is a fairly comprehensive one, incorporating labour,

capital and indirect taxes. It is assumed that there is only one linear labour income tax rate,

denoted 𝜏𝐿 , and a single capital income tax rate, denoted 𝜏𝑘. Labour income is denoted zi = wiLi, where wi is the fixed wage rate for household i and Li is the corresponding labour supply.

Capital income is capital (k) multiplied by a fixed average pre-tax real rate of return (r), that is

kr. r and wi are assumed to be fixed. The net after-tax rate of return is 𝑟(1 − 𝜏𝑘). ̀ r´ is measured after taking away depreciation and the remuneration of self-employed.

2

It is assumed that capital income is taxed separately from labour income. 𝜏𝑐 is a uniform proportional indirect tax rate on all commodities, 𝑝𝑗 are constant producer prices and 𝑞𝑗 are

quantities. Total consumption is: C= ∑ 𝑝𝑗𝑞𝑗𝑗 .

This model treats capital as merely an income generating tool, which means that unlike

commodities and leisure, it does not enter directly into the social welfare function (U). It is

assumed that there is only one type of capital (k), with a uniform rate of return (r). However,

wages and commodity prices can be heterogeneous. Then the linear model is given as:

maximize U = ∑ 𝑢𝑖(𝐪𝒊, 𝐿𝑖)𝑖 (1)

s.t. the budget constraint ∑ (𝜏𝐾𝑖 𝑘𝑖𝑟 + 𝜏𝐿𝑤𝑖𝐿𝑖 + 𝜏𝐶 ∑ 𝑝𝑗𝑞𝑗𝑖 𝑗 ) ≥ 𝐻𝑏 + 𝐺 (1a)

Income redistribution is carried out by providing a uniform demogrant (b) to every member of

the population. G is fixed public goods expenditure requirement. H is total population. In the

numerical work a linear expenditure (LES) utility function was used.1 Note that capital is

missing from the social welfare function in (1) but enters constraints (1a).

Aggregate capital supply is an increasing function of the after-tax rate of return to

capital, 𝑟(1 − 𝜏𝐾), and total real income M.

Symbolically: 𝐾 = 𝜑(𝑟(1 − 𝜏𝐾), 𝑀) (2)

A specific form of this function will be presented in eq. (18).

By definition:

rK + wL = 𝑀 = ∑ 𝑝𝑗𝑞𝑗𝑗 + 𝐺 = 𝐶 + 𝐺 (3)

In these equations: K is the aggregator of capital stock; L is the aggregator of labour supply. C

is total consumption. 𝑀 is price-indexed total income.

Defining lump-sum income as y = rK, then from (3):

𝜕𝐶

𝜕𝑦 = ∑ 𝑝

𝑗

𝜕𝑞𝑗

𝜕𝑦𝑗= 1 + 𝑤

𝜕𝐿

𝜕𝑦+ 𝑟

𝜕𝐾

𝜕𝑦 (4)

The social marginal utility of income is: 𝑔𝑖 = 𝜕𝑢𝑖

𝜕𝑚𝑖 =

𝜕𝑢𝑖

𝜕𝑦 (5)

Essentially, 𝑔𝑖 reflects political value judgments concerning income distribution. The relevant utility transformation parameter is referred to in the literature as the inequality aversion rate.

In the present model an inequality aversion rate of one corresponds to LES utility. A rate of

zero corresponds to equal marginal utilities for everyone, i.e. the absence of distributional

objectives. A rate between zero and one is a convex combination of the two.

1 This model is not zero homogeneous due to the presence of an endogenously determined lump-sum grant

(b). The indeterminacy of the optimum with zero homogeneous utility is discussed in Mirrlees (1976, pp. 330-

332)

3

The model described above, excluding capital taxation, is close to the standard income-

commodity tax models in the optimal taxation literature. Like other optimal taxation models, it

can highlight some aspects of the equity-efficiency trade-off. However, it has severe limitations

due to its high level of abstraction. First, population is heterogenous not only due to differences

in wage rates and capital incomes, but because of other features as well. Perhaps the most

important among these is household composition. Also, political preferences are only partly

captured by the single inequality aversion rate and the uniform demogrant. Distributional

preferences cover complex political considerations in relation to incomes, household

composition, place of residence, source of income, and with support grants (b), also in respect

to the provision of education and health services. Altogether, optimal taxation models, like the

present one, can offer some insights, but only a limited guide to policy.

The paper is structured as follows: The next section presents a literature review on the

subject. It is followed by approximate formulas for optimal linear capital, labour and

commodity tax rates, based on the specifications described in eq. (1). Details of the

mathematical analysis are presented in the Online Supplemental Appendix. The next sections

examine the computational model. It starts by explaining the optimisation methodology. This

is followed by numerical results. The interior point numerical optima will be compared with

the predictions from the modified inverse elasticity formulas. Finally, some explanations will

be provided to account for the dominance of indirect taxation in the optimal tax mix.

The final section presents summary and conclusions. The main conclusion is that there

is a need for further computational research, to test the validity of the conjecture about the

dominance of indirect taxation, using other empirically based utility-demand functions.

The location of the computer program used to produce the numerical results appears in

the references under Revesz (2019). The same website also contains the user’s guide. The

interested reader can experiment with this program for checking the numerical results presented

here and for testing results under different parameters and specifications.

Related literature

It is impossible to review in this short paper the large literature pertaining to the tax mix

issue. The discussion here starts with optimal taxation theory. The optimal tax mix model

involving labour and commodity taxes has been introduced into the literature by Atkinson and

Stiglitz (1976) and Mirrlees (1976). An important result is the Atkinson-Stiglitz (1976)

theorem. This theorem states that provided income tax is the control theoretic non-linear

solution and utility is weakly separable between commodities and leisure, there is no need to

apply commodity taxation in conjunction with income tax. Income tax will be sufficient to

generate all tax revenue. This is exactly the opposite to what is coming out from the present

linear model, which suggests that under linear taxation, the bulk of tax revenue should be

generated from indirect taxation rather than income tax.

4

The Atkinson-Stiglitz theorem is based on a number of not too realistic assumptions,

such as the absence of tax evasion, the absence of administration and compliance costs, weakly

separable utility between commodities and leisure, no externalities, no paternalistic concerns

and the assumption that the control theoretic continuous and differentiable income tax function

can be implemented in practice. Because of these assumptions, the Atkinson-Stiglitz (1976)

theorem had been criticised in the literature, particularly in connection with the debate about

indirect tax uniformity. The interested reader can find comprehensive reviews in Tuomala

(2016, Chapter 12) and Revesz (2018 pp. 3-28).

Some of the above-mentioned omissions also apply in the present model, however if tax

evasion, administrative and compliance costs, leisure complements-substitutes, externalities

and paternalistic concerns would have been included, they would have further reinforced the

conclusion in favour of indirect taxation.

Another subject from the optimal taxation literature that has been taken up in the present

model is capital taxation. There is a large and disparate literature on this subject. A recent

literature review on optimal capital taxation and related topics is presented by Bastani and

Waldenström (2018). A shorter literature review is contained in Piketty and Saez (2012, pp. 1-

3). Studies that show some similarities in their mathematical framework to the present model

on capital taxation include Saez and Stantcheva (2018) and Piketty and Saez (2012, 2013).

Note, the present model is largely irrelevant to several issues raised in the capital taxation

literature. These include the treatment of contributions to retirement benefit funds (Diamond,

1999), and the role of capital taxation in alleviating the uncertainties of life (Conesa et al., 2008;

Golosov et al., 2006; Krueger and Ludwig, 2018).

Apart from the Atkinson-Stiglitz theorem and optimal capital taxation, there is little else

in the optimal taxation literature that is closely related to the subject matter of this paper.

A more relevant strand in the literature is what could be described as policy-related

empirical studies on the tax mix. These studies examine empirical observations on the direct-

indirect tax mix combined with economic reasoning and econometric estimates, particularly on

the relationship between the tax mix and economic growth. These reports include Fitoussi

(2005), Martinez-Vazquez et. al (2009, pp. 1-27), Arnold et al. (2011) and European

Commission (2013, pp. 42-51). Generally, these policy-related reports are in favour of indirect

taxation, advocating greater reliance on commodity and property taxation at the expense of

direct taxes. IMF (2014, pp. 36-43) also deals with this subject, but sees a limited role for

indirect taxation in the advisable tax mix, because of its regressivity.

The main arguments advanced against income taxation (both labour and capital) are:

• The labour and saving disincentives related to progressive income taxation, which are absent with indirect taxation.

• The lower evasion propensities of indirect taxes on goods and services produced and/or marketed through large organisations.

• Simpler administration and lower administrative and compliance costs of indirect taxes compared with direct taxes.

5

One may add to these several other factors in favour of indirect taxation, such as the

need for higher taxes to offset negative externalities, more scope for taxing/subsidising leisure

complements and substitutes, as well as more scope for accommodating paternalistic concerns.

None of these factors appear in the present model. Effectively, the argument in favour of

indirect taxation presented here is separate and additional to arguments that appeared elsewhere

in the literature. Generally, the above-mentioned empirical studies are in line with the main

conclusion of this paper, but unlike the present study, they are not based on comprehensive

mathematical modelling in the optimal taxation tradition.

The modified inverse elasticity rules

This section concentrates mainly on capital taxation, but labour and commodity taxation

will be covered as well. The present optimal capital and labour income taxation model is in

some respects a static version of the linear steady-state model developed by Saez and

Stantcheva (2018).

Regarding capital taxation some definitions and explanations are needed. Suppose we

have an international panel data spanning more than 20 years, which links capital income per

person to the average capital income tax rate. From that longitudinal and cross-country

database, it is possible to obtain through linear regression a parameter estimate for the derivative

of real (inflation adjusted) capital income per person (kr) over the average capital income tax

rate (𝜏𝑘), as well as its derivative with respect to real per capita income (m). To the best of my kmowledge, no extensive cross-country research has been carried out so far to estimate the tax

elasticity of capital, although the required data are available in many countries. Such data are

also available for more disaggregated estimates of the tax elasticity of capital supply, according

to type of investment and source of investment. For example: domestic savings, capital inflow

and outflow, foreign direct investment (FDI), investment in machinery and equipment,

bequests, corporate income, etc.

Denoting total capital as K, the following discussion will use 𝜕𝐾

𝜕(1−𝜏𝐾) instead of

𝜕𝐾

𝜏𝐾 ,

in order to produce comparable expressions to Saez and Stantcheva (2018). Assuming that the

central parameter estimates in the model are based on observations over a long period of time,

the static model may be regarded to represent a long-run model. The tax elasticity of capital

supply is defined in respect to the after-tax return to capital as:

𝑒𝐾 = (1−𝜏𝐾 )

𝐾𝑟

𝜕𝐾𝑟

𝜕(1−𝜏𝐾) (6)

Saez and Stantcheva (2018) used three estimates for 𝑒𝐾 in their numerical work. These are 0.25, 0.5 and 1. Perhaps 𝑒𝐾 = 1 is the most realistic, bearing in mind the strong international mobility of capital. In any case, in the present computational work all three estimates are used.

In equation (1a), ∑ 𝜏𝐾𝑖 𝑘𝑖𝑟 represents “real” capital tax revenue and not nominal revenue. The difference between the two is the dead-weight loss associated with raising the

capital tax rate from 0 to 𝜏𝐾. From the theory of excess burden (Hoy et al (2011), given a linear

supply curve, this amounts to the triangle 𝑟𝜕𝐾

𝜕𝜏𝐾 𝜏𝐾

2

2 . Hence, in nominal terms equation eq. (1a)

should appear as:

∑ (𝜏𝐾𝑖 𝑘𝑖𝑟 −𝑟𝜕𝐾


2

2 + 𝜏𝐿𝑤𝑖𝐿𝑖 + 𝜏𝐶 ∑ 𝑝𝑗𝑞𝑗𝑖 𝑗 ) ≥ 𝐻𝑏 + 𝐺 (1b)

6

It is applied to reduce “real” capital income in the numerical simulations. It is also used to work

out the inverse elasticity rule for capital in the Online Supplemental Appendix.

The specifications described above yield an approximate formula for the optimal

capital income tax rate. This formula can be expanded also to optimal linear labour income tax.

The approximation to the optimal linear capital and labour income tax rates follows the same

procedure described in respect to approximate optimal indirect tax rates in Revesz (2014, pp.

47-51).

The solution for the optimal capital tax formula is based on deriving the welfare function

in (1) in respect to 𝜏𝐾 and then equalizing the sum of resulting components to zero. There are three welfare effects to consider following a change in 𝜏𝐾: (i) the income effect on capital owners, (ii) the effect on social utility of variations in tax revenue and consequently in the

uniform lump-sum grant (b), (iii) the variations in dead-weight costs that arise due to tax related

distortions in the allocation of resources (based on definition (1b)).

Details of the derivations for the approximately optimal capital tax rate are presented in

the Online Supplemental Appendix. The end result, titled the modified inverse elasticity rule of

capital taxation (from A.15 in the Appendix) is:

𝜏𝐾

(1−𝜏𝐾)=

− �̅�𝑘�̅�𝐻

+ �̅�𝑏�̅�𝐻

( 1 − 𝜏𝐶𝜕𝑐

𝜕𝑦 − 𝜏𝐿

𝑤𝜕𝐿

𝜕𝑦)

𝑒𝐾{1

(1− 𝜏𝐾) −

𝜏𝐾2(1− 𝜏𝐾)

(1+ 𝑒𝐾)} (7)

The average social marginal utility of income of capital income recipients is defined as �̅�𝑘, the average for demogrant recipients is �̅�𝑏 , while the average for the entire population is �̅�𝐻. Since

everybody is receiving the demogrant �̅�𝑏 = ∑ 𝑔𝑖𝑖

𝐻⁄ , while �̅�𝐻 is the income-weighted

average: �̅�𝐻 = ∑ 𝑔𝑖𝑖 𝑚𝑖

𝑀⁄

The 𝑤𝜕𝐿𝑖

𝜕𝑦𝑖 term represents the change in labour income in response to variation in lump-

sum income. 𝜕𝑚𝑖

𝜕𝑦𝑖= 𝑤

𝜕𝐿𝑖

𝜕𝑦𝑖 is known in the labour supply literature as the income effect (see

Hum and Simpson (1994, pp. 72-74)). Its estimated value usually ranges between -0.1 to -0.4.

It is not difficult to see that the denominator on the right-hand side is usually close to

𝑒𝐾. When 𝑒𝐾 = 1 then the denominator will be 𝑒𝐾. When 𝑒𝐾 = 2 then the denominator will

be 𝑒𝐾{1 - 𝜏𝐾

2(1− 𝜏𝐾) }. When 𝑒𝐾 = 0.5 then the denominator will be: 𝑒𝐾{1 +

𝜏𝐾

4(1− 𝜏𝐾) }.

Using the methodology described in the Online Supplemental Appendix, it is possible

to generalise the inverse elasticity rule of capital taxation to linear labour income tax as well.

In this case the (compensated) elasticity of labour supply with respect to the net after-tax

return, 𝑤(1 − 𝜏𝐿), is defined as:

𝑒𝐿(u0) = (1−𝜏𝐿 )𝑤

𝑤𝐿

𝜕𝐿(u0)

𝜕(1−𝜏𝐿) . (8)

7

Econometric estimates of eL(u0) usually range between 0.15 and 0.5 (Hum and Simpson, 1994,

pp. 72-74). Higher estimates can be obtained when considering not only changes in working

hours, but also long-term responses related to occupational choice and human capital

accumulation. In the present numerical work, the mean value of eL(u0) is given between 0.40

and 0.48, depending on the scenario.

The average social marginal utility of wage earners is defined as: 𝑔𝐿 =∑ 𝑤𝑖𝐿𝑖𝑔𝑖𝑖

∑ 𝑤𝑖𝐿𝑖𝑖. The

marginal capital-income ratio is 𝜕𝐾

𝜕𝑚 , where total income m = Kr + wL. In this paper

𝜕𝐾

𝜕𝑚 =

𝜕𝐾

𝜕𝑦

defines the net change in capital stock related to a change in static income, after taking away

depreciation. Based on empirical evidence, it is around 3. Using these new variables, and

following the same steps outlined in the Online Supplemental Appendix, will lead to the

following modified inverse elasticity rule for the optimal linear labour income tax:

τL

(1-τL)=

- g̅Lg̅H

+ g̅bg̅H

( 1 - τC∂c

∂y - 𝜏𝐾𝑟

𝜕𝐾

𝜕𝑚)

𝑒𝐿(𝑢0){1

(1− 𝜏𝐿) −

𝜏𝐿

2(1− 𝜏𝐿)(1+ 𝑒𝐿(𝑢0))}

(9)

Deriving the modified inverse elasticity rule in respect to commodity taxation follows

the methodology described in Revesz (2014, pp. 47-51). Eq. (9) there gives the predicted linear

tax rate for a single composite good as:

𝜏𝐶 ≈ 1 + 𝜏𝐶 −

g̅bg̅C

(1 + 𝜏𝐶 + 𝜀�̅�𝐿𝜏𝐶)

𝜀�̅�(𝑢0) (10)

Here g̅C

represents the weighted average social marginal utility of consumers and g̅b that of

demogrant recipients. �̅�𝑐𝐿 =𝜕𝐿

𝜕(1+𝜏𝐶)

(1+𝜏𝐶)

𝐿 is the elasticity of labour supply over the cost of

consumption. Taking zero homogeneous labour supply, where b is an exogeneous variable

the following condition must be satisfied:

𝜕𝐿

𝜕(1+𝜏𝐶)𝑐 +

𝜕𝐿

𝜕(1+𝜏𝐿) 𝐿 +

𝜕𝐿

𝜕(1+𝜏𝐾) 𝐾 +

𝜕𝐿

𝜕𝑦 𝑏 = 0 (11a)

Assuming that the last two terms are relatively small, leads to the following approximation:

𝜀�̅�𝐿 = −𝜖�̅�(1+𝜏𝐶)

(1+𝜏𝐿) , (11b)

where 𝜖𝐿 = 𝜕𝐿

𝜕(1+𝜏𝐿)

(1+𝜏𝐿)

𝐿 is the uncompensated elasticity of labour supply. This approximation

opens the way for estimating the unknown 𝜀�̅�𝐿 using estimates for ∈̅𝐿 . According to empirical estimates (Hum and Simpson, 1994, pp. 72-74), ∈̅𝐿 lies in the range between +0.4 and -0.15, hence 𝜀�̅�𝐿 will be oppositely signed in a similar range. The relatively small value of 𝜀�̅�𝐿 will be of considerable importance when explaining the dominance of indirect taxation.

When τL and τK are also included in the model, they should be added to the effect of

change in labour supply on tax revenue. Consequently, in the present model a better

approximation to (10) will be:

𝜏𝐶 ≈ 1+𝜏𝐶 −

g̅b

g̅H(1+ 𝜏𝐶+ �̅�𝐶𝐿 (𝜏𝐶 + 𝜏𝐿𝑊

𝜕𝐿

𝜕𝑦 + 𝜏𝐾𝑟

𝜕𝐾

𝜕𝑦)

�̅�𝑐(𝑢0) (12)

This is the predictive formula for 𝜏𝐶 used in the numerical analysis.

8

Finally, a few words about the �̅�𝐿(𝑢0) and 𝜀�̅�(𝑢0) terms appearing in the denominators of (9) and (12). According to demand theory, the dead-weight loss of taxation is represented by

the triangle defined by the difference between the post-tax and pre-tax price multiplied by the

change in the quantity demanded. In its turn, the change in quantity is given by the change in

price multiplied by the derivative of compensated demand (not ordinary demand) (see Hines,

1998). In symbolic terms the dead-weight loss (D) following an increase in the labour tax from

zero to 𝜏𝐿 will be:

𝐷(𝐿) = 𝜏𝐿

2(

𝜕𝑊𝐿

𝜕𝜏𝐿(𝑢0)𝜏𝐿) =

𝜏𝐿2

2

𝜕𝑊𝐿

𝜕𝜏𝐿(𝑢0) ≈ −

𝜏𝐿2

2𝑒𝐿(𝑢0)

𝑊𝐿

(1−𝜏𝐿) (13)

Similarly, for consumption tax: 𝐷(𝐶) ≈ − 𝜏𝐶

2

2𝑒𝐶(𝑢0)

𝐶

(1−𝜏𝑐) (14)

where 𝑢0 designates compensated labour supply and compensated commodity demand respectively. In the case of capital supply, the dead-weight loss is defined not by a utility

function, but by constraint (1b). In this case the dead-weight loss is defined as: −𝑟𝜕𝐾


2

2 , and

there was no need to apply a compensated elasticity approach.

The framework and specifications of the numerical model

There are several utility functions, including the LES, CES, AIDS, NLPS and direct and

indirect addilog, which define explicitly not only utility, but also the corresponding demand

and labour supply functions (see Thomas, 1987) for a given set of post-tax prices and incomes.

In the present computational work Linear Expenditure System (LES) utility-demand

was used. The reason for this choice was that already much earlier, we developed a numerical

tax model dealing with the question whether indirect taxation should be uniform or otherwise

(see Revesz 1997, 2014). The original idea was to extend this model to include also the

optimisation of labour and capital income tax.

Later, it was discovered that the original optimisation method, based on “gradient”

iterations, was not well-suited to this task, hence we turned to the three-tax “grid” search

optimisation method. Nonetheless, the basic model has been retained, including the LES utility

function, 15 taxpayers and 9 or 18 commodities.

LES appears in many publications. For one, see Thomas (1987, pp. 71-74). The present

version uses instead of income (m) the earning parameters w and y, as well as total time (Z)

and leisure (L). The LES utility function is defined as:

𝑢 = ∑ 𝛽𝑖 log (𝑞𝑖𝑖 − 𝛼𝑖) (15)

where qi represents the consumption of good i. 𝛼𝑖 and 𝛽𝑖 are function parameters. The corresponding demand functions for commodities are:

𝑞𝑖 = 𝛼𝑖 + 𝛽𝑖

𝑝𝑖 (𝑍𝑤 + 𝑦 − ∑ 𝑝𝑗 𝛼𝑗 𝑗 ) (16)

In the present model w = W(1 - 𝜏𝐿) and y = kr(1 - 𝜏𝐾) + b

Labour supply (L) is given as:

𝐿 = 𝑍 − 𝑞𝐿 = 𝑍(1 − 𝛽𝐿 ) − 𝛼𝐿 − 𝛽𝐿

𝑤(𝑦 − ∑ 𝑝𝑗𝑗 𝛼𝑗 ) ` (17)

9

where 𝑞𝐿 stands for leisure. Z is the upper limit of time available to work, therefore 𝑞𝐿 = 𝑍 − 𝐿

In the computational model it is assumed that constant elasticity capital supply from (3)

is given in the form: 𝑟𝐾 = 𝑟(𝐾0 +𝜕𝐾

𝜕𝑚(𝑚 𝑃⁄ − 𝑚0) )(1 − 𝜏𝐾)

𝑒𝐾 (18)

where 𝜕𝐾

𝜕𝑚 is the change in net capital (after depreciation) and m/P is price-indexed total income,

given as: m = (wL + rK)/P. From (18) we can obtain a more precise definition of dead-weight

loss than in (1b). The change in dead-weight costs when moving from 0 to 𝜏𝐾 will be then:

∆𝐷(𝐾) = − 𝑟𝑒𝐾(𝐾0 +𝜕𝐾

𝜕𝑚(𝑚 𝑃⁄ − 𝑚0))

(1− 𝜏𝐾)(𝑒𝐾−1)𝜏𝐾

2

2 (19)

The numerical model that produced the results presented in Table 1 has the following

features: Initially an ordinary LES function was used, covering 9 commodities and labour. Later

a two-part LES covering 18 goods was introduced. There are 15 taxpayers in the model. The

LES utility function satisfies quasi-concavity. Combined with a convex budget constraint that

ensures a unique global optimum. The inequality aversion rate was set to 0.7, in order to obtain

an overall tax burden in the range between 40% and 50%, as observed in several advanced

economies. The demand parameter estimates for the 9 goods model were taken from Deaton

and Muellbauer (1980, p 71). The labour and capital income distributions are blended from

several empirical distributions. Usually, expenditure on public goods is fixed, set to 10% of

initial tax-free total income. The computer program related to this paper (Revesz (2019))

displays in the DATA lines all the parameters and income values adopted in the numerical

work.

The search for the optimum was carried out using a “grid” search method. This method

involves checking all tax combinations in order to find the optimum. In the present model

discrete tax rates were separated by intervals of 0.01. With indirect taxation the calculations

were carried out from 0.1 to 1.1, implying 100 discrete taxes. With labour and capital taxation

the range was from 0.1 to 0.7, implying 60 discrete taxes. Altogether, 100*60*60 = 360,000

combinations were examined. At each round, the program compared the utility level of the

present combination with the highest utility recorded earlier. If the utility of the present

combination turned out to be higher than the earlier one, then it replaced the top utility

combination. Otherwise, no change was made.

The model was restricted to only three taxes, because this way the program could be run

in less than an hour on a personal computer. A four-tax model (say with two indirect tax rates,

one for luxuries and one for necessities, or with two income tax brackets) could run on more

powerful computers. Beyond that, the number of combinations will run into billions. It is

possible that less computation-intensive methods could be used to locate the global optimum,

but for the time being this subject has not been taken up.

The specifications in Table 1 are restricted to the choice of taxes, the capital supply

elasticity and the percentage of public goods from total income. These choices can be made

through prompts in the Revesz (2019) program. Other choices can be made in the program by

changing DATA lines, as explained in the user’s guide. Overall, the percentage share results

indicate that indirect taxes dominate the optimal tax mix, usually accounting for over 80% of

total tax revenue.

10

Table1

Numerical results with ordinary LES

No. of

taxes

Elasticity

of capital

% public

goods

Indirect

tax

Labour

tax

Capital

tax

change

welfare

2 - 0 Actual 0.61 0.1

Predicted 0.55 -

% share 86 14

Compared with 40% uniform tax % change 10 10 2

2 - 10 Actual 0.68 0.1

Predicted 0.61 -

% share 86 14


2 1 10 Actual 0.42 0.10

Predicted 0.59 -

% Share 91 9

Compared with 40% uniform tax % change 9 -7 59 5

3 1 10 Actual 0.60 0.10 0.10

Predicted 0.61 - -

% Share 84 11 5

Compared with 40% uniform tax % change 22 4 69 14

3 0.25 10 Actual 0.59 0.10 0.12

Predicted 0.60 - 0.17

% Share 83 11 6


3 0.25 0 Actual 0.48 0.10 0.12

Predicted 0.51 - 0.20

% Share 82 11 7


Source: Own calculations using the program in Revesz (2019).

Predictions are only presented for cases where the actual tax rate is an interior point,

located above the 0.1 lower boundary line. For optimum values located at the boundaries, it is

not possible to apply predictive formulas, because these only apply at the interior. In all cases

the indirect tax rate is an interior point, but only in a few cases does this apply to labour or

capital tax.

The numbers in the fourth row require further explanations. The first three numbers

relate to quantities corresponding to tax rates, e.g. with indirect tax the corresponding quantity

will be total consumption. Each quantity in the optimal solution is compared with the

corresponding quantity obtained when all taxes are set to 40%. This percentage was chosen for

comparison because it is close the average tax burden in most scenarios. The purpose of the

exercise is to provide the reader a comparative view about the changes involved in moving from

a uniform solution to the highly differentiated optimal solution.

The change in welfare is more complicated. It is intended to reflect the change in utility

in monetary terms. It is based on dividing the change in utility by the Lagrangian using the

following formula: ∆𝑈∗ = 𝑈𝑜𝑝𝑡− 𝑈40%

𝜆𝑀𝑜𝑝𝑡 (20)

where 𝜆 = 𝑈(𝜏, 𝐺 + 1) − 𝑈(𝜏, 𝐺) and 𝑀𝑜𝑝𝑡 is total income in the optimal situation.

11

Equation (20) provides some tentative indication about the proportional change in welfare in

monetary terms.

The first scenario in Table 1 is a two-tax model involving only indirect and labour

taxation and no expenditure on public goods. The second scenario is the same, but with 10%

expenditure on public goods included. The results from both scenarios illustrate the clear

dominance of indirect taxation in the optimal tax mix.

The third scenario shows a situation where indirect taxation is missing. In this case

optimal labour income tax is much higher than the capital tax rate, which does not even move

above the boundary line, i.e. 0.1. This is in line with the inverse elasticity rule. In this model

the compensated elasticity of labour supply is 0.4, whereas that of capital is 1. Therefore, a

much higher optimal tax rate on labour than on capital is to be expected.

It should be noted that the optimums do not represent sharp peaks. They are top solutions

among many other solutions with similar high utilities. These high-utility solutions sometimes

involve tax combinations that are markedly different from the combination at the optimum, but

to a lesser extent, even in these cases the dominance of indirect taxation is maintained.

Notice that predicted tax rates are usually not far away from the actuals. This fairly close

concordance between actual and predicted tax rates, will help to explain later the puzzling

dominance of indirect taxation in the optimal tax mix. The next three scenarios, covering three-

tax models, further reinforce the conjecture about the dominance of indirect taxation.

Table 2

Numerical results with two-part LES

No of

taxes

Elasticity

of capital

% public

goods

Indirect

tax

Labour

tax

Capital

tax

Change

welfare

2 0 Actual 0.64, 1.28 0.10

Predicted 0.74 -

% Share 87 13


2 1 10 Actual - 0.48 0.10

Predicted - 0.60 -

% Share - 92 8

Compared with 40% uniform tax % change 4 -11 54 3

3 1 0 Actual 0.55, 1.10 0.10 0.11

Predicted 0.0.81 -

% Share 84 11 5


Source: Own calculations using the program in Revesz (2019).

So far, the discussion was concerned with ordinary LES covering 9 goods. In earlier

work (see Revesz, 2014, p. 18; 2018, p. 39) LES was divided into two functions with different

parameters, one for low wage earners the other for high wage earners. The two-part LES models

cover 18 goods, 9 for necessities and 9 for luxuries. This arrangement was needed in order to

open the way to non-linear Engel curves with LES. Notice that because of the introduction of

9 additional goods, the utility function of two-part LES is not the same as that of ordinary LES.

12

Some simulations were done with two-part LES. In was assumed in these simulations

that the tax rate on the nine luxuries is twice the tax rate on the nine necessities. This way, still

only one consumption tax had to be optimised, the second one was just consequential. As shown

in Table 2, the pattern of results with two-part LES turned out to be very similar to ordinary

LES. The introduction of the luxury tax made indirect taxation even more attractive, hence the

dominance of indirect taxation has increased slightly.

An explanation for the dominance of indirect taxation

In all models that included indirect taxation, it emerged as the dominant source of

revenue. This section will try to explain this phenomenon using the modified inverse elasticity

rules, described in eqs. (7), (9) and (12).

As noted earlier, there is reasonably good agreement between predicted values and

actual values, hence the inverse elasticity rules may help to gain a better understanding of the

mechanics driving the puzzling outcomes. These explanations will look not only at variables,

but also at some of the estimated numbers behind them.

Looking at the modified inverse elasticity rules in (7), (9) and (12) perhaps the most

striking feature is the large difference in the structure of predictive equation for 𝜏𝐶 in (12)

compared with the predictive equations for 𝜏𝐾 in (7) and for 𝜏𝐿 in (9). Not surprisingly, this difference will provide a major clue for the explanation.

In eqs. (7) and (9) the 𝜏𝐶 term is multiplied by 𝜕𝑐

𝜕𝑦. From (4):

𝜕𝑐

𝜕𝑦 = ∑ 𝑝𝑗

𝜕𝑞𝑗

𝜕𝑦𝑗

= 1 + 𝑤𝜕𝐿

𝜕𝑦+ 𝑟

𝜕𝐾

𝜕𝑦

The “income term”, 𝑤𝜕𝐿

𝜕𝑦 , in the program stays constant at -0.24. The interest rate, r, stays

constant at 0.07 and the capital-income ratio 𝜕𝐾

𝜕𝑦 is given as 3. Thus, in numerical terms

𝜕𝑐

𝜕𝑦 = 1

- 0.24 + 0.21 = 0.97. Hence, in the capital and labour formulas negative 𝜏𝐶 is multiplied by nearly 1. Assuming a 20% tax on labour and capital, the other terms inside the brackets are

small: 𝜏𝐿𝑤𝜕𝐿

𝜕𝑦= 0.2 ∗ (−0.24) = −0.048 and 𝜏𝐾𝑟

𝜕𝐾

𝜕𝑚 = 0.2*0.07*3 = 0.042

The situation is markedly different in the consumption tax equation (12). Here negative

𝜏𝐶 is multiplied by the labour cross-elasticity term, 𝜀�̅�𝐿, which is close to the negative of ∈̅𝐿 , according to (11b). In the present model 𝜀�̅�𝐿 ranges between -0.1 and -0.3, depending on the scenario. Hence, there is a situation where 𝜏𝐶 is multiplied by nearly 1 in the labour and capital tax equations, and by a negative number close to zero in the consumption tax equation.

That means that in the labour and capital equations 𝜏𝐶 has a strong negative effect in reducing optimal tax rates, whereas in the consumption tax equation 𝜏𝐶 has a small positive effect. This appears to be the main reason for high optimal tax rates on consumption and low tax rates on

capital and labour.

13

This conjecture is supported by another observation that contradicts a narrow

interpretation of the modified inverse elasticity rules. The compensated consumption elasticity

𝜀�̅�(𝑢0) lies in the range -0.8 to -0.85, depending on the scenario. By contrast, the compensated elasticity of labour 𝜀�̅�(𝑢0) lies in the range of 0.4 to 0.48. From a straight interpretation of the inverse elasticity rules, that implies that consumption should be taxed at a lower rate than labour

income. But in practice exactly the opposite happens, apparently because of the different roles

of 𝜏𝐶 in these predictive equations. This explanation implies that in tax mix models, optimal tax rates might be substantially different from those in non-mix models.

Some policy implications

At this stage the question arises: how do tax mixes look like in the real world? According

to statistics presented by Martinez-Vazquez et al. (2009, p. 5), in 2005 the average worldwide

direct-indirect tax ratio stood at around 0.95. However, there are large variations between

countries. The average for developed countries was around 2.05 while for developing countries

it was 0.7. Developing countries rely much less on direct taxation, because they lack adequate

accounting capabilities. Also, social security contributions, which are classified as direct tax,

are much less important in the developing world.

It appears that little change has occurred in recent years in Organisation for Economic

Cooperation and Development (OECD) countries. OECD (2019, Figure 3.20) reports that the

average standard VAT rate in OECD countries has increased from 18% in 2000 to 19.3% in

2019.

Fitoussi (2005) nominates two main reasons against shifting from income tax to

consumption tax. First such a move may increase, at least temporarily, inflationary pressures

and related disequilibrium problems. Second, such a move is likely to reduce tax progressivity.

The first point seems realistic, even if the tax reform is designed to be revenue neutral. The

second point is more debatable. Even if the current indirect taxation system is regressive, a

reform could be highly progressive, particularly if it involves taxing at higher rates luxury goods

and imposing higher property rates on luxury housing. Such a reform appears administratively

feasible in the computer age, though its political acceptability may be in doubt. All in all, apart

from possible inflationary pressures and adjustment problems, there seems to be no strong

economic reason against replacing part of income tax by progressive indirect taxation.

Summary and conclusions

This paper presented analytical and computational results in relation to a static three-

tax model of labour, capital and indirect taxation. Perhaps, the most significant finding in this

paper, is the dominant role of indirect taxation in the optimal tax mix. It is possible that this

result was partly due to some unique characteristics of the LES utility function.

Although the numerical findings are also supported by some analytical results, there is

a need for further research, to test the validity of the conjecture about the dominance of indirect

taxation in other models, under different utility function and capital supply specifications. The

explicit utility-demand functions that could be tested apart from LES include: CES, AIDS,

NLPS and direct and indirect addilog.

14

Another interesting topic in future numerical research could be, the estimated welfare

losses involved in adopting a “politically acceptable” solution, rather than the mathematically

optimal solution. The presence of many sub-optimal solutions with a similar utility level to the

optimum, suggests that it may be unwise to concentrate exclusively on the mathematical

optimum.

Some attention was given to approximate formulas for optimal linear capital and labour

income taxes, titled the modified inverse elasticity rule. These formulas were used to examine

the optimal linear taxation of capital and labour. They were helpful in explaining the puzzling

numerical results on the dominance of indirect taxation.

The incorporation of capital taxation does not appear to make much difference in the

results. Even in a two-tax model without capital taxation, there is a strong predilection in favour

of indirect taxation. But in this context, let us not forget that low capital taxation is politically

acceptable only in societies that are not strongly averse to inherited wealth or some other socio-

political aspects of high capital concentration, as implicitly assumed in the present model.

It is interesting to note that the modified inverse elasticity rule is an important element

of optimal taxation not only in respect to capital and labour, as indicated earlier, but also in the

field commodity taxation. Originally, the inverse elasticity rule was proposed by Ramsey

(1927) as an efficient method to generate a given amount of tax revenue through commodity

taxation. Apparently, the modified version of it, which takes into account also equity

considerations and the presence of other taxes, is applicable to all major forms of taxation.

However, as indicated in the explanation about the dominance of indirect taxation, the inverse

elasticity rule is not always the most important factor in determining optimal taxes. In a tax mix

model, the size and configuration of other taxes is sometimes more important.

So far, comprehensive numerical modelling of the tax mix has been missing from the

optimal taxation literature. Hopefully, the tantalising results from the present model, will

stimulate further analytical and computational research.

References

Arnold, J. M., Brys, B., Heady, C., Johansson, A., Schwellnus, C. and Vartia, L. (2011), “Tax

policy for economic recovery and growth”, The Economic Journal, 121 (550): F59-F80.

Atkinson, A. B. and J. E. Stiglitz (1976), “The design of tax structure: direct versus indirect

taxation”, Journal of Public Economics, 6: 55-75.

Bastani, S. and D. Waldenström (2018), How Should Capital be Taxed? Theory and Evidence

from Sweden, CESifo Working Paper No. 7004,

https://cepr.org/sites/default/files/news/FreeDP_Jan31.pdf

Conesa, J., Sagiri K. and D. Krueger (2009), ̋ Taxing capital? Not a bad idea after all˝, American

Economic Review, 99(1): 25-48.

Deaton, A. and Muellbauer, J. (1980), Economics and Consumer Behavior, Cambridge

University Press, New-York

Diamond, P. E. (1999), Taxation, Incomplete Markets, and Social Security, Munich Lectures

in Economics, The MIT Press, Cambridge, MA.

https://cepr.org/sites/default/files/news/FreeDP_Jan31.pdf

15

European Commission (2013) “Tax reforms in EU member states 2013”, European Economy

https://ec.europa.eu/economy_finance/publications/european_economy/2013/ee5_en.h

tm

Fitoussi, J.P. (2005), Shifting The Tax Burden: From Direct To Indirect Tax, HAL Archives-

Ouverters, France, https://hal-sciencespo.archives-ouvertes.fr/hal-00972763/document

Golosov, M. A., Tsyvinski, A. and I. Werning (2006), ˝New dynamic public finance: a user´s

guide˝, in NBER Macroeconomics Annual 2006, vol. 21: 317-388.

Hines, J. R. (1998), Three Sides of Harberger Triangles, NBER Working Paper 6852,

Cambridge, MA.

Hoy, M., Livernois, J., McKenna, C., Rees, R. and T. Stengos (2011), Mathematics for

Economics, The MIT Press, Cambridge, MA.

Hum D. and W. Simpson (1994), ˝Labour supply estimation and public policy˝, Journal of

Economic Surveys, 8(1): 59-81.

International Monetary Fund (IMF) (2014), Fiscal Policy and Income Inequality, IMF Policy

Paper 23. Washington DC.

https://www.imf.org/external/np/pp/eng/2014/012314.pdf

Krueger, D. and A. Ludwig (2018), Optimal Taxes on Capital in the OLG Model with

Uninsurable Idiosyncratic Income Risks, NBER Working Paper No. 24335, Cambridge,

MA

Martinez-Vazquez, J., Vulovic, V. and Y. Liu (2009), Direct versus Indirect Taxation: Trends,

Theory and Economic Significance, Andrew Young School of Policy Studies, Georgia

State University, https://ideas.repec.org/p/ays/ispwps/paper1014.html

Mirrlees, J. A. (1976), “Optimal tax theory: a synthesis”, Journal of Public Economics, 7: 327-

358.

Organisation for Economic Co-operation and Development (OECD) (2019), Tax Policy

Reforms 2019, www.oecd.org/tax/tax-policy-reforms-26173433.htm

Piketty, T. and E. Saez (2012), The Theory of Optimal Capital Taxation, NBER Working Paper

17989, Cambridge, MA.

Piketty, T. and E. Saez (2013), ˝The theory of optimal inheritance taxation˝, Econometrica,

82(4): 1241-1272.

Ramsey F. P. (1927), ˝A contribution to the theory of taxation˝, Economic Journal, 37(145):

47- 61.

Revesz, J. T. (1997) “Uniform versus non-uniform indirect taxation – some numerical results”,

Public Finance/Finances Publiques, 52: 210-234.

Revesz, J. T. (2014), “A numerical model of optimal differentiated indirect taxation”, Hacienda

Pública Española/Review of Public Economics, 211– (4/2014): 9–66.

Revesz, J. T. (2018), Some contributions to the theory of optimal indirect taxation, Editorial

Academica Espanola, Spain. Also available from Amazon.com

Revesz, J. T. (2019), ‘grid1’ - A Program to Find the Optimum in a Tax Mix Model, This

computer program and associated user’s guide are located in the website:

http://www.john1revesz.com

https://ec.europa.eu/economy_finance/publications/european_economy/2013/ee5_en.htmhttps://ec.europa.eu/economy_finance/publications/european_economy/2013/ee5_en.htmhttps://hal-sciencespo.archives-ouvertes.fr/hal-00972763/documenthttps://www.imf.org/external/np/pp/eng/2014/012314.pdfhttps://ideas.repec.org/p/ays/ispwps/paper1014.htmlhttp://www.oecd.org/tax/tax-policy-reforms-26173433.htmhttp://www.john1revesz.com/

16

Saez, E. and S. Stantcheva (2018), ˝A simpler theory of optimal capital taxation˝, Journal of

Public Economics, 162, pp. 120-142. Available online:

https://doi.org/10.1016/j.jpubeco.2017.10.004

Thomas, R. L. (1987), Applied Demand Analysis, UK., Longman Group, London, United

Kingdom.

Tuomala, M. (2016), Optimal Redistributive Taxation, Oxford University Press, Oxford,

United Kingdom.

Appendix

Proof for the optimal capital income tax formula The solution for the optimal capital tax formula is based on deriving the welfare function

in (1) in respect to 𝜏𝐾 and then equalizing the sum total of resulting components to zero. There are three welfare effects to consider following a change in 𝜏𝐾: (i) the income effect on capital owners, (ii) the effect on social utility of variations in the uniform lump-sum grant due to

changes in total tax revenue, (iii) the variations in dead-weight costs that arise due to tax related

distortions in the allocation of resources. The formula developed using this approach will be

examined here in more detail.

First comes the utility effect on capital owners. Assuming 𝜏𝐾 is increased by a small amount ∆𝜏𝐾. Then the change in welfare will be:

− ∑𝜕𝑢𝑖

𝜕𝑦𝑖𝑖 𝑘𝑖𝑟∆𝜏𝐾 = − ∑

𝜕𝑢𝑖

𝜕𝑦𝑖𝑖

𝑘𝑖 ∑ 𝑘𝑖𝑖 𝑟

∑ 𝑘𝑖𝑖 ∆𝜏𝐾 = − ∑ �̅�𝐾𝐾𝑖 𝑟∆𝜏𝐾 (A.1)

where �̅�𝐾 is the capital income weighted average social marginal utility of income of capital owners.

The change in welfare due to a variation in tax revenue is much more complicated than

that. Assuming fixed public goods expenditure requirements (G), all the tax generated above G

will be redistributed, by providing a uniform lump-sum grant (b) to everyone (Hb). The average

social marginal utility of income of this uniform grant will be: ∑𝑔𝑖

𝐻𝑖= �̅�𝑏, (A.2)

This is the unweighted average of the social marginal utilities of everyone.

Total tax revenue is given as:

𝑅 = ∑ (𝜏𝐾𝑘𝑖𝑟 +𝑖 𝜏𝐿𝑤𝐿𝑖 + 𝜏𝐶𝑐𝑖) (A.3)

where ci represents total consumption (in producer prices) of taxpayer i. The variation in total

tax revenue following ∆𝜏𝐾 change in the tax rate can be divided into two parts. First the direct

variation in capital income tax revenue: 𝜕𝑅1

𝜕𝜏𝐾∆𝜏𝐾 = ∑ (𝑘𝑖𝑟𝑖 − 𝜏𝐾

𝜕𝑘𝑖

𝜕(1−𝜏𝐾)𝑟)∆𝜏𝐾 (A.4)

Note, 𝜕𝑘𝑖

𝜕𝜏𝐾= −

𝜕𝑘𝑖

𝜕(1−𝜏𝐾)

https://doi.org/10.1016/j.jpubeco.2017.10.004

17

Second, the change in the tax rate will cause an oppositely signed change in consumer

income, amounting approximately to − ∑ 𝑘𝑖𝑟∆𝜏𝐾𝑖 . This change in household income will cause a change in consumption tax and labour income tax revenue, as shown in (A.5).

𝜕𝑅2

𝜕𝜏𝐾∆𝜏𝐾 = ∑ 𝑘𝑖𝑟𝑖 (− 𝜏𝐶

𝜕𝑐

𝜕𝑦− 𝜏𝐿

𝑤𝜕𝐿𝑖

𝜕𝑦𝑖)∆𝜏𝐾 (A.5)

Combining the two revenue effects leads to:

𝜕𝑅

𝜕𝜏𝐾∆𝜏𝐾 = 𝐾𝑟(1 − 𝜏𝐶

𝜕𝑐

𝜕𝑦− 𝜏𝐿

𝑤𝜕𝐿

𝜕𝑦)∆𝜏𝐾 − 𝜏𝐾

𝜕𝐾

𝜕(1−𝜏𝐾)𝑟∆𝜏𝐾 (A.6)

𝜕𝑐

𝜕𝑦 is defined from (4).

From definition (6), the derivative 𝜕𝐾𝑟

𝜕(1−𝜏𝐾) = 𝑒𝐾

𝐾𝑟

(1−𝜏𝐾) (A.7)

Replacing the derivative by the elasticity term in (A.6), we obtain:

∂R

∂τK∆τK = Kr (1- τC

∂c

∂y- τL

w∂L

∂y) ∆τK - τK𝑒𝐾

Kr

(1-τK)∆τK (A.8)

Combining (A.2) and (A.8), the change in social welfare due to the change in total tax

revenue will be: ∆𝑅 = �̅�𝑏[Kr (1- τC∂c

∂y- τL

w∂L

∂y) ∆τK - τK𝑒𝐾

Kr

(1-τK)∆τK] (A.9)

The third component of welfare change is related to dead-weight loss (D), defined in

eq. (1b). This is redefined in (A.10).

𝐷(𝐾) = − 𝜏𝐾

2(

𝜕𝐾𝑟

𝜕𝜏𝐾𝜏𝐾) = −

𝜏𝐾2

2

𝜕𝐾𝑟

𝜕𝜏𝐾≈

𝜏𝐾2

2𝑒𝐾

𝐾𝑟

(1−𝜏𝐾) (A.10)

Note, the third expression is only valid if the elasticity of capital supply stays at the

constant rate, 𝑒𝐾, along the domain from zero to 𝜏𝐾. If this condition is not satisfied, one can only talk about an approximation. The approximate change in the social welfare value of dead-

weight costs following ∆𝜏𝐾 variation in the tax rate will be then:

∆𝐷(𝐾) = �̅�𝐻𝜕𝐷

𝜕(1−𝜏𝐾)∆𝜏𝐾 = −

�̅�𝐻

2𝑒𝑘𝑟

𝜕(𝜏𝐾

2 𝐾(1−𝜏𝐾)

⁄ )

𝜕𝜏𝐾∆𝜏𝐾 =

= - �̅�𝐻Kr𝑒𝐾𝜏𝐾

(1− 𝜏𝐾){

1

(1− 𝜏𝐾)−

𝜏𝐾

2(1− 𝜏𝐾)(1 + 𝑒𝐾)} ∆𝜏𝐾 (A.11)

𝑔𝐻 represents the income weighted average social marginal utility of income of the entire population. In the absence of more detailed information, we assume that variations in dead-

weight costs are distributed in proportion to the marginal utilities of incomes of consumers.

We can now combine the three welfare variations from (A.1), (A.9) and (A.11) to yield

𝜕𝑈

𝜕𝜏𝐾∆𝜏𝐾 = −�̅�𝑘𝐾𝑟∆𝜏𝐾 + �̅�𝑏 [Kr

1

(1− 𝜏𝐾)( 1 – τC

∂c

∂y – τL

w∂L

∂y) ∆τK] −

− �̅�𝐻𝑒𝐾𝐾𝑟𝜏𝐾

(1− 𝜏𝐾){

1

(1− 𝜏𝐾)−

𝜏𝐾

2(1− 𝜏𝐾)(1 + 𝑒𝐾)} ∆𝜏𝐾 (A.12)

In order to be at the optimum, the terms on the right-hand side must add up to zero.

Eliminating ∆𝜏𝐾 from both sides, dividing all terms by Kr and setting the left-hand side to zero, we have:

0 = −�̅�𝑘 + �̅�𝑏 ( 1 – τC∂c

∂y – τL

w∂L

∂y) − �̅�𝐻𝑒𝐾

𝜏𝐾

(1− 𝜏𝐾){

1

(1− 𝜏𝐾)−

𝜏𝐾

2(1− 𝜏𝐾)(1 + 𝑒𝐾)} (A.13)

18

Transferring the last term to the left-hand side and dividing all terms by �̅�𝐻𝑒𝐾, we get:

𝜏𝐾

(1−𝜏𝐾){

1

(1− 𝜏𝐾)−

𝜏𝐾

2(1− 𝜏𝐾)(1 + 𝑒𝐾)} = −

�̅�𝑘

�̅�𝐻𝑒𝐾+

�̅�𝑏

�̅�𝐻𝑒𝐾 ( 1 - τC

∂c

∂y - τL

w∂L

∂y) (A.14)

Or in a more explicit form:

𝜏𝐾

(1−𝜏𝐾)=

− �̅�𝑘�̅�𝐻

+ �̅�𝑏�̅�𝐻

( 1 − 𝜏𝐶𝜕𝑐


𝑤𝜕𝐿

𝜕𝑦)

𝑒𝐾{1

(1− 𝜏𝐾) −


(1+ 𝑒𝐾)} (A.15)

This is the approximation for the optimal linear capital income tax rate. It will be

referred to it as the modified inverse elasticity rule for capital taxation.

It is of some interest to compare formula (A.15) with the one presented by Saez and

Stantcheva (2018). The steady-state tax formulas presented in their work are derived from the

following dynamic social welfare function for consumer i:

Vi = 𝛿𝑖 ∫ [𝑐𝑖(𝑡) + 𝑎𝑖(𝑘𝑖(𝑡)) − ℎ𝑖(𝑧𝑖(𝑡))]∞

0𝑒−𝑡𝛿𝑖𝑑𝑡 (A.16)

where 𝛿𝑖 is the time discount rate. In the steady-state solution time discounting disappears, and the problem becomes one of maximizing ∑ 𝑔𝑖[𝑐𝑖 + 𝑎𝑖(𝑘𝑖) − ℎ𝑖(𝑧𝑖)]𝑖 subject to the budget constraint 𝑐𝑖 = 𝑟𝑘𝑖 + 𝑧𝑖 − 𝑇(𝑘𝑖𝑟, 𝑧𝑖), where 𝑇(𝑘𝑖𝑟, 𝑧𝑖) is the combined tax (or support grant) to person i on capital and labour income.

Notice that the social welfare term is strongly separable between consumption and labour

income, is linear in consumption and contains a separate term, 𝑎𝑖(𝑘𝑖(𝑡)), to represent the

intrinsic utility derived from holding wealth. By contrast the utility function defined in (1) does

not aggregate consumption into a single variable, does not assume strong separability between

consumption and labour income, is not linear in consumption, and there is no intrinsic utility

from holding wealth. Wealth only contributes to utility by virtue of generating income.

Moreover, the present model is a static model that does not assume steady-state conditions, as

defined in dynamic growth models. The model of Saez and Stantcheva (2018) excludes ’income

effect’ by virtue of the utility function chosen. By contrast, in the numerical work discussed

here, 𝑤𝜕𝐿𝑖

𝜕𝑦𝑖 is set to -0.24.

From the definition of utility in (A.16), they obtain the following steady-state linear

capital income tax equation: 𝜏𝐾 = 1−�̅�𝑘

1− �̅�𝑘+ 𝑒𝐾 (A.17)

The definitions of �̅�𝑘 and 𝑒𝐾 in (A.17) are exactly the same as in (A.15). The differences between the formulas are mainly due to the dissimilarities between the utility

functions in (1) and (A.16). The presence of indirect taxes in the present model is another

reason. Despite differences in appearance, in polar cases the two formulas lead to similar

results.

When 𝑒𝐾 is very high, both (A.15) and (A.17) suggest that optimal 𝜏𝐾 will approach zero. When 𝑒𝐾 approaches zero, then 𝜏𝐾 in both (A.15) and (A.17) will converge to one, which is the highest permissible value.

When the social marginal utility of all income groups is the same (absence of

distributional objectives) then according to (A.17) �̅�𝑘 equals to 1 and optimal 𝜏𝐾 will be zero. In this case, (A.15) is reduced to the equation:

19

𝜏𝐾

(1−𝜏𝐾)=

− 1 + ( 1 − 𝜏𝐶𝜕𝑐


𝑤𝜕𝐿𝑖𝜕𝑦𝑖

)

𝑒𝐾{1

(1− 𝜏𝐾) −


(1+ 𝑒𝐾)} (A.18)

Provided G = 0, and 𝜏𝑐 and 𝜏𝐿 are zero, then 𝜏𝐾 = 0, in line with (A.17).

When the social marginal utility of capital owners approaches zero, then according to

(A.17) 𝜏𝐾 = 1

(1 + 𝑒𝐾)⁄ . This is the revenue maximizing rate. Eq. (A.15) is reduced to a

simpler form presented in (A.19):

τK

(1-τK)≈

g̅bg̅H

( 1 - τC∂c

∂y - τL

w∂Li∂yi

)

eK (A.19)

Denoting �̅�𝑏

�̅�𝐻 𝑎𝑠 �̅�𝑏𝐻, when 𝜏𝐶 and 𝜏𝐿 are both zero, then (A.19) is reduced to:

𝜏𝐾

(1−𝜏𝐾)≈

𝑔𝑏ℎ

𝑒𝐾 (A.20)

�̅�𝑏𝐻 is the ratio between the average social marginal utility of demogrant recipients and that of the entire population. The relationship between the two is discussed in Revesz (2014, pp. 21-

22). Adopting an estimated value between 1.1 and 1.6 appears realistic. Numerical

comparisons, using plausible values for �̅�𝑏𝐻 between 1.1 and 1.6 and for 𝑒𝐾 between 0.5 and 2, indicate that optimal 𝜏𝐾 from (A.17) and from (A.20) tend to be fairly close. The similarity of polar outcomes from the present model and the static steady-state model of Saez and

Stantcheva (2018), suggests that the two models are not far apart.

User’s guide to the computer program ‘grid1’

This program carries out optimisation of linear commodity, labour and capital income

tax rates. We used the “grid” method to find the optimum, which means that it examined all

possible combinations of linear taxes. It yielded the numerical results discussed in the paper

titled “A model of the optimal tax mix including capital taxation”. The user can specify values

such as utility function parameters, wage distribution and the inequality aversion of society.

The original program was developed more than 20 years ago and was used in publications

dealing with optimal commodity taxation. (Revesz, 1997, 2014, 2018). The current version of

the program focuses only on the optimal mix of three linear taxes. It contains in addition to

commodity taxation, the optimisation of labour and capital income taxes. Prompts enable the

user to choose any combination of these three broad categories of taxes.

20

Program installation and running

This program, located on the john1revesz.com website, is written in a programming

language called QBasic, which two decades ago was part of Microsoft Windows. The

program is written as a “source code” in a Word file, without being converted into machine

language, as are most computer programs sold on the market. A special compiler is needed to

convert it into machine code. A QBasic compiler for 64 bit machines is available free of

charge from the following website: http://www.qb64.net/ . Note, the QBasic compiler will

accept the source code only in Word format.

Having a QBasic compiler, the installation is simple. Open the file named qb64 and

then Edit and Paste the entire content of the ‘grid’ Word file into the QBasic screen. Next,

click Run and then Start. Less than two minutes later the first prompt will appear on the

screen. The prompts present a menu of options. This menu provides access to various options

such as type of utility function, inequality aversion rate, the selection of taxes to be optimised,

the choice of ordinary or two-part LES, the parameter of constant elasticity capital, as well as

the size of expenditure on public goods as a percentage of total income.

After you finished with the prompts, processing will start automatically and will finish

in less than an hour. The output file, called ‘tax2.txt’, will be located in the same folder as the

QBasic compiler. In case you encounter any problem when responding to prompts, the

quickest way to overcome it is to exit and start again. Click on the cross at the upper right

hand corner of the QBasic screen. After the screen disappears, start running the program

again.

Finally, all the library files associated with the compiler should be kept in the same

folder as the compiler file. Because the program is a source code in Word format, it could be

corrupted accidentally or during data entry. For that reason, it is advisable to keep a couple of

copies in a separate folder.

DATA lines

Virtually all parameters related to consumer demand and labour supply can be

changed by the user, as well as data related to income distribution. All the numbers in the

DATA lines close to the beginning of the source code can be altered. Above the DATA lines

are text lines (marked at the left by asterisk to indicate that it is a comment and not a program

line) explaining what the DATA lines refer to. These include initial wage rates, initial capital

incomes and LES utility-demand parameters. After reading the paper, most of these

parameters are self-explanatory.

Usually there are a number of DATA lines for each item. Those marked with an

asterisk at the beginning of the line are ignored by the program. Only the unmarked DATA

line is currently active. Considerable care must be exercised when entering new numbers. If

the number of entries in the DATA line does not correspond to the number of READs

specified underneath, then the program will be corrupted without warning. If there are too few

entries, the READ instruction will assume that the missing numbers are zero. If there are too

many entries, the surplus numbers will be picked up by the following READ instruction

relating to a different variable or parameter. As a result, the output from the program will be

meaningless.

http://www.john1revesz.com/http://www.qb64.net/

21

Please check the parameter summary tables appearing at the beginning of the printout,

to ensure that correct numbers are used by the program. Always make sure that there is only

one DATA line without an asterisk for each variable.

Note, every time you change DATA lines on the ‘grid1’ Word file, you have to copy

and paste the file again into the QBasic screen

Utility and demand parameters

The current version of the program enables the user to specify LES utility-demand

parameters according to his/her choice, for the two wage groups in the model. The relevant

DATA lines for 9 goods and 18 goods utility-demand appear close to the beginning of the

source code. The 9 goods utility function is allocated to the 8 lower wage earners, the 18

goods utility function is allocated to the 7 higher wage earners. The leisure parameters are

defined under them, through equations with numbers on the right hand side. The program

checks that the betas of commodities and leisure add up to one. If this condition is not

satisfied, a warning will appear at the very start of the printout and program execution will

terminate. To ensure the global quasi-concavity of LES utility, all betas must be positive.

Labour and capital income tax schedules

The constant elasticity of capital supply is entered through a prompt. No such prompt

appears for labour supply, because the compensated elasticity of labour supply is determined

by the program for each taxpayer, according to his/her wage rate, lump-sum income and

relevant LES parameters, specified in DATA lines.

Inequality aversion

The program contains two social welfare functions to describe the inequality aversion

of society, one is LES utility (a logarithmic function) the other is maximin. Maximin

considers only the utility level of the lowest wage group. It leads to very high optimal tax

rates. Even LES leads to fairly high tax rates. To overcome this problem, an inequality

aversion rate (z) was introduced, as explained in Revesz (2018, Appendix 3.3 section C.4).

The inequality aversion rate ranges between zero and one. Zero represents the absence of

distributional objectives, one represents LES utility. Any number between zero and one is a

convex combination of the two. The user can choose this rate in the second prompt.

Sometimes when z is low, the search for the optimum does not converge properly. In these

cases an error message is issued, and you should try a somewhat higher value of z.

a model of the optimal tax mix including capital...

Documents