working paper series - scuola superiore sant'anna · model. daniele and malanima (2007, p....

LEMLEMWORKING PAPER SERIES

Where Do We Go From Here? Market Access and Regional Development in

Italy (1871-1911)

Anna Missiaia °

° Lund University, Sweden

2015/18 June 2015ISSN(ONLINE) 2284-0400

Where Do We Go From Here? Market Access and Regional

Development in Italy (1871–1911)

Anna Missiaia∗

Lund University

Abstract

Italy has been characterized, throughout its history as a unified country, by large

regional differentials in the levels of income, industrialization and socio-economic

development. This paper aims at testing the New Economic Geography hypothesis on

the role of market access in explaining these regional differentials. We first quantify

market access of the Italian regions for benchmark years from 1871 to 1911 following

Harris (1954). We then use these estimates to study the causal link between GDP

per capita and market potential following Head and Mayer (2011). The main result of

this paper is that only domestic market potential, which represents the home market,

shows a “traditional” North-South divide. When international markets are introduced,

the South does not appear to lag behind. Regression analysis confirms that market

potential is a strong determinant of GDP per capita only in its domestic formulation.

This suggests that the home market in this period mattered far more for growth than

the international markets, casting new light on one of the classical explanations to the

North-South divide.

JEL Classification Numbers: N33, N73, N93, O18, R11.

Keywords: Economic Geography; Economic History of Italy; Market Potential.

∗Lund University, Economic History Department, Room 2110, Alfa 1, Scheelevagen 15 B, Lund, Skane

22363, Sweden, e-mail:[email protected]. The author is grateful to Brian A’Hearn, Carlo Ciccarelli,

Emanuele Felice, Giovanni Federico, Stefano Fenoaltea, Steve Gibbons and Max Schulze for advice and

suggestions, and to participants of the seminars at the London School of Economics, Bocconi University,

EHS Annual Conference, Carlos III Summer Schools and European Business History Association Summer

School, FRESH Winter School for helpful comments.

1

1 Introduction

Italy’s political Unification dates back to 1861. It was the result of two independence

wars against the Habsburg Empire and the Expedition of the Thousand through which

Southern regions were annexed to the newly unified North. At the time of Unification,

the regions of Italy were not at all homogeneous in terms of their economic structure and

performance. Although they did not show as large a North-South divide in terms of GDP

per capita as today’s, it is in this period that the gap starts widening (see Figure 1).

It is in the 1880s that the Italian regions start to polarize in terms of GDP per capita

and even more sharply in the level of industrial production. During the first decades after

Unification, most of the economic gap was between the Northwestern regions (Piedmont,

Lombardy and Liguria) versus the rest of the country. If we look at the indicators of

economic development, such as the literacy rate, the level of regional inequality appears

evident at the time of Unification. Figure 2 shows the literacy rates for each region in the

benchmark years 1871, 1881, 1901 and 1911. Finally, looking at the industrial value added

per capita (which corresponds specifically to the part of GDP produced by industry) in

Figure 3, the formation of the gap during this period is much more evident, due to the

process of industrialization which was largely concentrated in the Northwestern regions of

Piedmont, Lombardy and Liguria.

The historiography has brought forward several explanations for what is often referred

to as the “Questione Meridionale”, the Southern Question. The hypotheses proposed by

scholars range from colonial exploitation of the South by the North (Villari (1979) and

Sonnino (1877)), to differences in agricultural structure (Cafagna (1989) and Zamagni

(1990)), to differences in the institutional framework (Felice and Vasta (2012)) all the way

to cultural differences (Galassi (2000) and A’Hearn (2000)).1

Another line of research has been focusing on the physical geography of Italy as the

primary cause of the backwardness of the South. For instance, Fenoaltea (2006, p. 261)

gives a possible explanation for the regional patterns of industrialization in Italy before

the First World War based on the comparative advantages of the North, in particular

in terms of energy endowment from water, but he does not translate it into a formal

model. Daniele and Malanima (2007, p. 181) discuss the role of the physical distance

of the southern regions to the centre of Europe and they claim that the position of the

South constituted a natural disadvantage for its industrialization. This explanation is

strongly rejected by Felice (2013, p. 201) who uses several other historical examples, i.e.

1For a more detailed overview of these views see Missiaia (2014).

2

Figure 1: GDP per capita of the Italian regions, 1871–1911 (constant 1911 prices,

Italy=100).

Source: Felice (2009) for 1881 and 1901 and Brunetti et al. (2011) for 1871, 1891,

1901 and 2009.

California, Japan and Southeast Asia, to demonstrate that mere physical distance from

the core does not necessary imply a disadvantage. One other existing piece of research

that links Economic Geography to the regional disparities in Italy is a working paper

by A’Hearn and Venables (2011). The paper explores the relationship between economic

disparities, internal geography and external trade for the 150 years of the unitary history of

Italy. The authors propose different explanations for what drove economic activity across

Italian regions in different periods: in the period 1861–1890 the main driver was natural

advantage; in the period 1890–1950 it was access to the domestic market; and finally in

the post-war period it was the access to foreign markets. The purpose of this paper is

to contribute to the existing debate on the role of geography in the Italian North–South

divide.

In Economic Geography, we know that there are two competing views on why certain

regions attract economic activity more than others. The traditional Heckscher-Ohlin (H-

O) view focuses on factor endowment to explain the location of economic activity, meaning

that regions with a higher endowment of natural, financial or human resources attract

economic activity. Opposite to this, the New Economic Geography (NEG) view considers

market access as the main force. While measuring endowments is a fairly straightforward

procedure, if not in terms of data collection at least in terms of methodology, measuring

3

Figure 2: Literacy rates in the Italian regions, 1871–1911.

Source: A’Hearn et al. (2011).

market access is far more complicated. Reliable measures of market access are an essential

starting point in order to evaluate NEG forces. This paper looks at the role of market

access in the regional divergence of Italian regions, using the concept of market potential.

This is a measure of the centrality of a region in terms of its access to markets. Given

the lack of trade volumes data for the Italian regions, this measure will be based on the

GDP of each region and the GDP of the adjacent regions, weighted by their distance.

The formulation used here dates back to the seminal work by Harris (1954), adjusted

by the several developments and extensions since then. This paper will estimate the

market potentials of all Italian regions for a series of ten year benchmarks, from 1871 to

1911, following the methodology by Crafts (2005) and Schulze (2007). These estimates

will allow us to look at the market access of different regions both before the process of

industrialization gained ground and during its evolution.

The estimation of market access through market potentials has fruitful applications

beyond the mere quantification of the relative position of the regions. In this paper we

use both domestic and total market potential in order to explain regional GDP per capita

and regional industrial value added per capita. The main result is that regional GDP

per capita in this period was affected by domestic market potential more strongly than

by total market potential, which includes trading partners. In particular, it is the case

4

Figure 3: Industrial value added per capita in the Italian regions, 1871–1911 (constant

1911 prices, Italy=100).

Source: Fenoaltea (2003) and MAIC (1874, 1883, 1902, 1914).

when the model is run in first differences, which corresponds to looking at growth rates.

The second economic indicator that we try to explain is industrial value added per capita,

which in terms of growth rates seems to be much less affected by any type of market

potential. This measure is included in the analysis to help us focus on the part of GDP

generated by the secondary sector, which appears to be more geographically polarized.

This paper is organized as follows. Section 2 discusses the methodology used to

calculate market potentials for the Italian regions; Section 3 illustrates the sources used;

Section 4 shows the estimates and provides a commentary on the results; Section 5 applies

these estimates to explain GDP per capita and industrial value added per capita; Section 6

concludes.

2 Market access in regional analysis: empirical framework

In the literature, market potential has been calculated following several methodologies.

The most basic one goes back to Harris (1954) and uses retail sales weighted by distances

to evaluate the market access of US regions. This type of formulation has been used

5

in more recent works such as Midelfart et al. (2000) on the European Union, Crafts

(2005) on Britain (1870–1910), Schulze (2007) on Austria-Hungary (1870–1910), Martinez-

Galarraga (2012) on Spain (1856–1929) and Klein and Crafts (2012) on the US (1880–

1920). All these works use the total GDP of the regions weighted by distance.2 The

alternative approach to calculating market potentials is to use a gravity model to estimate

the functional form of market potential. A gravity model explains the volumes of trade

among regions using mainly the size of the regions and the distance between each region,

jointly with some dummy variables and controls such as adjacency or the presence of a

border. Market potential in this case is calculated through the parameters estimated by

means of the gravity model. Examples of works using this methodology are Redding and

Venables (2004) at world level, Head and Mayer (2004) on Japanese firms in the European

Union, Hanson (2005) on the United States, and in historical perspective, Wolf (2007) on

Poland (1925–1937) . This exercise is not possible in the case of Italian regions because

data on volumes of trade within the Italian borders are not available for this period. The

next section illustrates the methodology adopted here, which largely follows Crafts (2005)

and Schulze (2007).

2.1 Modelling market potential

In its original formulation, the market potential of region A was defined as the sum of the

GDP of all the adjacent regions, each weighted by their distance from region C, plus the

GDP of region C adjusted by a coefficient that takes account of its size. The calculation

of market potential, following Harris (1954), is shown in Equation (1):

MPc =∑

w

GDPw ×Dγc,w (1)

with Dγc,w the distance between region c and w. The parameter γ is set as -1 and is defined

as in Equation (2):

Dc,c = 0.333×

√

Areacπ

. (2)

The idea behind market potential is quite straightforward. First, for each region C,

we take the main node. The second step is to calculate the distances between the node of

region C and the nodes of each other adjacent region.3 For a given region C, the larger

the GDP of the other regions, the better the access of the regions to markets; the larger

the distance between region C and the other regions and the lower the weight of the GDP

2All the works actually use transport cost adjusted distances; the only exception is Klein and Crafts(2012), which uses straight line distances. The appropriateness and difference between these approachesis discussed later in the paper.

3In this case all the Italian regions plus the main trading partners of Italy.

6

of each of these regions in the market access of the region concerned. Finally, Equation

(1) shows how to deal with the own GDP of the region, which represents the contribution

of the home GDP to the overall market access: Harris (1954) proposes a formula for own

distance that takes into account the size of the region, so that the larger the region, the

lower the weight of its own GDP. The rationale here is that for a given level of GDP, the

larger the region the more spread out, and therefore harder to access, is its own GDP.

Although a gravity model cannot be used because of the lack of internal volumes

of trade data, several refinements are still possible. First of all, distance in our case is

weighted by transport costs as normally done in the literature on market integration and

market potential when volumes of trade are not available. In this case, we decided to take

into account both ground distances, which are assumed to be covered by railway and sea

distances which are assumed to be covered by ship. For each pair of nodes, we calculate

the cheapest combination of railway and shipping. To do so, we apply to all distances both

a variable component (cost per km) and a terminal component (a lump sum cost when

using each given mean of transportation). Whenever a part of the distance is assumed

to be covered with a different mean of transportation, the corresponding terminal cost

is applied. Finally, the last cost to take into account is the existence of trade barriers

between nodes. It is not the case for Italian nodes, but whenever one of the two nodes is

a foreign city, a correction is needed. Following Crafts (2005) and Schulze (2007), tariffs

are converted into distance equivalents. This procedure is based on the coefficients of the

gravity model by Estevadeordal et al. (2002). The elasticities of the model are used to

convert ad valorem tariffs into a distance equivalent measure to be added to the regular

terminal component of the transport cost. The tariff between Italy and each trading

partner is computed as the ratio of the total custom revenues of the trading partner over

its total imports. This gives an average tariff level for each country.

2.2 Testing the effect of market potential on economic development

The goal of this paper is to study the relationship between economic development and

market access in the Italian regions in the period 1871–1911. The empirical framework we

use is taken from the work by Head and Mayer (2011) on market potential and economic

development in the period 1965–2003. This work focuses on the calculation of market

potentials for all countries in the world, relating them to GDP per capita. The main

methodological difference between our work and that by Head and Mayer (2011) is the

calculation of market potentials: here the calculation of market potentials follows Harris

7

(1954) using GDP figures and transport costs; Head and Mayer (2011) use a gravity model

based on trade data.

After obtaining market potential estimates, we implement the model; our goal is to

cast light on the relationship between economic development and market access. We rely

on the following base line specification:

ln(GDPpci)t = βt Market Potentiali

+ αt Region Controlsi

+∑

t

θt Year + ǫt .

(3)

The model described in Equation 3 aims at explaining the GDP per capita of region

i through market potential as the main explanatory variable. The region controls are as

follows: South, which is equal to 1 if the region is in the South (we also show a version of

this model with region fixed effects and with latitude as a control); literacy, which is the

literacy rate in a given region and the share of arable land. This latter is used in 1871

level to explain all years as a method to avoid endogeneity.

Equation 3 can thus be expanded in the following estimating equation:

ln(GDPpci)t = βt Market Potentiali

+ γ1t Latitudei + α1t Literacyi

+ α2t Share Arable Landi

+∑

t

θt Year + ǫt .

(4)

In Section 5 various specifications of this model are shown. We also split the sample

in North and South and we show the same regression with industrial value added per

capita as a dependent variable. All the models are also run in first differences to address

collinearity concerns. Before we show the results, the next section illustrates in detail the

sources used.

8

3 Sources

In this section we describe the sources used for testing the model of Equation 4. All

monetary measures are taken in constant 1911 lire. The regions considered in this work

are the sixteen regions created in 1870 after the annexation of Rome. Figure 4 shows the

boundaries of the Italian regions which did not change over the period.4

Starting the analysis on market potentials in 1871 and ending it in 1911 is both

historically and practically useful. From a pragmatic point of view, the 1871–1911 period

is convenient because borders did not have any variation and because 1871 was the year of

the first census after the main annexations. In fact, Italy was formally unified in 1861 but

its borders changed twice, in 1866 and in 1870, when Veneto and Latium were annexed.

All the main regions of Italy (except Trentino Alto Adige and Venezia Giulia) were part

of Italy in 1871. From an historical point of view, ending the analysis in 1911 allows us

to isolate this period of the early industrialization from the effects of the First World War

and Fascism.

3.1 Market potentials

The variable at the core of this model is market potential. Market potential is calculated

using regional the GDP, the GDP of trading partners, transport cost adjusted distances

and tariffs. The next sections illustrates the sources and how they are used for each of the

components of market potential.

Regional and Foreign GDP Estimates. The first and main ingredient of market

potential is GDP. The estimation of regional disparities for Italy in the period 1871–1911

has been a matter for discussion among scholars for a long time.5 For the GDP estimates

for the Italian regions the latest available series are those from the work of Emanuele

Felice. The estimates of regional GDP used here come from Felice (2009) for 1881 and

1901 and Brunetti et al. (2011), of which Felice is a coauthor, for 1871, 1891 and 1911.

The data provided here on the regional disparities of GDP per capita are the starting point

for deriving GDP estimates in levels for this period. The next step in the procedure is to

apply these per capita disparities to the national GDP per capita. We decided to use for

this step the GDP estimates published by Baffigi (2011) within the broader project of the

Bank of Italy for the 150th anniversary of the Unification. These are the latest estimates

4These regions are quite similar to present regions, with the exception of Venezia Giulia and TrentinoAlto Adige which were not yet Italian and Valle Aosta and Molise which were at the time parts of Piedmontand Abruzzi, respectively.

5See Fenoaltea (2003), Zamagni (1978), Felice (2007), Felice (2009) and Brunetti et al. (2011).

9

Figure 4: Italian regions, 1870–1918.

for the Italian national income and are published both in constant and current prices. In

this paper we use the GDP figures in 1911 constant lire. Starting from the national GDP

estimates by Baffigi (2011), we calculate the national GDP per capita estimates by dividing

them by the present Italian population. This figure is then multiplied by the coefficients

of regional disparities provided by Felice (2009) and Brunetti et al. (2011) to work out all

the level of the regional GDP per capita. Finally, the per capita figures are multiplied by

the regional population figures to obtain the total GDP of each region in levels at constant

1911 prices.6 The regional GDP disparities from Felice (2009) and Brunetti et al. (2011)

are shown in Figure 5.

With regard to the GDP of foreign trading partners, the procedure is the following.

Using data on exports from the Annuario Statistico Italiano, for each benchmark year, we

take all the countries that cover 80% of Italian foreign trade. The trading partners included

are: Austria-Hungary, France, Germany, the United Kingdom, Switzerland, Argentina and

the United States.7 Figure 6 shows the Italian exports to these countries as the share of

total exports for each benchmark year.

6To illustrate, assume that the national GDP per capita in a given year is 100 lire. If the GDP percapita coefficient for a region A is 1.20, the GDP per capita is 120. If the present population of the regionin 1871 is 1,000,000 than the total GDP of region A in 1871 is derived by multiplying these three figures,returning 1,200,000,000 lire.

7The only exception to the 80% criterion is Turkey, for which GDP estimates for this period in currentprices are not easily available from either source.

10

Figure 5: Total GDP of the Italian regions, 1871–1911 (constant 1911 prices, Italy=100).

Source: Felice (2009) for 1881 and 1901 and Brunetti et al. (2011) for 1871, 1891and 1911.

The main source for the GDP of foreign partners is Crafts (2005), who relies primarily

on the work of Prados de la Escosura (2000). Argentina and Switzerland come directly

from Prados de la Escosura (2000) for the years 1881–1911.8 We also consider Austria

and Hungary separately. In order to do so, we split the estimate by Crafts (2005).9

Figure 7 shows the magnitude of the Italian GDP compared to foreign GDP. Looking

at Figure 7, the different magnitude of the US, for 1911 in particular, stands out. Including

such a large GDP compared to that of the Italian regions could be problematic in the sense

that most of the market access could be driven by the US. To underplay the role of the US,

we split it in four macro-regions, Northeast, Midwest, West and South. The implication

of this choice is discussed in more detail in the next section. The regional disparities for

the US are worked out from Klein (2009). These are the same estimates as those on which

the market potentials of Klein and Crafts (2012) are based for 1881–1911. To work out

1871, we use the same disparities as for 1881.

Distances. In order to weight the sum of GDP, a distance measure is required. As in

previous works by Crafts (2005) and Schulze (2007), geographical distance is replaced by

transport costs. This is suitable because straight line distances per se do not take into

account differences of costs for alternative means of transportation nor of the existence

8For 1871, estimates were not available; therefore we used the relative disparity among countries of1881 and applied it to 1871.

9The two GDP are worked out by looking at the relative size in each year from Schulze (2007).

11

Figure 6: Italian export shares, 1871–1911.

Source: Annuario Statistico Italiano 1877, 1881, 1892, 1911.

of railway lines and ports. The first step is to choose a node for each region and foreign

trading partner. Then the distance in terms of railway line or sea route (or the two

together) between each pair of nodes is computed. The administrative regions considered

are shown in Figure 4.

The general rule in selecting the nodes is to take the most populous centre which often

corresponds to the administrative centre. Exceptions are Terni and Pescara for Umbria and

Marches. The reason for this choice is that the actual administrative centres, Perugia and

L’Aquila, were not the economic centres of the regions and were not very well integrated

in its transport network. Using them would have created too high a penalty for Umbria

and Abruzzi in terms of market access. For the US, none of the nodes is the administrative

centre since in the US the largest cities almost never correspond to the capital. The two

means of transportation considered in this period are railways and shipping. For railways,

the length of lines has been worked out from the following sources: Bradshaw’s Continental

Guide of 1914 and the publication on the all lines opened by Ferrovie dello Stato (1927).

Relying on sources that cover the whole period is very important, because this takes into

account the construction of new lines. For shipping, distances were easily computed from

the website www.dataloy.com, which provides the length of maritime routes between all

the main ports worldwide.

Transport Costs. Once a matrix of distances is computed, the next step is to quantify

the rate per tonne per kilometre. The rate taken into account is the average rate between

12

Figure 7: GDP of Italy and its main trading partners, 1871–1911 (constant 1911 millionlire).

Source: Prados de la Escosura (2000) and Crafts (2005).

coal and wheat, which are considered here the two representative goods.10 For Italian

railways, the source is a publication on railway rates by Ferrovie dello Stato (1912). This

publication is quite detailed, providing terminal and variable components of the rate of

transportation for a variety of goods. For the rates of foreign countries, we rely on the

work by Schulze (2007) which uses on the information from the US Bureau for Railway

Economics (1915) and Noyes (1905) to compute terminal and variable components. The

first source provides an overview of 1914 rates for different countries and for both coal

and wheat, separating the cost in terminal and variable component for different city pairs

in each country. From this, the terminal and variable components for various countries

are worked out. The next step is to project these estimates back in time. Noyes (1905)

provides average rates starting from 1870. This information is converted into an index

and used jointly with the 1914 baseline to extrapolate terminal and variable components

for the whole period. For the US, which is not covered by Schulze (2007), we rely on

the information from Noyes (1905) and assume that the US rate is 50% of the general

European rate. For shipping, there are no sources specific to Italy. The estimates for

international ocean shipping from Kaukiainen (2003) are used for all routes. This is a

10We would of course like to take into account the transportation cost of all industrial goods. However,collecting information on transport rate for all goods would be extremely data and time consuming.Moreover, it would not be clear how to use this information in a synthetic measure. Therefore, we followthe existing literature such as Crafts (2005) and Schulze (2007) and adopt the standard solution.

13

Table 1: Regions, trading partners and nodes.

Region/Country Node

Nodes with sea access Liguria Genoa

Venetia Venice

Marches Ancona

Campania Naples

Apulia Bari

Calabria Reggio Calabria

Sicily Palermo

Sardinia Cagliari

United Kingdom London

Turkey Istanbul

Argentina Buenos Aires

Unites States New York

Northeast New York

South New Orleans

West San Francisco

Nodes without access to the sea Piedmont Turin

Lombardy Milan

Emilia Bologna

Tuscany Florence

Umbria Terni

Latium Rome

Abruzzi Pescara

Basilicata Potenza

Austria-Hungary Vienna

France Paris

Germany Berlin

Switzerland Zurich

widely used source for this type of research, including works on Italy.11 The transport

costs used are set out in Tables 2, 3 and 4.

Table 2: Shipping rate per tonne per km, in constant 1911 lire, 1871-1911.

Terminal Component Cost per km

1871 19.67 0.00307

1881 14.71 0.00180

1891 11.14 0.00116

1901 9.06 0.00111

1911 7.29 0.0011

Source: Kaukiainen (2003)

11Federico (2007) uses the same source to study the market integration of Italy in the 19th century.

14

Table 3: Italian railway rates per tonne per km, in constant 1911 lire, 1871-1911.

Terminal Component Cost per km

1871 1.93 0.05688

1881 1.83 0.05386

1891 1.51 0.04456

1901 1.56 0.04588

1911 1.26 0.03731

Source: Ferrovie dello Stato (1912), Schulze (2007), Noyes (1905)

Table 4: Railway and Shipping rates per tonne per km, in constant 1911 lire 1871-1911.

Austria Railway Rates France Railway Rates Germany Railway Rates

Terminal Variable Terminal Variable Terminal Variable

1871 5.44 0.08342 7.67 0.01937 6.56 0.03344

1881 4.35 0.06633 8.69 0.02195 6.69 0.03379

1891 3.11 0.04860 7.61 0.01924 6.21 0.03149

1901 2.98 0.04618 7.00 0.01753 5.96 0.03057

1911 2.60 0.04013 5.73 0.01450 4.90 0.02505

Europe Railway Rates UK Railway Rates US Railway Rates

Terminal Variable Terminal Variable Terminal Variable

1871 7.55 0.034289 2.35 0.052741 7.99 0.03614

1881 7.58 0.034409 2.34 0.05054 4.88 0.022046

1891 6.32 0.028692 2.04 0.045535 3.63 0.016437

1901 6.19 0.028006 2.29 0.0491 3.10 0.014003

1911 5.10 0.023171 1.87 0.041229 2.55 0.011585

Source: Ferrovie dello Stato (1912), Schulze (2007), Noyes (1905) andUS Bureau for Railway Economics (1915)

Tariffs. The last cost to be taken into account is the one originated by trade barriers

between nodes. It is not the case for Italian nodes, but whenever one of the two nodes is

a foreign city, a correction is needed. Following Crafts (2005) and Schulze (2007), tariffs

are converted into distance equivalents. The source used is Mitchell (2003) except for the

United Kingdom, where Mitchell (1988) is used. Capie (1994) gives data for 1870 Germany

and Ferreres (2005) for Argentina. Whenever a year is missing, either the closest available

year is used or the gap is filled by interpolation. Table 5 shows the level of tariffs used.

3.2 Region controls

In Section 5, GDP per capita and industrial value added per capita are explained using

market potentials, geographical controls (such as dummies for macro areas, region fixed

effects and latitude) and two other controls; literacy rates and share of arable land. The

share of arable land is shown in Figure 8 while literacy rates in Figure 2 are derived from

(A’Hearn et al., 2011).

15

Table 5: Ad valorem tariffs, 1871–1911.

Austria-Hungary France Germany UK Switzerland Argentina US

1871 2% 3% 8% 6% 2% 22% 29%

1881 0% 6% 6% 5% 2% 27% 24%

1891 3% 8% 9% 5% 3% 20% 20%

1901 6% 9% 9% 9% 4% 29% 20%

1911 6% 10% 8% 12% 4% 2% 15%

Source: Mitchell (2003) for Austria-Hungary, France and Germany, Mitchell (1988)for the United Kingdom, Capie (1994) for 1870 Germany and Ferreres (2005) forArgentina.

Figure 8: Share of arable land by province, 1870.

Source: MAIC (1976).

A last remark is on the year 1891. The 1891 population census has not been carried

out owing to budget restrictions. For this reason, explanatory variables such as literacy

rates are not available. In order to avoid relying heavily on interpolations, we decided to

leave 1891 out of the regression analysis. The next section shows the results of the market

potential calculations.

4 Market potential of Italian regions, 1871–1911: empirical

results

This section shows and compares different versions of market potential for the Italian

regions that will be used in the next section as explanatory variable for GDP per capita and

industrial value added per capita. The first one proposed is a domestic market potential,

exclusively taking into account the Italian regions. The second one is a repetition of the

domestic market potential using straight distances and shows how different results can

16

Figure 9: Domestic market potential in Italian regions, 1871–1911 (constant 1911 prices,Italy=100).

Source: our own calculations.

emerge without controlling for transport costs. The third is total market potential, which

comprises all the main trading partners of Italy along with the Italian regions.

Let us start with the first version. Figure 9 and Table 6 provide the estimates of

domestic market potential for the Italian regions between 1871 and 1911.

The domestic market potential shows two main results. The first is that at the

beginning of the period the picture is quite in line with the classic North-South divide,

with the North showing a higher level of market access and the South lagging behind. The

exceptions in the South are Campania and Sicily. The reason why these two regions have

levels comparable to the Northern regions is that their total GDP at the beginning of the

period were comparable to those of the regions in the North and that these two regions

have sea-ports as their economic centre. This makes them able to exploit shipping, which

is cheaper than railways in this period. The second result is that there is a tendency

over the period for the market access of the North to worsen with respect to the South.

This tendency can be explained if we look at the transport costs in Tables 2, 3 and 4.

Shipping costs drop relatively faster in this period than railway costs, giving an advantage

to regions that have their node on the coast. This is the case of the main Southern regions.

A similar result, connected to the access to shipping, is discussed in Schulze (2007) on the

Habsburg Empire. In its case, the only two regions with access to the sea (Littoral and

Dalmatia) have a persistent advantage over the others in terms of market potential in spite

17

Table 6: Domestic market potential in the Italian regions, 1871–1911.

1911 million lire Italy=100

1871 1881 1891 1901 1911 1871 1881 1891 1901 1911

Piedmont 766 932 1264 1528 2552 120 118 114 113 109

Liguria 768 946 1385 1702 2973 121 119 123 126 127

Lombardy 908 1033 1496 1779 3094 142 130 133 131 132

Veneto 777 893 1274 1570 2716 122 113 114 116 116

Emilia 783 889 1270 1414 2550 123 112 114 104 109

Tuscany 733 863 1197 1340 2365 115 109 106 99 101

Marches 612 774 1115 1374 2343 96 98 99 101 100

Umbria 541 593 811 916 1588 85 75 72 68 68

Latium 560 728 1013 1182 2063 88 92 89 87 88

Abruzzi 509 608 826 951 1634 80 77 74 70 70

Campania 747 915 1298 1574 2712 117 115 116 116 116

Apulia 547 765 1105 1401 2352 86 96 98 103 101

Basilicata 336 536 746 874 1489 53 68 67 64 64

Calabria 512 701 998 1307 2234 80 88 89 96 96

Sicily 641 869 1228 1527 2593 101 110 108 113 111

Sardinia 457 642 952 1247 2139 72 81 85 92 91

Italy 637 793 1124 1355 2337 100 100 100 100 100


of their lower GDP. This relationship between sea access and market potential is clear when

trading partners are taken out of the sample. This is because Littoral and Dalmatia are

the only regions in the sample directly connected by sea to the foreign trading partners.

Taking trading partners out, the two regions no longer have such advantage. In Italy this

advantage is present even without the inclusion of trading partners because shipping is an

available option in internal trade as well.

The second set of estimates uses straight line distances between nodes. Figure 10

and Table 7 show the domestic market potentials when we take straight line distances

instead of transport costs. Here we see that the process of worsening market access

worsening in the North and improving in the South is not taking place. This version

of market access also shows a much wider gap between North and South, with most

Southern regions doing sensibly worse than is implied by the calculation adjusted for

transport costs. This version of market potential is the most similar to the one proposed

by A’Hearn and Venables (2011). The authors calculate market potentials using the

same GDP estimates and weight them by straight line distances. Their results show a

large gap between North and South: the authors claim that domestic market access was

the driving force for the location of industries in the period 1890–1950. Looking at the

18

Figure 10: Domestic market potential in Italian regions, 1871–1911 (constant 1911 prices,Italy=100, straight line distances).


difference between the domestic market potentials calculated with transport costs adjusted

distances and with straight line distances, the claim that market potentials were moving

in the same direction as GDP or industrial value added appears to be supported much

less by the former calculation.12

The third version proposed here, which we call total market potential, shows the market

potentials calculated including the main trading partners of Italy.13 Figure 11 and Table

8 show the results.

This version of market potential shows a quite different picture from the domestic one.

The South here appears not only to perform increasingly better in the period but also

as starting from a higher level than the North, which experiences the opposite evolution.

These estimates, which pool the GDP of Italian regions and that of very large trading

partners, are very much influenced by the GDP of the latter, in particular the US, which

has a very high GDP compared to Italy (see 12). Given that most trading partners are

reached at least partially by sea, the sea effect is even higher in this formulation of market

potential. Taking the US as one unit would force us to pick only one node and weight

the entire GDP of the US according to this node, which in our case is New York. This

12If we compared total market potential calculated with transport costs and with straight line distancesthe difference would look even more marked.

13The choice of trading partners to include is discussed in Section 3 above.

19

Table 7: Domestic market potential in the Italian regions, 1871–1911 (straight linedistances).


1871 1881 1891 1901 1911 1871 1881 1891 1901 1911

Piedmont 61 74 81 95 143 116 123 120 128 121

Liguria 70 78 95 106 177 134 130 138 142 150

Lombardy 78 86 105 121 194 149 144 154 163 164

Veneto 61 62 71 82 131 116 104 105 110 111

Emilia 73 81 97 103 172 140 135 142 138 146

Tuscany 63 71 81 85 139 120 119 118 114 117

Marches 47 53 61 65 102 90 89 90 88 86

Umbria 48 53 59 63 98 92 89 86 84 83

Latium 51 63 71 77 124 99 105 101 104 105

Abruzzi 45 52 56 59 93 87 86 82 80 78

Campania 67 73 82 85 134 129 122 121 114 113

Apulia 38 49 55 59 89 74 82 80 80 75

Basilicata 35 42 46 49 75 68 70 69 66 63

Calabria 31 38 40 43 67 60 64 59 58 57

Sicily 44 55 61 64 98 84 93 87 87 83

Sardinia 22 27 31 34 53 43 45 45 45 45

Italy 52 60 68 74 118 100 100 100 100 100


practically corresponds to assuming that the whole GDP of the US is as easily accessible

as if it was all located on the East Coast. This of course is far from the fact. This is why

we decided to consider the US as four separate macro regions (Northeast, Midwest, South

and West). Doing so allows us to reduce the role of the US in the calculation without

ignoring the presence of the US in the foreign markets.14

The next section move our focus to the relationship between these estimates and

regional economic development.

14The same problem could be posed by Austria and Hungary: we decided to keep them separate in allcalculations as in this period, it must be remembered that the two regions of the Habsburg Empire becamea dual monarchy in 1867 and therefore were quite economically independent.

20

Figure 11: Total market potential in Italian regions, 1871–1911 (constant 1911 prices,Italy=100).


Table 8: Total market potential in the Italian regions, 1871–1911.


1871 1881 1891 1901 1911 1871 1881 1891 1901 1911

Piedmont 3,976 4,739 7,020 10,609 15,071 108 93 88 85 86

Liguria 4,322 5,912 9,361 14,902 20,886 118 116 117 119 119

Lombardy 4,010 5,032 7,697 11,533 16,665 109 99 96 92 95

Venetia 3,900 5,506 8,775 13,763 19,136 106 108 110 110 109

Emilia 3,402 4,312 6,589 9,744 14,216 93 84 82 78 81

Tuscany 3,628 4,874 7,523 11,470 16,421 99 95 94 92 93

Marches 3,673 5,363 8,582 13,573 18,790 100 105 107 108 107

Umbria 3,028 4,136 6,408 9,744 13,895 82 81 80 78 79

Latium 3,452 4,877 7,601 11,812 16,775 94 96 95 94 95

Abruzzi 3,048 4,235 6,577 10,086 14,293 83 83 82 81 81

Campania 4,025 5,791 9,188 14,600 20,372 109 113 115 117 116

Apulia 3,673 5,445 8,709 13,869 19,193 100 107 109 111 109

Basilicata 2,934 4,230 6,588 10,177 14,418 80 83 82 81 82

Calabria 3,767 5,548 8,846 14,251 19,767 102 109 111 114 112

Sicily 4,159 6,036 9,515 15,404 21,391 113 118 119 123 122

Sardinia 3,837 5,646 9,027 14,640 20,340 104 111 113 117 116

Italy 3,677 5,105 8,000 12,511 17,602 100 100 100 100 100


21

Figure 12: Contribution of trading partners to total market potential, 1871–1911.


22

5 Market access and economic development: empirical

results

Section 4 presented market potential estimates for the Italian regions for five benchmark

years, between 1871 and 1911. In this section we explore the relationship between

market potential and regional GDP per capita, which represents a fundamental measure

of economic development. We then show the same model using industrial value added

per capita as the dependent variable, which represents the part of the GDP of a region

that comes from the secondary sector. To do so, we use both basic pooled OLS and first

difference OLS. The main explanatory variable in all specifications is market potential,

along with other geographic and economic controls. In the next section we show the results

on GDP per capita.

5.1 Market potential and GDP per capita

In this section we show the empirical results based on the model of section 2.2. Table 9

shows the OLS estimates of Equation 4. All years are pooled and standard errors are

heteroskedasticity-robust.15 Columns 1 and 2 show the basic pooled OLS regression with

GDP per capita explained by domestic and total market potential. Here there are no

controls. The relationship appears positive and significant at the 1% level for domestic

market potential as well as for total market potential. The coefficients are expressed in

logs and therefore they can be interpreted as elasticities. Domestic market potential has

an elasticity of 0.351 and total market potential has an elasticity of 0.235. The R2 is

about 10% higher for the specification with domestic market potential, which is near 0.5,

suggesting that the former has more explanatory power. The comparison with previous

literature appears challenging, because the existing works are at country level rather than

sub-national level. Redding and Venables (2004, p. 65) provide a coefficient of 0.146 for

domestic market potential and 0.395 for total market potential. Head and Mayer (2011,

p. 289) find a coefficient of 0.80 for the specification without specific country controls.

As admitted by the authors, the difference in the magnitude of the coefficients is more

than double that of Redding and Venables (2004) and “this difference in the coefficients

stems mainly from the different construction of the market potential variable”. Therefore,

differences in the type of sample, in the historical periods and in the techniques used to

calculate market potentials allow for very different magnitudes in the coefficients without

affecting the validity of the different works. In columns 3 and 4 of Table 9 we add year

15The pooling of the four available years (1871, 1881, 1901 and 1911) is necessary in order to increasethe number of observations. This procedure is the one followed by Head and Mayer (2011).

23

dummies to take into account the fact that the data are pooled across years. In this

version, the results change: domestic market potential increases its elasticity to 0.544

significant at 1% while total market potential is not significant in this case. The next step

is to introduce further controls to capture differences across regions. The first candidate is

physical geography. In columns 5 and 6 we introduce a dummy for belonging to different

macro areas.16 In the regression, we decided to use the North-West as our baseline and

include the dummies corresponding to the other two macro areas. The result is that

the domestic market potential decreases its coefficient size and level of significance while

the total market potential moves in the opposite direction. Both columns have market

potentials significant at the 5% level while the dummy South is highly significant with

a negative sign. The North-East-Centre is also negative but with a considerably smaller

coefficient. The negative signs on both macro areas are not surprising, since we chose the

richest macro area as the baseline. The fact that being in the South has the strongest

effect is also expected because of the generally poorer conditions of Southern regions in

this period. The use of these macro areas should be considered in light of the types of

difference we want to capture. If the aim is to describe geographic differences across

Italy, avoiding other elements that could be endogenous to GDP, the imposition of these

macro borders is not necessarily the best option in spite of the strong results. This is

because macro areas are often designed ex-post to collect regions with similar economic

conditions.17 In order to avoid this endogeneity problem, columns 7 and 8 use latitude

as the geographic control. Latitude is a control that is completely exogenous, being the

distance from the Equator, and has a positive effect on GDP per capita. The result is that

both coefficients are around 0.4, with domestic market potential significant at the 1% and

total market potential at 5%. The R2 decreases slightly compared to that in columns 6

and 7, but are very much higher than the specification without geographic controls. From

now on, whenever we decide to include a control for the geographic position of the regions

we will use latitude.

The previous table aimed at exploring the relationship between market potential and

GDP per capita using controls that are possibly not endogenous to GDP per capita.

In Table 10 we try to introduce further controls and discuss whether or not these are

appropriate in our case. The first control that we can think of including, to capture

16The classification of regions in the three different macro areas is taken from Felice (2007) and was laterused in his other works. The Northwest includes Piedmont, Lombardy and Liguria; the Northeast-Centreincludes Venetia, Emilia, Marches, Tuscany, Umbria and Latium; the South comprises Campania, Apulia,Basilicata, Calabria, Sicily and Sardinia.

17See for example the inclusion of the North-East in the same macro area of the Centre and the decisionto keep the regions of the Industrial Triangle separate from the rest.

24

Table 9: GDP per capita and market potential, 1871–1911 (pooled OLS regression withgeographic controls).

GDPpc (1) (2) (3) (4) (5) (6) (7) (8)

Log Domestic MP 0.351*** 0.544*** 0.217** 0.431***

(0.0452) (0.0853) (0.0939) (0.0929)

Log Total MP 0.235*** 0.191 0.306** 0.411**

(0.0450) (0.194) (0.115) (0.172)

Northeast-Centre -0.142** -0.163***

(0.0566) (0.0597)

South -0.315*** -0.389***

(0.0535) (0.0406)

Latitude 0.0207** 0.0452***

(0.00801) (0.00961)

Constant -1.189 0.768 -5.049*** 1.773 1.760 -0.529 -3.634** -4.983

(0.941) (1.022) (1.732) (4.275) (1.920) (2.545) (1.792) (4.070)

Year Fixed Effects no no yes yes yes yes yes yes

Region Fixed Effects no no no no no no no no

Observations 64 64 64 64 64 64 64 64

R2 0.491 0.309 0.537 0.376 0.667 0.671 0.563 0.519

Notes: Heteroskedastic robust standard errors in parentheses. * , ** and ***correspond to a coefficient significantly different from zero with a 10%, 5% and 1%confidence level respectively. The dependent variable is the log of GDP per capita in1911 prices. Market potentials are in 1911 prices.

other fundamental variables that can affect GDP per capita is some measure of human

capital. In modern studies, such as Head and Mayer (2011), average years of schooling

is the best variable to use. For the case of pre-First World War Italy we decided to use

a more basic output measure that is often used to capture human capital in the Italian

case: literacy rates. When literacy rates are introduced in the regression, they are highly

significant with a coefficient of around 0.6 in both specifications. Market potentials loose

their significance and latitude is incorrectly signed. These results clearly suffer from a bias.

The reason why the plain inclusion of this variable is problematic is two-fold: literacy is

endogenous to GDP per capita and it is also collinear to other variables that present a

North-South gradient.18 This is, for instance, the case of domestic market potential, which

has a geographic distribution that appears very similar to that of literacy rates. In this

case, it is very hard to disentangle the effect of literacy from that of market potential;

moreover, it is very hard to establish the direction of the causality between literacy and

GDP per capita. Columns 3 and 4 show the inclusion of the share of arable land in 1871

as a control.19 Including arable land at the beginning of the period is a way to avoid

18We also tried to included similar controls such as the share of labour force in agriculture; the problemsencountered in the estimation were very similar and we decided to deal with literacy rates only in theestimation to avoid making the issue even more problematic by including further controls.

19The share of arable land is also used as a control by Redding and Venables (2004). The authorshere use a number of other controls, such as fraction of land in tropical areas, prevalence of malaria and

25

Table 10: GDP per capita and market potential, 1871–1911 (pooled OLS regression withfurther controls).

Log GDPpc (1) (2) (3) (4) (5) (6) (7) (8)

Log Domestic MP -0.0531 0.431*** 0.430*** 0.430**

(0.125) (0.0843) (0.127) (0.181)

Log Total MP -0.155 0.335** -0.206 -0.206

(0.158) (0.162) (0.212) (0.286)

Log Literacy 0.623*** 0.640***

(0.118) (0.0998)

Latitude -0.0522*** -0.0595*** 0.0241*** 0.0468***

(0.0148) (0.0178) (0.00825) (0.00961)

Log Share of Arable Land 1871 -0.192*** -0.153**

(0.0513) (0.0627)

Liguria 0.165** 0.245*** 0.165*** 0.245***

(0.0665) (0.0816) (0.0127) (0.0695)

Lombardy -0.0766 0.00284 -0.0766** 0.00284

(0.0773) (0.0588) (0.0280) (0.0181)

Venetia -0.262*** -0.223*** -0.262*** -0.223***

(0.0690) (0.0766) (0.00276) (0.0451)

Emilia -0.153** -0.184*** -0.153*** -0.184***

(0.0680) (0.0488) (0.00469) (0.0282)

Tuscany -0.0989* -0.129*** -0.0989*** -0.129***

(0.0517) (0.0459) (0.0149) (0.00718)

Marches -0.269*** -0.308*** -0.269*** -0.308***

(0.0508) (0.0454) (0.0273) (0.0366)

Umbria 0.00337 -0.188*** 0.00337 -0.188***

(0.0659) (0.0489) (0.0682) (0.0411)

Latium 0.388*** 0.283*** 0.388*** 0.283***

(0.0605) (0.0433) (0.0467) (0.00728)

Abruzzi -0.269*** -0.483*** -0.269*** -0.483***

(0.0763) (0.0638) (0.0796) (0.0345)

Campania -0.175*** -0.127* -0.175*** -0.127**

(0.0610) (0.0704) (0.00215) (0.0597)

Apulia -0.128* -0.174** -0.128*** -0.174***

(0.0705) (0.0659) (0.0318) (0.0407)

Basilicata -0.201** -0.494*** -0.201* -0.494***

(0.0922) (0.0483) (0.112) (0.0360)

Calabria -0.366*** -0.437*** -0.366*** -0.437***

(0.0649) (0.0599) (0.0443) (0.0480)

Sicily -0.204*** -0.177** -0.204*** -0.177**

(0.0631) (0.0720) (0.0106) (0.0723)

Sardinia -0.153** -0.251*** -0.153** -0.251***

(0.0596) (0.0752) (0.0575) (0.0545)

Constant 7.257*** 9.838** -3.100* -2.845 -2.631 10.66** -2.631 10.66

(2.631) (3.856) (1.564) (3.856) (2.570) (4.661) (3.689) (6.287)

Year Fixed Effects yes yes yes yes yes yes yes yes

Region Fixed Effects no no no no yes yes yes yes

Region Clustering no no no no no no yes yes

Observations 64 64 64 64 64 64 64 64

R2 0.760 0.763 0.599 0.541 0.955 0.948 0.955 0.948

Notes: Heteroskedastic robust standard errors in parentheses. * , ** and ***correspond to a coefficient significantly different from zero with a 10%, 5% and 1%confidence level respectively. The dependent variable is the log of GDP per capita in1911 prices. Market potentials are in 1911 prices. Literacy is expressed as rate.

26

endogeneity as the share of arable land in subsequent years may have been affected by

changes in the economic or technological conditions of the regions. The share of arable

land is significant and negatively signed in both specifications. The negative sign can be

explained by interpreting the share of arable land as the presence of agricultural activity

in a region. Since agriculture has lower value added levels than industries, leading to lower

levels of GDP per capita, it has a larger weight in the regional economy. Here latitude is

correctly signed and significant again, suggesting that endogeneity and multicollinearity

are less of an issue in this specification. Columns 4–8 take a different approach to the

specification. In the last four columns we introduce region fixed effects. This strategy is

also used by Head and Mayer (2011). However, in their case the inclusion of these dummies

leads to much less of an increase in the R2 than our case shows. Here, region fixed effects

lead to an increase from about 0.5 to values well over 0.90. This increase is explained by

the number of regional dummies that are highly significant. The baseline is Piedmont, a

fairly rich Northern region. This is why most other significant regional dummies have a

negative sign (with the exceptions of Liguria and Latium which do show quite high levels

of GDP per capita even compared to Piedmont in this period). Southern regions all have

negative signs, which is expected given the lower levels of GDP per capita in this part of

the country. The high explanatory power of these fixed effects suggests that the inclusion

of any further control other than market potential should be dropped in order to avoid

the risk of over-identifying the model. For this reason, given the high share of variation

explained, we assume that the region fixed effects capture a sufficient part of the regional

differences (although they are not able to disentangle them) and we do not include further

controls. Columns 6 and 7 show the inclusion of fixed effects without regional clustering,

while Columns 8 and 9 show the results with clustering. Market potential here shows

similar results to the previous specification: domestic market potential is positive and

significant with an elasticity of 0.430; total market potential is not significant.

Summing up the results of the first two tables, we notice that domestic market potential

is significant and correctly signed in all specifications except when literacy rates are

introduced while total market potential is not significant in a number of specifications,

most notably when fixed effects are used, appearing less robust as the explanatory variable

for GDP.

We now move to the issues of endogeneity and multicollinearity that affect the model

when literacy rates are introduced. On the first point, Head and Mayer (2011) attempt

risk of expropriation that appear sensible when doing a cross-country analysis but would lead to difficultquantification or have little meaning in the case of Italian regions in this period.

27

two different approaches. First, they substitute total market potential with foreign market

potential, which is market potential calculated with own GDP removed.20 This solution

has been attempted for the case of the Italian regions but the results do not hold.

According to Head and Mayer (2011, p. 291),“[foreign market potential] has nice features

[in dealing with endogeneity], but is clearly not ideal as a replacement or instrument

for RMP”. The solution that Head and Mayer (2011) adopt in their work is to use an

instrument for market potential. In particular, they take geographic centrality proxied

by two instruments: the sum of the inverse of straight line distances and the inverse of

transport costs. In their case the former is ruled out and the solution adopted is to use the

latter. Unfortunately, this instrument does not appear to work for the Italian regions in

this period. Looking at the levels of our variables, in the absence of reliable instruments,

we cannot establish the direction of causality between GDP per capita, domestic market

potential and literacy. This is evident when looking at the geographical distribution of the

levels in Figures 1, 2 and 10, where the North-South gradient appears clear. One solution

here is to move our focus from levels to changes in the time of the variables of interest.

By transforming all the logs of the variables in first differences we can effectively interpret

the model in terms of growth rates. Running the model in first differences allows us to

address the issue of multicollinearity, since the common long-term trend in the variables

is not present in the growth rates.21

Table 11 shows the basic formulation without region or geographic controls in Columns

1 and 2. The result is a positive and significant coefficient for the domestic market potential

and very low and insignificant coefficients on the total market potential.22 Columns 3–8

replicate the main specifications proposed so far: latitude as control with literacy rates first

and then with the share of arable land and, in the last two columns, region fixed effects

as only controls. Unlike the results with levels, first difference regressions show little

effect from the controls, while domestic market potentials are always correctly signed and

significant with a coefficient around 0.8 in all the specifications. Total market potential

does not appear significant in any specification.

20This leaves us effectively with a market access measured only accounting for the GDP of foreigncountries in the sample, ignoring the internal market.

21In particular, if we look at Figures 1, 2 and 10, we notice that literacy rates and domestic marketpotential in the South tend to converge with those of the North while GDP per capita tends to diverge.

22The coefficients here can be interpreted as percentage point increases in growth rate of GDP percapita when an explanatory variable increases by 1%. Therefore, in column 1, if the growth rate ofdomestic market access increases by 1%, the growth rate of GDP per capita increases by 0.649%.

28

Table 11: GDP per capita, market potential and literacy rates, 1871–1911 (pooled OLSregression in first differences).

Log GDPpc (1) (2) (3) (4) (5) (6) (7) (8)

DLog Domestic MP 0.649*** 0.849*** 0.816*** 0.883**

(0.206) (0.235) (0.249) (0.345)

DLog Total MP 0.0557 0.0731 0.111 0.370

(0.332) (0.399) (0.373) (0.489)

DLog Literacy 0.0645 -0.245

(0.272) (0.304)

Latitude 0.0124 -0.00195 0.0112* 0.00296

(0.00812) (0.0102) (0.00612) (0.00761)

Share of Arable Land 1871 -0.000347 -0.00146

(0.00109) (0.00127)

Constant -0.0764 0.258** -0.724 0.392 -0.629* 0.167 -0.133 0.216

(0.113) (0.114) (0.449) (0.575) (0.335) (0.395) (0.190) (0.129)


Region Fixed Effects no no no no no no yes yes

Observations 48 48 48 48 48 48 48 48

R2 0.619 0.520 0.651 0.531 0.651 0.532 0.696 0.592

Notes: Heteroskedastic robust standard errors in parentheses. * , ** and ***correspond to a coefficient significantly different from zero with a 10%, 5% and 1%confidence level respectively. The dependent variable is the log of GDP per capita in1911 prices. Market potentials are in 1911 prices. Literacy is expressed as rate. Allvariables are in first differences.

5.2 Market potential in the North and South

Tables 12 and 13 show our model splitting the sample in the North (which merges

the North-West and North-East-Centre) and the South.23 As a result, the number of

observations goes down to 21 for the South and 27 for the North. Let us start with the

South. In Table 12 we notice that the level of GDP per capita in Southern regions is

much more clearly affected by market potential when the regression is in levels (Columns

1–4) than in first differences (Columns 5–8). This is the case both without geographic

controls and when we include latitude and share of arable land (Columns 3–4). Moreover,

the coefficients are slightly higher for total market potential than for the domestic one.

In first differences, the results do not hold, suggesting that after Unification market

potentials do nothing to explain the growth rates of GDP per capita.

The picture looks quite different in Table 13, where the sample is restricted to

the northern regions. In this case, total market potential is never significant while

domestic market potential is significant in both levels and first differences, with fairly

23Venetia, Lombardy, Piedmont, Liguria, Emilia, Tuscany, Latium, Umbria and Marches are consideredNorth; the rest of the regions are considered South as in footnote 16.

29

Table 12: GDP per capita and market potential, southern regions, 1871–1911 (pooled OLSregression and first differences).

Log GDPpc (1) (2) (3) (4) (5) (6) (7) (8)

Log Domestic MP 0.514*** 0.642***

(0.0647) (0.0996)

Log Total MP 0.624*** 1.030***

(0.104) (0.176)

Latitude 0.0361** 0.0589*** 0.00476 0.00591

(0.0146) (0.0164) (0.0111) (0.0106)

Log Share of Arable Land 1871 -0.132** 0.0298

(0.0539) (0.0508)

DLog Domestic MP 0.617** 0.489

(0.260) (0.302)

DLog Total MP 1.316 0.976

(0.914) (1.033)

DLog Literacy 0.539 0.427 0.523 0.446

(0.437) (0.419) (0.455) (0.435)

Share of Arable Land 1871 -0.00205 -0.00291

(0.00205) (0.00221)

Constant -4.486*** -7.906*** -8.046*** -19.31*** -0.227 -0.303 -0.269 -0.324

(1.295) (2.266) (2.203) (4.336) (0.212) (0.324) (0.539) (0.608)


Observations 28 28 28 28 21 21 21 21

R2 0.824 0.750 0.866 0.870 0.700 0.652 0.719 0.697

Notes: Heteroskedastic robust standard errors in parentheses. * , ** and ***correspond to a coefficient significantly different from zero with a 10%, 5% and 1%confidence level respectively. The dependent variable is the log of GDP per capitain 1911 prices. Market potentials are in 1911 prices. Literacy is expressed as rate.Columns 5–8 show the variables in first differences.

high coefficients. Market potential is therefore a stronger predictor of GDP per capita

within northern regions compared to the South.

Considering the North and South of Italy separately brings some interesting insights.

The effect of market access on the GDP per capita of these two different parts of the

country is not the same: in the South, both domestic and total market potential have a

strong role in determining the levels of GDP per capita but they do not seem to impact

on the growth rates. For the North, only the domestic market potential is significant in

both in levels and in first differences. This difference suggests that the results at national

level are driven by the Northern part of the country, where the access to markets outside

Italy was often more expensive because of the presence of rich but landlocked regions.

The next section moves from GDP to industrial production, repeating the exercise

using industrial value added per capita as the dependent variable.

5.3 Market potential and industrial value added per capita

The growth of the industrial output is considered a strong driver of economic growth for

Italy in this period. One further question we can address is how far the factors that explain

30

Table 13: GDP per capita and market potential, northern regions, 1871–1911 (pooled OLSregression and first differences).

Log GDPpc (1) (2) (3) (4) (5) (6) (7) (8)

Log Domestic MP 0.120 1.458***

(0.210) (0.212)

Log Total MP 0.194 0.0767

(0.275) (0.259)

Latitude -0.221*** -0.0517 0.00255 0.0000621

(0.0366) (0.0325) (0.0181) (0.0283)

Log Share of Arable Land 1871 -0.318*** -0.400***

(0.0889) (0.107)

DLog Domestic MP 1.552*** 1.586***

(0.338) (0.343)

DLog Literacy -0.241 -0.109 -0.356 -0.176

(0.291) (0.354) (0.451) (0.584)

DLog Total MP -0.146 -0.128

(0.449) (0.515)

Share of Arable Land 1871 0.00152 0.000694

(0.00206) (0.00238)

Constant 3.632 1.811 -12.75*** 8.108 -0.530*** 0.355** -0.698 0.332

(4.298) (6.063) (3.062) (5.572) (0.179) (0.167) (0.813) (1.352)

Observations 36 36 36 36 27 27 27 27

R2 0.450 0.453 0.804 0.633 0.762 0.569 0.770 0.570

Notes: Heteroskedastic robust standard errors in parentheses. * , ** and ***correspond to a coefficient significantly different from zero with a 10%, 5% and 1%confidence level respectively. The dependent variable is the log of GDP per capita in1911 prices. Market potentials are in 1911 prices. Literacy is expressed as rate.

GDP per capita can also explain industrial value added per capita. Let us start with

Table 14, in which the main OLS regressions in levels are reported. Columns 1 and 2 show

the baseline model with year dummies as the only controls. Here domestic market potential

is positive and significant at 1% while total market potential is significant at the 5% level.

The elasticities are considerably higher than the ones for GDP per capita. Columns 3 and

4 introducing the dummy for latitude show the same result; Columns 5 and 6 report the

specification with the share of arable land and latitude together. It appears that in all

specifications, both domestic and total market potentials are positive and significant with

high coefficients (between 0.6 and 1) and a 1% level of significance. However, the results

do not hold when fixed effects are introduced. This last two Columns provide a counter

intuitive result for total market potential, which is negative and significant. To investigate

further, we turn our attention to Table 15, where the same exercise is repeated in first

difference. Here again the results on market potentials do not hold. As in the case of the

GDP per capita of Southern regions, market potential seems to possibly explain levels of

GDP per capita but not the changes from period to period. The next section sums up the

results.

31

Table 14: Industrial value added per capita and market potential, 1871–1911 (pooled OLSregression).

Log Ind VA pc (1) (2) (3) (4) (5) (6) (7) (8)

Log Domestic MP 1.136*** 0.997*** 0.997*** -0.369

(0.142) (0.153) (0.151) (0.240)

Log Total MP 0.651** 1.059*** 1.022*** -0.835***

(0.272) (0.207) (0.214) (0.214)

Latitude 0.0255** 0.0840*** 0.0289** 0.0847***

(0.0110) (0.0132) (0.0116) (0.0135)

Log Share of Arable Land 1871 -0.193*** -0.0741

(0.0656) (0.0885)

Constant -18.96*** -10.30* -17.22*** -22.84*** -16.68*** -21.80*** 11.84** 22.62***

(2.897) (5.989) (2.950) (4.779) (2.842) (4.963) (4.895) (4.707)



Observations 64 64 64 64 64 64 64 64

R2 0.801 0.538 0.817 0.742 0.832 0.744 0.953 0.957

Notes: Heteroskedastic robust standard errors in parentheses. * , ** and ***correspond to a coefficient significantly different from zero with a 10%, 5% and 1%confidence level respectively. The dependent variable is the log of the industrial valueadded per capita in 1911 prices. Market potentials are in 1911 prices. Literacy isexpressed as rate.

Table 15: Industrial value added per capita, market potential and literacy rates, 1871–1911(pooled OLS regression in first differences.

Log Ind VA pc (1) (2) (3) (4) (5) (6) (7) (8)

DLog Domestic MP -0.0138 0.378 0.281 0.406*

(0.195) (0.251) (0.249) (0.216)

DLog Total MP -0.291 0.268 0.275 0.565*

(0.253) (0.304) (0.281) (0.323)

Latitude 0.0244*** 0.0223*** 0.0248*** 0.0241***

(0.00615) (0.00658) (0.00605) (0.00614)

Share of Arable Land 1871 -0.00268* -0.00307**

(0.00144) (0.00136)

Constant 0.389*** 0.481*** -0.980*** -0.888** -0.722* -0.621* 0.141** 0.311***

(0.107) (0.0878) (0.299) (0.343) (0.362) (0.329) (0.0519) (0.0849)

Year Fixed Effects no no yes yes yes yes yes yes


Observations 48 48 48 48 48 48 48 48

R2 0.483 0.492 0.622 0.603 0.655 0.648 0.796 0.792

Notes: Heteroskedastic robust standard errors in parentheses. * , ** and **correspond to a coefficient significantly different from zero with a 10%, 5% and 1%confidence level respectively. The dependent variable is the log of industrial valueadded per capita in 1911 prices. Market potentials are in 1911 prices. Literacy isexpressed as rate. All variables are in first differences.

32

6 Conclusions

This paper presented the estimates of market potentials of the Italian regions for 10-year

benchmarks in the period 1871–1911 and used market potential to explain GDP per capita

and industrial value added per capita. Market potentials are based on constant 1911 price

estimates of regional GDP for Italy from Felice (2009) and Brunetti et al. (2011) and the

GDP of the main trading partners of Italy in the period from 1871 to 1911 are from Prados

de la Escosura (2000) and Crafts (2005). In the calculation of market potentials, the GDP

are weighted by distance-adjusted transport costs to take into account the actual distance

among regions in terms of the costs of transporting one unit of a representative good.

In the case of foreign partners a distance-equivalent tariff is calculated and added to the

transport cost. In this paper we proposed different specifications ranging from domestic

market potential, which comprises Italian regions only to total market potential that

includes all Italian regions and all the main trading partners. The bottom line of these

calculations is the following: domestic market potentials at the beginning of the period

show a more traditional picture of Italy, with the North presenting higher values than the

South, in particular at the beginning of the period. What contradicts the North-South

traditional image is that North and South converge. This is explained by the relative

cost of shipping, which goes down, unlike that of railways, over the period. This is an

advantage for the main Southern regions such as Campania and Sicily, which have good

access to the sea along with a fairly high total GDP, while Piedmont and Lombardy for

instance do not. The role of transport costs is easily appreciated when we compare the

same calculation with straight line distances: the picture is reversed, showing the strong

and consistent position of the Northern regions. These comparisons show how useful it

is, at least for the case of Italy and other countries with more than one viable mean of

transportation, to correctly account for transport costs and existing lines. Introducing

foreign markets in the total market potential, the results change for the first year, but the

trend in time goes in the same direction. The North has no advantage in terms of market

access at the beginning of the period and, as for the domestic one, it worsens its position

compared to the South.

Once the estimates of market potentials have been produced, we used them as

explanatory variables for the main measure of economic performance: GDP per capita.

We first proposed a baseline specification in levels and we then added further controls. In

the specifications in levels, we found that domestic market potential has a stronger and

more significant positive effect on GDP per capita than total market potential. These

33

findings are in line with our expectations from the descriptive analysis of the data.

However, the inclusion of economic controls such as literacy rates cause concerns over

the multicollinearity and endogeneity that could be affecting the results. To deal with

these issues, we ran the model in first differences. Taking the first differences of the logs

corresponds to considering the variables in growth rates. This procedure leaves out the

common trends that the variables might present in the long run and restricts the focus to

the change from one benchmark year to another. The model in first differences confirms

that the domestic market potential is positive and significant when trying to explain the

growth rate of GDP per capita of the Italian regions.

The next step was to divide the sample into North and South, running the model

separately. It is quite useful to look at different parts of the country separately whenever

the number of observations allows us to do so. In this case it is especially useful to separate

levels and growth rates as the main result here is that total market potential determined

the level of GDP per capita in the South but it did not in the North. Moreover, growth

rates in market potential of both types do not affect the growth rates of GDP per capita

in the South but they do in the North when we consider domestic market potential. This

insight tells us that in the South, the total market potential was positively correlated to

the levels of GDP per capita but its growth rate did not drive the growth rate of GDP

per capita. In the rest of the country, both the level and the growth rates of total market

potential are not significant in explaining GDP per capita. This suggests that the access

to international markets did matter more for the South compared to the rest of Italy.

This result is in line with the view that the Southern economy was, until the invasion of

agricultural products from the US in the late 19th, quite open to international trade. A well

known study by Morilla et al. (1999, pp. 333–337) takes into account the case of southern

Italy as exporter of high value added agricultural products. The authors give the example

of citrus production in Sicily, which supplied 95% of lemons and 16% of oranges consumed

in the US in 1890. However, the trend over the period before the First World War was

negative for the southern Italian exporters because of the competition from California. We

believe that this decline in exports is informative on the first difference regressions results

on total market potential. Finally, we ran the model in levels and first differences, using

industrial value added per capita as the dependent variable. Here the main result was

that both domestic and total market potentials were positive and significant in explaining

the levels of industrial value added per capita, but the results were not robust to the use

of fixed effect and first differences. This suggests that market potential, in its domestic

34

formulation, is a more clear predictor of GDP per capita than industrial value added per

capita.

The findings of this paper contribute to the existing literature in two ways. First of

all, the paper suggests that the quantification of market access can be quite sensitive to

the inclusion of trading partners as well as to the proper quantification of transport costs

rather than simple geographical distances. Furthermore, the different role of domestic

and international market potentials challenges the idea that the South of Italy has been

penalized by its geographical position in terms of access to the international markets as

suggested for instance by Daniele and Malanima (2007).

35

Primary Sources

[1] Bureau of Railway Economics, Comparison of railway freight rates in the United

States, the principal countries of Europe, South Australia, and South Africa.

Washington, 1915.

[2] Ferrovie delo Stato (FS), Tariffe e condizioni pei trasporti sulle Ferrovie dello Stato.

Roma, 1912.

[3] Ferrovie delo Stato (FS), Sviluppo delle ferrovie italiane dal 1839 al 31 dicembre

1926. Roma, 1927.

[4] Istituto Nazionale di Statistica (ISTAT), Annuario statistico italiano, various years.

[5] Ministero di agricoltura, industria e commercio (MAIC), Relazione sulla condizione

dell’agricoltura nel quinquiennio 1870–1874. Roma, 1876

[6] Ministero di agricoltura, industria e commercio (MAIC), Censimento generale al 31

dicembre 1871. Roma, 1874.

[7] Ministero di agricoltura, industria e commercio (MAIC), Censimento della

popolazione del Regno d’Italia al 31 dicembre 1881. Roma, 1883.


popolazione del Regno d’Italia al 10 febbraio 1901. Roma, 1902.


popolazione del Regno d’Italia al 10 giugno 1911. Roma, 1914.

References

A’Hearn, B. (2000): “Could Southern Italians Cooperate? Banche Popolari in the

Mezzogiorno,” Journal of Economic History, 60, 67–93.

A’Hearn, B., C. Auria, and G. Vecchi (2011): In ricchezza e in poverta, Il Mulino,

Bologna, chap. Istruzione.

A’Hearn, B. and A. Venables (2011): “Internal Geography and External Trade:

regional disparities in Italy, 1861-2011,” Economic History Working Papers 12, Bank of

Italy.

Baffigi, A. (2011): “Italian National Accounts, 1861-2011,” Economic History Working

Papers 18, Bank of Italy.

Brunetti, A., E. Felice, and G. Vecchi (2011): In ricchezza e in poverta, Il Mulino,

Bologna, chap. Reddito.

Cafagna, L. (1989): Dualismo e sviluppo nella storia d’Italia, Marsilio, Padova.

Capie, F. (1994): Tariffs and Growth: Some Illustrations from the World Economy,

1850-1940, Manchester University Press, Manchester.

Crafts, N. (2005): “Market potential in British regions, 1871-1931,” Regional Studies,

39, 1159–1166.

Daniele, V. and P. Malanima (2007): “Il prodotto delle regioni italiane e il divario

nord-sud in Italia, 1861-2004,” Rivista di Politica Economica, 47, 267–315.

Estevadeordal, A., B. Frantz, and A. M. Taylor (2002): “The Rise and Fall of

World Trade, 1870-1939,” Working Papers 9318, NBER.

Felice, E. (2007): Divari regionali e intervento pubblico: per una rilettura dello sviluppo

in Italia, Il Mulino, Bologna.

——— (2009): “Estimating regional GDP in Italy (1871-2001): sources, methodology and

results,” Working Papers in Economic History 0907, Universidad Carlos III.

——— (2013): “Regional income inequality in Italy in the long run (1871-2001). Patterns

and determinants,” UHE Working Papers 08, Universitat Autonoma de Barcelona.

37

Felice, E. and M. Vasta (2012): “Passive Modernization? The new Human

Development Index and its Components in Italy’s Regions (1871-2007),” UHE Working

Papers 10, Universitat Autonoma de Barcelona.

Fenoaltea, S. (2003): “Peeking Backward: Regional Aspects of Industrial Growth in

Post-Unification Italy,” Journal of Economic History, 63, 1059–1102.

——— (2006): L’economia italiana dall’Unita alla Grande Guerra, Collezione storica,

Laterza, Bari.

Ferreres, O. (2005): Dos siglos de economıa argentina, 1810-2004: : historia argentina

en cifras, Fundaciøn Norte y Sur, Buenos Aires.

Galassi, F. (2000): “Measuring Social Capital: Culture as an Explanation of Italy’s

Economic Dualism,” Department of economic working papers, University of Warwick.

Hanson, G. (2005): “Market potential, increasing returns and geographic concentration,”

Journal of International Economics, 67, 1–24.

Harris, C. D. (1954): “The Market as a Factor in the Localization of Industry in the

United States,” Annals of the Association of American Geographers, 44, 315–348.

Head, K. and T. Mayer (2004): “Market Potential and the Location of Japanese

Investment in the European Union,” Review of Economics and Statistics, 86, 959–972.

——— (2011): “Gravity, market potential and economic development,” Journal of

Economic Geography, 11, 281–294.

Kaukiainen, M. (2003): “How the price of distance declined: ocean freights for grain and

coal from the 1870s to 2000,” Department of history, University of Helsinki, MIMEO.

Klein, A. (2009): “Personal Income of U.S. States: Estimates for the Period 1880–1910,”

Economics Research Paper Series (TWERPS) 916, University of Warwick.

Klein, A. and N. Crafts (2012): “Making sense of the manufacturing belt:

determinants of U.S. industrial location, 1880–1920,” Journal of Economic Geography,

12, 775–807.

Martinez-Galarraga, J. (2012): “The determinants of industrial location in Spain,

1856–1929,” Explorations in Economic History, 49, 255–275.

38

Midelfart, K., H. Overman, S. Redding, and A. Venables (2000): “The

Location of European Industry,” European Economy Economic Papers 142, European

Commission.

Missiaia, A. (2014): “Industrial location, market access and economic development:

regional patterns in post-unification Italy,” PhD Thesis, London School of Economics

and Political Science.

Mitchell, B. (1988): British Historical Statistics, Cambridge University Press,

Cambridge.

——— (2003): International historical statistics: Europe, 1750-2000, Palgrave Macmillan,

New York.

Morilla, C., A. Olmstead, and P. Rhode (1999): “Horn of Plenty: The

Globalization of Mediterranean Horticulture and the Economic Development of

Southern Europe, 1880-1930,” Journal of Economic History, 59, 316–352.

Noyes, W. (1905): American railroad rates, Little, Brown and Company, New York.

Prados de la Escosura, L. (2000): “International Comparisons of Real Product, 1820-

1990: an Alternative Data Set,” Explorations in Economic History, 37, 1–41.

Redding, S. and A. J. Venables (2004): “Economic geography and international

inequality,” Journal of International Economics, 62, 53–82.

Schulze, M. (2007): “Regional income dispersion and market potential in the late

nineteenth century Hapsburg Empire,” Working Paper 106-07, London School of

Economics.

Sonnino, S. (1877): La Sicilia nel 1876, Barbera, Firenze, chap. I contadini in Sicilia.

Villari, P. (1979): Le Lettere Meridionali ed Altri Scritti sulla Questione Sociale in

Italia, Guida, Napoli.

Wolf, N. (2007): “Endowments vs. market potential: What explains the relocation of

industry after the Polish reunification in 1918?” Explorations in Economic History, 44,

22 – 42.

Zamagni, V. (1978): Industrializzazione e squilibri regionali in Italia: bilancio dell’eta

giolittiana, Il Mulino, Bologna.

39

——— (1990): Dalla periferia al centro: la seconda rinascita economica dell’Italia (1861-

1990), Il Mulino, Bologna.

40

working paper series - scuola superiore sant'anna · model. daniele and malanima (2007, p....

Documents