on persistent poverty in a rich country · of persistent poverty status at the close of the 20th...

On Persistent Poverty in a Rich Country

T. M. Tonmoy Islam Center for Poverty Research

and Department of Economics

University of Kentucky [email protected]

Jenny Minier

Department of Economics University of Kentucky

[email protected]

James P. Ziliak Center for Poverty Research

and Department of Economics

University of Kentucky [email protected]

September 2, 2010

PRELIMINARY

Abstract: Despite robust economic growth in the post-War era, there remain deep pockets of persistent poverty in the United States. We examine the origins of permanent income differences across counties using contemporaneous and historical Census data. Using a panel of U.S. counties from 1960 to 2000 and a long-difference dynamic panel-data estimator, we first test whether aggregate production technologies and rates of income convergence are the same across persistently poor and non-poor regions. We then use data from the late 19th and early 20th centuries to estimate the relative roles of culture, geography, institutions, and human capital in accounting for residual income growth differences across counties. We find evidence of significant regional differences in production technologies, but rates of convergence, which range from 7 to 9 percent, do not differ significantly between poor and non-poor regions. Illiteracy rates at the close of the 19th century are a key determinant of persistent poverty status at the close of the 20th century, suggesting long-term negative consequences for low initial human capital among persistently poor counties.

1

Between 1960 and 2000 real income per capita in the United States grew 175 percent and

aggregate poverty rates fell by half, from 22 percent to just over 11 percent. In spite of this

economic progress, poverty remains persistently high in several regions of the country,

suggesting that the gains from growth have not been realized equally. Figure 1 depicts counties

in which poverty rates exceeded 20 percent in each Decennial Census since 1960, which is the

definition of persistent poverty adopted by the USDA.1 The figure reveals that there are five

persistently poor regions of the country, encompassing 11 percent of all U.S. counties:

Appalachian Kentucky, the “Black Belt” region spanning the Carolinas to Alabama, the

Mississippi Delta region, the Texas “colonias” along the Rio Grande River, and Native American

reservations in the four corners states and the Dakotas. The difference in average per capita

incomes between these counties and other counties in the U.S. is significant: in 2000, average

income in the persistently poor counties was 72 percent of the non-poor county average. These

five regions differ greatly in racial, ethnic, geographic, and economic composition, yet share the

seeming puzzle of persistent poverty.

It is also interesting that these areas of persistent poverty occur within one of the richest

countries in the world. Much of the growth and development literature has concentrated on

explaining differences in standards of living across countries: why is income per capita about

$40,000 in the U.S. and similar countries, but only approximately $500 in many African

countries? Proposed explanations include differences in policy choices, institutions, and

geography, as well as factor accumulation. Since many of these potential determinants of

income per capita occur at the national level – monetary and trade policies, much of the system

1 In fact, the USDA defines a county as persistently poor if its poverty rate exceeds 20 percent in each Census since 1970 (http://www.ers.usda.gov/briefing/rurality/typology/maps/Poverty.htm). We extend this to include the 1960 Census. We note that the typography of persistent poverty is the same if one adopts a more stringent criterion of a 30 percent poverty rate, though fewer counties meet the criteria in each subregion.

2

of government – looking at subnational differences in incomes allows us to better understand the

relative importance of other factors that vary within a country. We utilize a neoclassical growth

framework to examine whether these differences in income, and the areas of persistent poverty

that have arisen, are due primarily to differences in aggregate production technologies, or

whether the historical roles of institutions, culture, geography, and endowments of human capital

have placed these regions on divergent growth paths.

The dynamic panel data model of Islam (1995) advanced the growth empirics literature

by explicitly allowing for heterogeneity in aggregate production functions via the inclusion of

permanent cross-country differences. This is attractive because a key determinant of steady-state

growth in the empirical model of Mankiw, Romer, and Weil (1992), which follows the canonical

Solow (1957) model closely and serves as the basis of Islam’s approach, is the initial level of

productive efficiency, and cross-country heterogeneity in productive efficiency seems likely.2

We apply these ideas to U.S. county-level data to identify why certain regions in an otherwise

wealthy nation remain mired in persistent poverty. Specifically, we assemble a panel of counties

across the five Decennial Censuses from 1960 to 2000 to estimate a dynamic panel data model of

conditional income convergence. In the full sample, we allow aggregate production technologies

to differ in terms of productive efficiency by including county intercepts. We also admit

potential heterogeneity in the production function parameters by estimating the dynamic income

model separately for counties classified as persistently poor versus those that are not poor. Lee,

Pesaran, and Smith (1997) discuss the issue of parameter heterogeneity in the cross-country

context, but as noted by Durlauf, Johnson, and Temple (2005), identification in these models is

perilous, and a possible alternative is to split the sample into groups likely to share similar

parameter values. Since our interest is in understanding what sets the persistently poor counties 2 Efficiency is also referred to as “technology” or “productivity.”

3

apart, we estimate separate models based on persistent poverty status and test whether rates of

conditional income convergence differ across regions. To correct for possible measurement

error at the county level, we estimate our income models with a long-difference GMM estimator

proposed by Hahn, Hausman, and Kuersteiner (2007), which has good finite sample properties

compared with other dynamic panel estimators such as the Blundell-Bond (1998) estimator.

With the estimated parameters from the income convergence models, we recover the

county-specific intercepts, which serve as proxies of initial productive efficiency. As Durlauf and

Quah (1999) argue, these intercepts are not nuisance parameters to discard, but rather serve as a

mechanism through which we can uncover permanent differences in income. Recent research has

emphasized culture, geography, and institutions as key determinants of long-run growth, and

these forces are likely embedded in the fixed effects (see, among many others, Grief 1994;

Rappaport and Sachs 2003; Acemoglu, Johnson, and Robinson 2005). These factors have

figured prominently in historical economic and sociological research on the development of the

antebellum South as well as Appalachia (Engerman 1966; Fogel and Engerman 1974; Billings

1974; Duncan 1999; Ransom and Sutch 2001; Eller 2008). We thus estimate a model that links

our modern county fixed effects to county-level data from the historical 1890 and 1900 Censuses

that contain rich data on land tenure, religious participation, nativity, and human capital, and

supplement it with data on geography including elevation and temperature.

Although much of the growth literature has been directed at understanding income

convergence across rich and poor countries, we are not the first to apply such models to regional

income growth within the U.S. Much of this work has centered on a panel of states (Barro and

Sala-i-Martin 1992; Evans and Karras 1996; Bauer, Schweitzer, and Shane 2006), but some has

also utilized county data (Rappaport and Sachs 2003; Clifton and Romero-Barrutieta 2006;

4

Higgins, Levy, and Young 2006). Rappaport and Sachs (2003) focus on changes in population

density, especially on the coasts, as opposed to income. Higgins, et al. (2006) use county-level

data from 1970-2000 to estimate income convergence rates, but they do not recover the county

fixed effects to identify the historical role of institutions, culture, and geography. Clifton and

Romero-Barrutieta (2006) pose a parsimonious model relating county poverty in 2003 to

geography and land tenure in 1910, but do not estimate the income-convergence process, and

they focus on poverty in Appalachia at a point in time as opposed to the broader set of

persistently poor regions over time. Thus, while we draw on several key ideas in the previous

research, our study differs in several dimensions in terms of methods, data, and emphasis.

We find evidence of significant regional differences in production technologies, but rates

of convergence, which range from 7 to 9 percent, do not differ significantly between persistently

poor and non-poor regions. Illiteracy rates at the close of the 19th century are a key determinant

of persistent poverty status at the close of the 20th century, suggesting long term negative

consequences for low initial human capital among persistently poor counties.

II. The Origins of Persistent Poverty: Some Preliminaries

On the eve of the 1960s the poverty status of a vast stretch of the United States was

bleak. In Figure 2 we present county poverty rates for 1959 based on income reported in the

1960 Census.3 The figure shows that county poverty rates in excess of 50 percent were common

in the South, and those in excess of 20-30 percent were the norm in the Midwest and Plains

states. With strong economic growth and expansion of income support as part of the Johnson

Administration’s Great Society programs, the poverty landscape changed dramatically over the

next decade. Figure 3 shows that county poverty rates in the 1970 Census were considerably

3 The United States did not produce its first estimates of poverty until the 1960s, but in the special tabulation the Economic Research Service of the USDA produced estimates for the 1960 Census. We thank Robert Gibbs of USDA for providing these data.

5

lower throughout much of the nation, especially in the South and Midwest. However, shades of

the five regions of persistent poverty already emerge as poverty rates remained in excess of 40

percent in many of the counties. Further progress against poverty continued through the 1970s as

Figure 4 shows, but comparing poverty rates in 1979 against those in 1989 in Figure 5 suggests

that gains against poverty abated in the 1980s. The economic expansion of the 1990s was

similar in strength to the 1960s, and as seen in Figure 6 poverty fell compared to 1989, but the

expansion was not enough to lift the poorest areas up to levels found elsewhere. Across Figures

2 to 6 it is clear that the poverty status of many counties that are identified as persistently poor in

Figure 1 improved over the past four decades, but poverty in many of those counties remained

four times higher than the national rate.

A fundamental question in the growth literature is whether the counties that have

emerged over the last four decades as persistently poor attained that status because of differences

in factor accumulation, or whether their status is due to differences in their aggregate production

functions – that is, their ability to use the factors that they have. In other words, do persistently

poor counties have lower levels of inputs like physical and human capital, or do they use the

capital they have less efficiently than other counties? We consider two types of differences in

production technology: differences in the initial level of a county’s “technology” and institutions,

and differences in the rate at which factors are transformed into output. We also focus on the

determinants of these underlying differences in initial conditions.

To fix ideas, the growth literature generally specifies that output or income is determined

by stocks of physical and human capital along with economy-specific “technology” or

“productivity,” which encompasses technology, institutions, and endowments of natural

resources. Table 1 illustrates some descriptive differences in incomes and observed human and

6

physical capital between the persistently poor counties and non-persistently poor (“non-poor”)

counties pooled across the 1960 to 2000 Censuses. In our sample there are 3,094 counties, 338 of

which are categorized as persistently poor.4 A data appendix provides details on the sources and

definitions of the variables. Not surprisingly, the persistently poor counties have lower per capita

real incomes, fewer people in the labor force, slower growth of the labor force, fewer high school

graduates, and lower new capital expenditures in manufacturing. The persistently poor counties

are also less urban, smaller (in terms of population), and have higher percentages of African

Americans. However, the bottom two panels of Table 1 indicate some signs of convergence

between the regions. In 1960, income per capita in the persistently poor areas was 62 percent of

the non-poor areas, but by 2000 it was 72 percent of non-poor income. Likewise, high school

completion rates in the persistently poor counties increased from 63 percent of the non-poor rate

in 1960 to 82 percent in 2000; and real capital spending increased from 7 percent of the non-poor

rate in 1960 to 17 percent in 2000. We more formally test for evidence of convergence below.

Many have argued that fundamental determinants of growth, such as geography,

institutions, and culture, explain more of the difference in growth rates than do current

differences in factor accumulation (Easterly and Levine 2001; Acemoglu, et al. 2005). To proxy

for these components of growth, we use historical data from the Censuses of the late 19th and

early 20th centuries. Since 1890 was the cusp of expansions in the logging and coal industries in

the U.S., we use 1890 data whenever possible (but for several variables that are not available in

1890, we use 1900 data).

Sociologists such as Duncan (1999) and Billings and Blee (2000) argue that the roots of

persistent poverty in Appalachia and the Mississippi Delta can be traced to social and economic

institutions that bestow ownership rights of land and resources to a select group of individuals— 4 There are 3,141 counties in the country, but data are missing on one or more variables on 47.

7

in the case of Appalachia to absentee coal and timber barons, and in the Delta to plantation

owners. Economists such as Fogel and Engerman (1974) and Ransom and Sutch (2001) have

made a similar case on the role of institutions on the economic development and growth in the

South in the decades following the Civil War, especially the economic organization of

sharecropping. Likewise, resettlement of Native Americans in the 19th Century often took the

form of removal from productive lands in the South and East to non-productive, arid lands in the

central Plains (Barrington 1999). This suggests that the extent of local ownership of land and

natural resources, sometimes referred to as land tenure, likely varies across the U.S. in response

to regional political institutions, and the higher the share of local land tenure, the more

productive income remains in the local community.

In the historic Censuses of the 19th and early 20th centuries, data were collected on the

number of improved and unimproved acres of farmland, distinguishing whether or not the

improved acres were owner-occupied. We thus construct a proxy for local institutions as the

fraction of farmland in the hands of local owners (Clifton and Romero-Barrutieta 2006).5 As

seen in Table 2, average land tenure in 1890 was higher in non-poor counties (80 percent)

compared to persistently poor counties (77 percent), and the variance lower. However, because

the differences are not striking, the empirical importance of historical land tenure on persistent

poverty today is not clear a priori.

Culture is also suggested as a possible source of persistent poverty, both across and

within countries (Banfield 1970; Billings 1974; Murray 1984; Grief 1994). Most prominent

among these is the role of religion in economic development, especially the argument made a

century ago by Max Weber that Protestantism (and Calvinism in particular) played a crucial part

5 Note that the sample size is smaller in Table 2 than in Table 1. As described in the Data Appendix we restrict our historic analysis only to counties without redefined borders between the relevant year (1890, 1900) and 1960.

8

in the economic success of Northern Europe compared to its southern neighbors who were

predominantly Roman Catholic. Barro and McCleary (2003) and Cavalcanti, Parente, and Zhao

(2007) provide some evidence in support of the Weber thesis in the cross-country context. In the

case of the U.S., religion-based cultural influences are determined by historic patterns of

immigration. 19th Century immigrants in the East, Midwest, and West tended to be dominated by

Roman Catholics, while those in the South were primarily Baptists. The early Scots-Irish who

settled northern and central Appalachia in the 18th Century tended to be Presbyterian, while later

immigrants from Germany tended to adhere to various movements within the Baptist faith as

well as Catholicism. The 1890 Census of Religious Bodies recorded the number of persons in a

county who claim membership in a church, both overall and by denomination. Table 2 includes

the means of the overall share of the population who are churched, as well as the share of some

major denominations. The table shows that differences in the share of the population counted as

church members are very small across poverty groupings, although the distribution across

religious affiliation varies considerably between the persistently poor and other counties. For

example, the share of residents who are Calvinist is twice as high in the non-poor counties,

which the Weber thesis would suggest greater economic growth in non-poor counties. However,

the non-poor counties are also have higher shares of Roman Catholics (5.9 versus 3.4 percent),

which is contrary to the pro-growth Weberian view. There is, however, twice the share of

Baptists in persistently poor counties compared to non-poor counties.

Geography, including mountainous terrain and temperature, is also often listed as a

barrier to riches (Gallup, Sachs, and Mellinger 1999; Acemoglu, et al. 2005; Iyigun 2005;

Rappaport and Sachs 2003; Rappaport 2007; Eller 2008). For example, the Appalachian

Mountains, which span from Mississippi to Maine, are rugged and densely packed with narrow

9

valleys, which make development a challenge. Likewise, the lowlands of the Mississippi Delta

were historically prone to flooding, another barrier to development. The hot and humid summers

of the South and the arid farmland of Native American reservations also pose challenges to

productivity.

In Table 2 we present two measures of geography, latitude (normalized relative to 90

degrees, so that larger values represent more northern locales), and the standard deviation of

elevation as a measure of how mountainous the terrain is. As expected based on the map in

Figure 1, Table 2 shows that persistently poor counties are located further south than non-poor

counties. The difference in the means between the poor and non-poor is about four degrees,

which is approximately the difference in latitude between Oxford, Mississippi and St. Louis,

Missouri. There is little difference between the persistently poor and non-poor counties in terms

of the standard deviation of elevation; the persistently poor are have slightly less variation than

do the non-poor.

Finally, economists have stressed the important roles of human capital endowments and

potential agglomeration economies in urban areas (for example, Glaeser, Kallal, Scheinkman,

and Shleifer 1992; Moretti 2004; Shapiro 2006). In the 1900 Census, individuals were asked

whether they could read or write, which leads to our focal historical measure of human capital,

the illiteracy rate. We also include the share of the county’s population that is foreign born,

while we use the share of the population residing in an urban area as our measure of

agglomeration economies. Table 2 shows that the most striking difference in the historical data

is in the human capital variables: counties that are classified as persistently poor today had

illiteracy rates more than three times higher than other counties in 1900 (36 percent versus 11

10

percent). Persistently poor counties also had urban shares 80 percent lower, and shares of

foreign-born residents 75 percent lower, than non-poor counties.

For a preliminary look at whether these historical differences in institutions, geography,

and human capital matter for the chances of being persistently poor a century later, in Table 3 we

specify a linear probability model of the probability that a county is persistently poor over the

period 1960-2000 as a function of historic Census data. In each regression, 273 counties (9.6

percent of the sample) are persistently poor. Column 1 of Table 3 contains a parsimonious

specification of persistent poverty as a function of institutions (land tenure) and geography

(temperature and terrain).6 The results in column 1 show that a percentage point increase in the

share of owner-occupied farmland in 1890 lowers the probability of being persistently poor a

century later by just under 10 percent, and living in the North reduces the chances being

persistently poor (a one degree move North lowers the probability by about 2 percent). In

columns 2 and 3 we present results from our preferred specifications that add measures of culture

and human capital to the regression model. Higher literacy rates, higher church membership

(especially Calvinist and Baptist), and higher urban population shares all significantly lower the

odds of being persistently poor. At the same time, the addition of these variables completely

wipes out any role for institutions and geography. In particular, as we show in columns 4 and 5,

omitting illiteracy causes the coefficient estimate on latitude and farm ownership to regain

statistical significance, and also causes the coefficient estimates on the shares of Baptists and

Catholics to become positive and statistically significant, while the coefficient estimate on the

share of Calvinists increases in magnitude but remains negative. Illiteracy is highly correlated

6 This is akin to the model estimated by Clifton and Romero-Barrutieta (2006). The dependent variable in their model is the poverty rate in 2003, rather than an indicator variable for persistent poverty for 1960-2000 as we use. In addition, we use two measures of geography—temperature and terrain—whereas they use only terrain.

11

with the prevalence of some denominations, in particular the share of Baptists in a county7, so

that models in columns 4 and 5 yield the spurious result that a higher share of Baptists increases

the probability of being a persistently poor county when in fact the driving force is county rates

of illiteracy. This result highlights the long-run importance of endowments of basic human

capital to county economic status. In the following sections, we turn to the growth literature to

examine more formally the mechanisms through which these historical factors affect modern

income levels while simultaneously accounting for current levels of factor accumulation.

III. Dynamic Model of Income Convergence

In neoclassical growth models such as the textbook model of Solow (1956), an economy

converges to a steady state determined by factors such as the economy’s rates of saving and

population growth, where income per capita grows at the rate of technological progress. To

illustrate, consider the human capital-augmented version of the Solow model following Mankiw,

Romer, and Weil (1992) and Islam (1995). The production function is given by

(1)

where Y is aggregate output, K and H are stocks of physical and human capital, L is the labor

force, and A is labor-augmenting technology, which grows at the exogenous rate g. Output is

invested in physical and human capital at the constant rates sk and sh respectively.

Under standard assumptions, an economy’s rate of convergence to its steady-state level

of income per capita (y=Y/L) can be derived:

ln ln 1 ln 1 ln 1

ln 1 ln 1 ln (2)

7 The correlation coefficient between the 1900 illiteracy rate and the share of church members in 1890 is 0.265; for Baptists, Calvinists, and Catholics, respectively, the correlations are 0.630, -0.189, and -0.163.

12

where λ=(n+g+δ)(1-α-γ). Thus, steady-state growth in income per capita depends on investment

in physical capital (sk), investment in human capital (sh), a term including population growth,

technological progress and depreciation (n+g+δ), initial income (y), and the initial level of

technology (A0). Furthermore, with reasonable assumptions about the values of α and γ (the

shares of physical and human capital, respectively, from the production function), the coefficient

estimate on the log of initial income can be used to infer the speed of convergence toward the

steady state (i.e., λ).

Traditionally, this has typically been estimated in the cross section, with an assumption

not only of identical production functions across economies (most commonly, countries), but

also of identical rates of technological progress; economies’ steady states still differ, based on

their savings and population growth rates. Under these assumptions, controlling for rates of

population growth and savings, an initially poor economy will “converge” to the same steady

state as an initially richer economy. This test of “conditional convergence” has received wide

support in the empirical literature.

However, moving to a panel framework, as in Islam (1995), allows for the estimation of

different initial levels of technology across economies (the rate of growth of technology is still

assumed constant across economies.) The workhorse specification in a panel data setting comes

from Islam (1995) (see also eqn (59) in Durlauf, et al. 2005):

1 (3)

where is the log of real income per capita for county i (=1,...,N) in year t (=1,...,T); is

the lag of the dependent variable; are time-varying rates of factor accumulation (new capital

investment, population growth rate, school enrollment rates); are time-invariant income

factors that may affect initial productivity (A0) such as geography and institutions; represents

13

unobserved, permanent differences across counties in productivity that do not vary over time;

is a macroeconomic shock common across counties but time varying; and is an iid error for

county i in year t. The parameter identifying the speed of convergence is .

By construction, unobserved productivity is correlated with and thus to

eliminate the latent heterogeneity we take differences

∆ 1 ∆ ∆ ∆ ∆ . (4)

where ∆ is the difference operator. Although is iid and uncorrelated with all right-hand side

variables, in differences, ∆ is MA(1) and thus correlated with ∆ . If the number of time

periods T is large, then the so-called Nickell bias of estimating (2) via OLS is small; that is, the

bias goes to zero as T goes to infinity (Nickell 1981). In our case, although the spacing of

measurement is per decade, the number of actual decades is small (T=5) and thus we will need to

instrument for the lagged dependent variable.

There are many possible estimators for the dynamic panel data model in equation (4).

For example, if we apply first differences, then under the assumptions of our model is a

valid instrument for ∆ , yielding the just-identified Anderson-Hsiao (1982) estimator.

Arellano and Bond (1991) proposed a more efficient GMM estimator, and Blundell and Bond

(1998) proposed a GMM estimator that was more robust in the presence of weak instruments and

when the true parameter on the lagged dependent variable approached unity. We instead use a

GMM estimator recently proposed by Hahn, Hausman, and Kuersteiner (2007) that has finite

sample properties at least as good as the Blundell and Bond estimator. Griliches and Hausman

(1986) showed that in the presence of mismeasured covariates, first differences tended to

exacerbate errors-in-variables bias relative to pooled OLS, and even relative to the within

estimator. However, taking wide differences, e.g. two-year or wider, had the effect of

14

attenuating bias from measurement error. Hahn, et al. (2007) apply this idea to the dynamic

panel case and suggest taking as wide a difference as possible while still retaining an instrument

for the lagged dependent variable.

In our model with Decennial Census data from 1960 to 2000 we estimate

∆ 1 ∆ ∆ ∆ ∆ . (5)

where ∆ is the three-decade difference operator. That is, the dependent variable is ∆

, the lagged dependent variable is ∆ , and the time-

varying covariates are ∆ . Note that the differenced time dummy

becomes a constant term because in long-difference form we are in effect left with a cross-

sectional model in equation (5). This approach retains as a valid instrument for the lagged

dependent variable. We in fact overidentify the model by including the t-3 and t-4 lags of the

time-varying productive factors, , as additional instrumental variables for the lag

dependent variable. In the case that the human and physical capital variables in X are strictly

exogenous, i.e. 0 , , then the ∆ serve as instruments for themselves. We test

this assumption against the alternative that human and physical capital should instead be treated

as predetermined to the income process of a county, in which case 0 and thus

the ∆ are dropped from the instrument set.

Specifically, define the matrix of regressors as ∆ , ∆ , 1 , the vector of

unknown coefficients as Γ 1 , , ∆ , and the matrix of instruments

, ∆ , , . Then the GMM estimator for the linear dynamic panel estimator

is Γ Ω Ω Δ , where Ω is an initial consistent estimate

of the weight matrix. For the two-step GMM estimator we take the identity matrix as the initial

15

weight matrix and then apply a heteroskedasticity-robust weight matrix in the second step.8 The

associated variance-covariance matrix is given as Γ Ω . Define

the Hansen (1982) J-test of the validity of the set of overidentifying restrictions as

Γ Ω Γ , which is distributed asymptotically chi-square with degrees of freedom

equal to the number of overidentifying restrictions. Newey and West (1987) show that it is

possible to test the validity of a subset of overidentifying restrictions as , where U

refers to the unrestricted model that includes Δ in the instrument set and R refers to the

restricted model without those instruments. The C-statistic is distributed asymptotically chi-

square with degrees of freedom equal to the number of restrictions.

The model in equation (5) assumes common production technology across counties, and

thus a common speed of convergence, except for the initial productivity embedded in . We

test this assumption by applying the long-difference GMM estimator to subsets of persistently

poor counties Γ and the non-poor counties Γ , and then conducting a Wald test for whether

convergence rates are similar as Γ Γ Γ Γ Γ Γ , which

is distributed asymptotially chi-square with degrees of freedom equal to the dimension of Γ.

In addition to examining the possible differences between aggregate production

functions between the persistently poor and other counties, we are interested in the extent to

which residual growth rates – also referred to as “technology” or “productivity” in the growth

literature – are explained by historical data. However, a consequence of differencing the model

in equation (5) is that it is no longer possible to identify , the vector of coefficients on the time-

invariant regressors Zi that in part determine initial productivity (e.g., culture, geography, and

8 Hahn et al. (2007) also discuss alternative, more efficient GMM estimators than the one we use here that take advantage of additional moment conditions in between year t and t-k.

16

institutions). It is still possible to recover estimates of with an auxiliary regression as follows

(Hausman and Taylor 1981; Hsiao 1986). Based on the GMM estimates from equation (5) we

construct an estimate of the fixed effect for each county as

1 , , (6)

where is the county-specific “time mean” and the other variables are similarly defined. We

then estimate a regression of the fixed effect on those factors that affect permanent differences

across counties as

, (7)

and thus identify . Provided that the unobserved initial conditions in 1960, the at the start of

our sample period, are uncorrelated with the Zi factors from the distant past then equation (7) can

be estimated via OLS.

IV. Evidence on Convergence

We first present estimates of the long-difference GMM model from equation (5) in Table

4. To recap, the dependent variable is the log difference of real per capita income between 1970

and 2000. The independent variables include the 1960-1990 lag difference in log income per

capita (yit-1), and the changes from 1970 to 2000 in the following (Xit): the percent of high school

graduates; real private capital expenditures in manufacturing; labor force growth; urban share;

and the share of residents who are black. For comparison purposes, in Regression 1 we estimate

equation (5) via OLS, while in Regressions 2 and 3 we use GMM. In the base case GMM model

of Regression 2 the instruments for the lagged dependent variable include the 1970-2000 change

in the Xit as well as the 1960 and 1970 levels of the Xit. In Regression 3 we exclude the 1970-

2000 change in the Xit from the instrument set and conduct pseudo-likelihood ratio tests (the C

statistic above) of the validity of their inclusion.

17

Regardless of the estimation method, per capita income levels are positively correlated

with lagged per capita income, and thus based on the specification in equation (1), per capita

income growth is negatively related with the lagged income level, implying conditional

convergence across U.S. counties. The rate of convergence implied by the models is found by

the transformation 1 log / , where time is the interval of data

measurement (here, 10 years) (Islam 1995).9 The estimated convergence rates presented in the

lower panel of Table 4 are quite consistent across the three specifications, at 0.08, and within the

range of estimates found in previous research at various levels of aggregation (e.g., Islam 1995 at

the cross-country level; Higgins et al 2006 at the county level). Other results of note in Table 4

are that per capita income growth is positively related with levels of human capital and urban

share, and negatively related to the share of the county population that is black. Given the semi-

logarithmic specification, the coefficients on the production function terms are more readily

interpreted in terms of elasticities. For example, the elasticity of per capita income with respect

to the share of high school graduates in Regression 2 is 0.23 and in Regression 3 is 0.13.10 In

Regression 2, the Hansen J-test of 101.6 rejects the null hypothesis that the overidentifying

restrictions are valid, but the test result of 9.98 in Regression 3 indicates that the restrictions are

valid at the 8 percent level. The estimated C-statistic of 91.6 thus confirms that our controls for

human and physical capital in the production technology should be treated as predetermined to

the income process for estimation.

One of our primary interests is in the extent to which aggregate production functions – or

the efficiency with which inputs (e.g., capital, labor) are transformed into output – of the

persistently poor counties differ from the production functions of other counties. As we saw in

9 The standard error for the convergence rate is found by the delta method (Wooldridge 2001). 10 In the semi-logarithmic specification of lny = a + b*x + u the elasticity is b*x.

18

Table 1, the persistently poor counties are different in terms of the levels of such inputs: they

have smaller, less well-educated labor forces, for example. To what extent are these lower

incomes explained by lower levels of inputs, and to what extent are they due to less efficient use

of the inputs? In Table 5, we repeat the analysis of Table 4 separately for the persistently poor

and non-poor counties.

Regressions 1-3 present results from the non-persistently poor sample, while Regressions

4-6 are for the persistently poor counties. Not surprisingly, the results in Regressions 1-3 are

very similar to those in Table 4, since the non-poor counties make up the overwhelming majority

of the full sample (2,752 out of 3,094). The estimates of the convergence parameter are also

quite similar, although slightly higher (this is also expected, since the persistently poor counties

over the 1960-2000 period have been dropped from the sample). As with the pooled sample, the

Hansen J-test and associated C-statistic point to the GMM model with ∆Xit excluded from the

instrument set (Regression 3)

Regressions 4-6 are estimated on the 338 persistently poor counties. While the effects of

lagged income growth and human capital are similar to those for the non-poor counties, most of

the other coefficient estimates differ substantially. For both the OLS and GMM regressions a

Wald test rejects the null hypothesis of equal coefficients between persistently poor and non-

poor counties, with a p-value of at least 0.02. Specifically, labor force growth is more strongly

and positively correlated with growth in the persistently poor counties, while the urban share –

which was strongly and positively correlated with growth in the non-poor counties – has little

effect. The return to physical capital accumulation is also higher in the persistently poor

counties, as the assumed Cobb-Douglas production function with diminishing returns to capital

would predict. Also, the negative correlation between growth and the share of black residents

19

seen in non-poor counties is not present in any of the persistently poor regressions. While the

coefficient estimates on the fraction having completed high school are comparable (except in

Regressions 3 and 6), recall that the level of high school completion is significantly higher in the

non-persistently poor counties: approximately 15 percentage points higher in the pooled sample.

Finally, for each model, Table 5 also presents estimated convergence rates. These are

quantitatively similar to those in Table 4 for the full sample, though the range is somewhat

wider. Although the Wald tests for equal convergence rates between persistently poor and non-

poor counties cannot reject the null for either OLS or GMM estimators, recall that these

estimates are for convergence within the groups (i.e., the persistently poor counties are

converging to a lower income level than are the non-poor; the results in Table 5 cannot provide

evidence for or against convergence across the entire sample).

V. The Origins of Persistent Poverty: Further Evidence

In Table 6, we report OLS estimates of equation (7) where the dependent variable is

residual growth rates (i.e., growth after controlling for the explanatory variables from our

preferred specification, Regression 3 of Table 4) from the entire sample. These residuals are

regressed on variables from historical Census data described in Section II. Recall that as a proxy

for institutions of land tenure, we include the percentage of farms operated by their owners. As

measures of geography, we include both a normalized measure of latitude and the standard

deviation of the county’s elevation. Finally, to proxy for culture, we include the population

shares of three major faiths in 1890: Baptists, Catholics, and Calvinists. We also include

historical data on illiteracy rates and urbanization to control for initial endowment of human

capital and the potential for agglomeration economies. In Regression 1, the variables are entered

linearly; in Regression 2, we include quadratic terms to allow for potential nonlinearities.

20

Interestingly, given the prevalence of the “fundamental determinants” argument in the

cross-country literature, these variables together explain only a small fraction of the variation in

residual income across counties: adjusted R2s are low in both regressions. Of course, even within

a country as large and geographically diverse as the United States, there is much less variation in

these factors as there is across countries.

In the specification with only linear terms (Regression 1), residual growth is negatively

correlated with higher shares of farms operated by owners, while the standard deviation of

elevation is positively correlated with residual growth. At first blush the latter two findings may

seem surprising, but given the linear probability results of Table 3 where once we controlled for

initial human capital and culture land tenure and variation in elevation had opposite signs than

expected (though no statistical impact), these results are less surprising. Counties with higher

shares of Baptists and Calvinists in 1890, higher shares of foreign-born residents in 1900, and

more urbanized in 1890, also have, on average, higher residual growth rates.

Allowing for quadratic relationships (in Regression 2) changes these implications

somewhat. In particular, there are inverted U-shaped relationships between residual growth and

normalized latitude, 1890 Baptist share, 1890 Catholic share, 1900 foreign-born share, and the

1900 illiteracy rate, i.e. residual growth is increasing at a decreasing rate in these factors. For

most of these variables, the maximum value of the marginal effect occurs at a fairly high level,

so that the correlation is increasing for over 90% of the sample. A notable exception is the

inverted U-shape relationship between latitude and residual growth, where the maximum value

occurring closer to the sample median (at a normalized latitude of 0.44; 61 percent of the sample

is below this level). Importantly, even though the negative link between initial income and

illiteracy does not occur until illiteracy rates reach 38 percent, this happens to be quite close to

21

the sample mean of persistently poor counties as identified in Table 2. The negative correlation

between residual growth and Catholic share, and the positive correlation between residual

growth and urbanization, seem to occur only at the highest levels of Catholic share and

urbanization, respectively.

In Table 7, we address whether these historical determinants of residual growth differ for

the persistently poor counties and other counties. In Regressions 1 and 2 of Table 7, we present

results for the non-poor counties, where the dependent variable is the residual growth rate from

Regression 3 of Table 5 (the regression on the non-poor subsample, with the regressors treated as

predetermined). Consistent with our earlier results, and as we expect given that the non-poor are

such a large portion of the full sample, Regressions 1 and 2 look quite similar to the full sample

results of Table 6.

Our main interest, however, lies in comparing these estimates (Regressions 1 and 2) for

the non-poor sample with estimates for the persistently poor subgroup. Regressions 3 and 4

include results for the persistently poor counties, where the dependent variable is the residual

growth rate from our preferred specification, Regression 5 of Table 5 (estimated with the

regressors treated as exogenous). As indicated at the bottom of the table, formal Wald tests

strongly reject the null hypothesis of equal coefficients.

Comparing Regressions 1 and 3, a number of the qualitative and quantitative estimates

are quite different. While farm ownership has a negative and statistically significant correlation

with residual growth for the non-poor sample, the correlation is not statistically different from

zero among the poor counties. Among the non-poor counties, the standard deviation of elevation

is positively and statistically significantly correlated with residual growth, but this effect changes

sign and loses statistical significance among the persistently poor. This may suggest additional

22

nonlinearities or contingencies in the relationship: a county that is rich enough (through other

means) to pave roads through the mountains, or large enough to have a less mountainous area

where production can more easily be located, may be able to capitalize on mountains for their

amenity value, while poorer counties (or counties with no flat area at all) struggle with

transportation and access.

In Regressions 1 and 3, the coefficient estimate on Baptist share is positive and

statistically significant in both regressions, while the estimates on Calvinist and Catholic shares

are not statistically significant in either case. For both the foreign-born share and the illiteracy

rate, the effects are opposite in sign but statistically significant in both samples. A higher

percentage of foreign born in 1900, and a higher illiteracy rate in 1900, are positively correlated

with residual growth in the non-poor sample, but negatively correlated with growth in the

persistently poor sample. (Recall from Table 2 that the persistently poor counties had, on

average, much higher illiteracy rates in 1900, and much lower percentages of foreign born

residents.) Finally, the positive and statistically significant correlation between urbanization in

1900 and residual growth in the persistently poor sample does not appear in the non-poor sample.

Regressions 2 and 4 include quadratic terms for all of the regressors. There are several

nonlinear relationships worth discussing. Among the non-poor sample, the nonlinear

relationship between residual growth and illiteracy found in Table 6 remains, and the nonlinear

relationship between residual growth and urbanization strengthens. The implied maximum of

the marginal correlation between illiteracy and residual growth occurs at an illiteracy rate of 49

percent; only 56 counties (3 percent of the sample) have illiteracy rates higher than this. The

implied minimum of the U-shaped relationship between urbanization in 1890 and residual

growth occurs at an urban share of 36.5 percent, which 86 percent of the sample is below. For

23

the persistently poor sample, the quadratic specification seems to be “overfitted” for the

comparatively small sample. The only statistically significant coefficient estimate is on the

Baptist share entered linearly, which remains positive. However, of interest is the negative

coefficient estimate on illiteracy that is opposite in sign from those in the non-poor sample,

suggesting potential long term negative consequences for low initial human capital.

VI. Conclusion

Deep pockets of poverty persist in several regions of the United States despite the

widespread availability of technology and the lack of institutional barriers to labor mobility

across county and state borders. To examine the sources of persistent poverty across regions we

estimated a dynamic panel data model of conditional income convergence using county-level

data from the past five Censuses. We also recovered county-specific estimates of residual

income to examine the influence of fundamental determinants of income; namely, the roles of

culture, geography, human capital, and institutions based on measures from historic Censuses of

1980 and 1900.

Our estimates from a long-difference GMM estimator suggest that there are significant

regional differences in production technologies between counties that are persistently poor

versus non-poor: in particular, income growth is more highly correlated with capital spending

and the growth of the labor force in the persistently poor counties, and less correlated with the

urban share of the county. Despite this, rates of income convergence, which range from 7 to 9

percent, do not differ substantively between poor and non-poor regions. In terms of fundamental

determinants, we found an inverted U-shaped relationship between residual growth and our

measures of geography, culture, and human capital; that is, initial income is increasing at a

decreasing rate in these factors. Perhaps most compelling of the fundamental determinants is the

24

role of initial human capital. Counties with high rates of illiteracy in the late 19th century were at

much greater risk of being persistently poor a century later, suggesting long-term negative

consequences for low initial human capital among persistently poor counties. An implication of

these results is that to overcome such barriers to riches, these counties must invest heavily in

human capital development to catch up to their richer neighboring counties.

VII. References Acemoglu, D., S. Johnson, and J. Robinson (2005), “Institutions as the fundamental cause of

long-run growth,” in P. Aghion and S.N. Durlauf, eds., Handbook of Economic Growth, vol. 1. Amsterdam:. North-Holland Elsevier: 385-472.

Anderson, T., and C. Hsiao (1982), “Formulation and Estimation of Dynamic Models Using

Panel Data,” Journal of Econometrics 18(1): 47-82. Arellano, M., and S. Bond (1991), “Some Tests of Specification for Panel Data: Monte Carlo

Evidence and an Application to Employment Equations,” The Review of Economic Studies 58 (2): 277-297.

Banfield, E. (1970), The Unheavenly City. Boston: Little, Brown and Company. Barrington, L. (1999), “Editor’s Introduction: Native Americans and U.S. Economic History,” in

L. Barrington, ed., The Other Side of the Frontier: Economic Explorations into Native American History, Westview Press.

Barro, R., and R. M. McCleary (2003), “Religion and Economic Growth,” American

Sociological Review 68 (5): 760-781. Barro, R., and X. Sala-i-Martin (1992), “Convergence,” Journal of Political Economy 100 (2):

223-251. Bauer, P., M. Schweitzer, and S. Shane (2006), “State Growth Empirics: The Long-Run

Determinants of State Income Growth,” Social Science Research Network. Billings, D. (1974), “Culture and Poverty in Appalachia: A Theoretical Discussion and

Empirical Analysis,” Social Forces, 53, 2, 315-323. Billings, D., and K. Blee (2000), The Road to Poverty: The Making of Wealth and Hardship in

Appalachia. New York: Cambridge University Press.

25

Blundell, R., and S. Bond (1998), “Initial conditions and moment restrictions in dynamic panel data models,” Journal of Econometrics 87(1): 115-143.

Cavalcanti, T., S. Parente, and R. Zhao (2007), “Religion in macroeconomics: a quantitative

analysis of Weber's thesis,” Economic Theory 32: 105-132 Clifton, E., and A. Romero-Barrutieta (2006), “Institutions v. geography: sub-national evidence

from the United States,” International Monetary Fund Working Paper. Duncan, C., (1999), Worlds Apart: Why Poverty Persists in Rural America, New Haven:

Yale University Press. Durlauf, S. N., P. Johnson, and J. Temple (2005), “Growth Econometrics,” in P. Aghion and

S.N. Durlauf, eds., Handbook of Economic Growth vol. 1.. Amsterdam: North-Holland Elsevier: 555-677.

Durlauf, S. N., and D. Quah (1999), “The new empirics of economic growth,” in J. Taylor and

M. Woodford, eds, Handbook of Macroeconomics vol. 1. Amsterdam: North-Holland: 235-308.

Easterly, W., and R. Levine (2001), “What have we learned from a decade of empirical research

on growth? It's Not Factor Accumulation: Stylized Facts and Growth Models,” The World Bank Economic Review 15(2): 177-219.

Eller, R (2008), Uneven ground: Appalachia since 1945. Lexington: The University Press of

Kentucky. Engerman, S. (1966), “The Economic Impact of the Civil War,” Explorations in Economic

History 3: 176-99. Engerman, S., and R. Fogel (1974), Time on the Cross: The Economics of American Negro

Slavery. Boston: Little Brown. Evans, P., and G. Karras (1996), “Convergence revisited,” Journal of Monetary Economic 37

(2): 249-265. Gallup, J., J. Sachs, and A. Mellinger (1999), “Geography and economic development,”

International Regional Science Review 22(2): 179-232. Glaeser, E., H. D. Kallal, J. A. Scheinkman and A. Shleifer (1992), “Growth in Cities,” The

Journal of Political Economy 100(6): 1126-1152. Greif A. (1994), “Cultural Beliefs and the Organization of Society: A Historical and Theoretical

Reflection on Collectivist and Individualist Societies,” The Journal of Political Economy. 102(5): 912-50.

26

Griliches, Z, and J. Hausman (1986), “Errors in Variables in Panel Data,” Journal of Econometrics 31: 93-118.

Hahn, J., J. Hausman, and G. M. Kuersteiner (2007), “Long difference instrumental variables

estimation for dynamic panel models with fixed effects,” Journal of Econometrics 140(2): 574-617.

Hansen, L. (1982), “Large Sample Properties of Generalized Method of Moments Estimators,”

Econometrica 50(4): 1029-1054. Higgins, M., D. Levy, and A. Young (2006), “Growth and Convergence across the United States:

Evidence from County-Level Data,” The Review of Economics and Statistics 8(4): 671-681.

Hausman, J., and W. Taylor (1981), “Panel Data and Unobservable Individual Effects,”

Econometrica 49(6): 1377-1398. Hsiao, C. (1986), Analysis of Panel Data, Cambridge, MA: Cambridge University Press. Islam, N. (1995), “Growth Empirics: A Panel Data Approach,” The Quarterly Journal of

Economics 110(4): 1127-1170. Iyigun, M. (2005), “Geography, Demography, and Early Development,” Journal of Population

Economics 18(2): 301-321. Lee, K., M., H. Perasan, and R. Smith (1997), “Growth and convergence in a multi-country

empirical stochastic Solow model,” Journal of Applied Econometrics 12(4): 357-392. Mankiw, N. G., D. Romer, and D. N. Weil (1992), “A Contribution to the Empirics of Economic

Growth,” The Quarterly Journal of Economics 107(2): 407-437. Moretti, E. (2004), “Workers’ Education, Spillovers, and Productivity: Evidence from Plant-

Level Production Functions,” The American Economic Review 94(3): 656-690. Murray, C. (1984), Losing ground: American social policy, 1950-1980. New York: Basic Books. Newey, W. K. and K. D. West (1987). “Hypothesis Testing with Efficient Method of Moments

Estimation,” International Economic Review 28(3): 777–787. Nickell, S. (1981), “Biases in Dynamic Models with Fixed Effects,” Econometrica 49(6): 1417-

1426. Ransom, R., and R. Sutch (2001), One kind of freedom: The economic consequences of

emancipation, 2nd edition. New York: Cambridge University Press.

27

Rappaport, J. (2007), “Moving to nice weather,” Regional Science and Urban Economics 37(3): 375-398.

Rappaport J., and J. Sachs (2003), “The United States as a Coastal Nation,” Journal of

Economic Growth. 8(1): 5-46. Shapiro, T. (2006), “Race, homeownership and wealth,” Washington University Journal of Law

and Policy 53. Solow, R. (1956), “A Contribution to the Theory of Economic Growth,” Quarterly Journal of

Economics 70(1): 65-94. Data Appendix

Data on income, population, civilian labor, private capital expenditure, persons living in

poverty, number of high-school degree holders, number of African-Americans, land tenure,

religiosity, geography and institutions have been collected from various sources. A summary of

variable definitions and their measurement units is shown in Appendix Table 1.

The summary statistics in Table 1 of the text, and the dynamic panel model estimates of

income per capita in Tables 4-5, utilized county-level Census data from the 1960-2000 Decennial

Censuses. The USA Counties Basic Information database of the Census Bureau provided

information on many of the variables for 1980-2000 Census. Included in this database are county

per capita income (average income earned by the residents of the county), the total population of

the county, civilian labor force residing in the county (defined as the number of people over the

age of 16 in the county who are not employed in the armed forces and are not institutionalized),

number of people living in urban areas in the county (defined below), number of African-

Americans living in each county, persons living under the poverty-level in the county according

to the official poverty definition of the US, and the proportion of residents residing in the county

who are over the age of 25 and have at least a high school degree. These data are publicly

available from the URL: http://www.census.gov/support/DataDownload.htm.

28

The corresponding variables for the years 1960 and 1970 were collected from the County

and City Data Book of the Census (1962 and 1972), which are available on the website of ICPSR

(Inter-University Consortium for Political and Social Research). This site is maintained by the

University of Michigan. Data collected from ICPSR has been obtained from the URL:

http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/2896/system.

In this analysis, the growth rate of labor force in each county has been used. This growth

rate has been defined as the percentage change of civilian labor force in a country from one

decade to the next. To construct this variable for 1960 we obtained the corresponding

information on labor force from the 1950 Census to construct the 1950-1960 change.

The definition of what constitutes as an urban area has changed over time. In 2000, the

definition of urban areas was a core census block groups or census block that had at least 1000

persons per square mile and the surrounding census blocks that have a population density of at

least 500 persons per square mile (http://www.census.gov/geo/www/ua/ua_2k.html). So, a city

with a population density of at least 1000 people per square mile and the surrounding suburbs

with at least 500 people per square mile would be considered to be an urban area. For the years

1960, 1970, 1980 and 1990, the definition of an urban area was less stringent; any area that was

one of the Census designated places with more than 2500 people, or was incorporated in an

urban area (http://www.census.gov/population/censusdata/urdef.txt) was considered to be an

urban area. Therefore, for those areas, people living in a city or a town with at least 2500 people

or, living in an area that was historically considered to be urban would be considered to be living

in an urban area.

Private capital expenditure in the manufacturing sector (measured in millions of US

dollars) of each county for the years 1960, 1970 and 1980 has been obtained from the County

29

and City Data Books of various years. Private capital expenditure has been defined as either a

permanent addition or a major change made by a manufacturing firm and/or the addition or

replacement of any machinery or equipment in the plant (and whose depreciation account was

maintained). The data for 1990 and the definition of capital expenditures have been obtained

from the 1992 Census of Manufactures Report on each county

(http://www.census.gov/prod/1/manmin/92area/92manufa.htm). The 1990 data are in pdf format,

and thus a pdf-to-Excel converter was used to convert the data from pdf to Excel, and then

exported to Stata. Private capital expenditure data for the year 2000 has been obtained from US

Counties Basic Information database. The data from 1960 to 1990 were converted to real 2000

dollars using the personal consumption expenditure deflator from the Bureau of Economic

Analysis (BEA).

For the summary statistics in Table 2 of the text, as well as the regressions in Tables 3, 6,

and 7, historic data on geography, institutions, culture, and human capital were obtained from

various sources. The latitude of each county has been obtained from the Census 2000 U.S.

Gazetteer Files (http://www.census.gov/geo/www/gazetteer/places2k.html). The TIGER

database of Census stores latitude data as decimal degrees, instead of degrees, minutes and

seconds for ease of calculation. Therefore, if latitude of a certain area is 37 degrees, 25 minutes

and 40.5 seconds, it is recorded as 37.427916 by Census (http://www.census.gov/cgi-

bin/geo/tigerfaq?Q20). The decimal degree latitude of each county has been divided by 90 to

normalize it. The closer the value is to 1, the further the county is from the equator, and thus, the

more north/colder that county is relative to the equator. This measure shows the severity of

weather of a county. For example, Los Angeles County, CA has a latitude of 34 degrees 5

minutes and 18.6 seconds, but in decimal degrees, it has been recorded as 34.08851degrees.

30

After normalizing it, the value is 0.3787. Similarly, Cook County, IL has a latitude of 41 degrees

50 minutes and 15.3 minutes, and it is 41.8376 degrees. After normalizing it, the value is 0.4648.

Data on the standard deviation of elevation of each county has been obtained from

Rappaport and Sachs (March, 2003) and has been provided by Jordan Rappaport of the Federal

Reserve Bank of Kansas City. Elevation data were measured in feet, and it measures how high

the land is from sea-level. This data measures how varied the terrain of a county is. The higher

the standard deviation, the more extreme the terrain of the county; the lower the standard

deviation, the terrain is more or less constant. So, if a county has terrain near sea-level and also

high mountains, it is going to have a high value of standard deviation. On the other hand, if a

county has more or less uniform terrain (all the land near sea-level, or all the land are high-lands,

but without much variation in height) the value of standard deviation is going to be low.

Historical data of variables that proxies institution and culture, like percentage of foreign-

born living in a county, land tenure of a county, number of county residents living in urban areas

(areas that have been legally incorporated as cities, towns or boroughs,

http://www.census.gov/population/www/documentation/twps0027/twps0027.html) and number

of illiterate people in a county (number of people living in the county who cannot read or write)

have been collected from the 1890 and 1900 Census Database. The percentage foreign-born was

defined as the proportion of people living in a county who were not born in the United States.

Land tenure has been defined as the total area farmed by owners of those respective farms in a

given county, divided by the total amount of farmland in the county. This number was not given

explicitly; what was given was the number of farm owners who farmed 0-9 acres of farmland,

10-19 acres, 20-49 acres, 50-99 acres, 100-499 acres, 500-999 acres and 1000+ acres. The mid-

point of each segment was multiplied by their respected number of farm owners (for 1000+ acres

31

of farm segment, the number of farmers was multiplied by 1000 to estimate the total number of

farmland owned by farmers farming 1000+ acres) and then they were added together to get the

total acres of farmland farmed by owners.

Religious data on total church attendance and followers of different denominations,

namely Baptists, Calvinists and Catholics, have been obtained from the 1890 Census of

Religious Bodies. Data on the number of Baptists, Catholics, Calvinists and total church

members in a county have been collected from the 1890 Census of Religious Bodies. In this

paper, the Baptist denomination includes Regular (North, South and Colored), Freewill, General,

Primitive and Old Two-Seed denominations. The Calvinists denomination includes Welsh

Calvinist, Presbyterian (Northern and Southern), Cumberland Presbyterian (Regular and

Colored), United Presbyterian, US Reformed Church and American Reformed Church

Organizations. All these historical and religious data can be obtained from the ICPSR database:

http://www.icpsr.umich.edu/icpsrweb/ICPSR/studies/2896/system.

A number of counties changed their area and many new counties formed over time. Large

counties were split to form new counties and some counties were merged to form a new county.

For example, Manitou County in Michigan does not exist anymore and was merged with other

neighboring counties. These redrawing of county boundaries posed a problem because different

social indicators data (such as population, number of people illiterate in the county, land tenure)

changed whenever county boundaries were changed. To ensure that the changing of county

boundaries did not pose a problem with the regression analysis, only counties whose shape

remained the same from the years 1890 onwards were used in the analysis. Data on whether

county boundaries have changed over time has been obtained from the website of Newberry

Library (http://www.newberry.org). The library has electronic files of state maps from different

32

periods that show how the counties in each state evolved over time. The maps for the year 1890

were compared with those of current day (which are also available in the website of Newberry

Library) to see which counties did not change their shape between 1890 and present day.

However, during the time of this analysis, the library did not have data on the evolution of

county shapes for the state of Georgia, and this data has been obtained from the website:

http://www.FamilyHistory101.com.

Appendix Table 1: Description of Variables Used in the Regressions Variable Description Measurement Units Per Capita Income ($)

Average income earned by the residents of the county

In dollar amounts; values from 1960 to 1990 have been converted to 2000 dollars using the personal consumption expenditure deflator from the Bureau of Economic Analysis (BEA)

Population Total number of people living within the boundary of a county

In absolute value

Fraction in Labor Force The number of people over the age of 16 in the county who are not employed in the armed forces and are not institutionalized, divided by the population of the county.

Between 0 and 1

Growth in Labor Force The increase in civilian labor force in a country from one decade to the next

Between 0 and 1

33

Fraction High School Graduate Capital Expenditure

The number of residents over the age of 25 with at least a high school degree in the county, divided by the population of the county A permanent addition or a major change made by a manufacturing firm and/or the addition or replacement of any machinery or equipment in the plant (and whose depreciation account was maintained).

Between 0 and 1 In millions of dollars, and the values have been converted to real 2000 dollars using the personal consumption expenditure deflator from the Bureau of Economic Analysis

Fraction Living in Urban Area

The number of residents living in an urban area as defined by the Census, divided by the total population

Between 0 and 1

Fraction Black The number of African-Americans living in a county, divided by the total population

Between 0 and 1

Land Tenure in 1890 The total area of farmland farmed by their respective owners in the year 1890, divided by the total area under cultivation in a county in 1890

Between 0 and 1

Share Churched in 1890

The total number of people who attended church services in 1890, divided by the total number of people living in the county in 1890

Between 0 and 1

Share Baptist in 1890 The total number of people in a county who identify themselves as Baptists (Regular (North, South and Colored), Freewill, General, Primitive and Old Two-Seed denominations) in 1890, divided by the total population in 1890

Between 0 and 1

Share Calvinist in 1890 The total number of people in a county who identify themselves as Calvinists (Welsh Calvinist, Presbyterian (Northern and Southern), Cumberland Presbyterian (Regular and Colored), United Presbyterian, US Reformed Church

Between 0 and 1

34

and American Reformed Church Organizations) in 1890, divided by the total population in 1890

Share Catholic in 1890 The total number of people in a county who identify themselves as Catholics in 1890, divided by the total population in 1890

Between 0 and 1

Latitude The absolute value of latitude of a county (obtained from the Census)

Between 0 and 1

divided by 90 to obtain the normalized value

Std Dev of Elevation The standard deviation of elevation of a county

In feet

Urban Share in 1890 The number of residents living in an urban area as defined by the Census, divided by the total population in 1890

Between 0 and 1

Share Foreign Born in 1900

The number of residents living in the county who were not born in the United States in 1890, divided by the total population

Between 0 and 1

Illiteracy Rate in 1900 The number of people who cannot read or write in a county in 1890, divided by the total number of people living in the given county in 1890

Between 0 and 1

41

Table 1: Summary Statistics of Social Indicators for Counties by Persistent Poverty Status Not Persistently Poor Persistently Poor Mean Standard

Deviation Mean Standard

Deviation Pooled 1960-2000 Census Data Per Capita Income ($) 12,429 4,839 8,615 3,410 Population 78,576 260,310 24,368 48,481 Fraction in Labor Force 0.423 0.065 0.355 0.057 Growth in Labor Force 0.166 0.309 0.056 0.219 Fraction High School Graduate 0.588 0.188 0.431 0.172 Capital Expenditure ($millions) 34.676 142.399 5.405 19.781 Fraction Living in Urban Area 0.372 0.299 0.251 0.251 Fraction Black 0.070 0.118 0.252 0.247

1960 Census Data Per Capita Income ($) 6,918 1,894 4,295 1,343 Population 61,536 214,978 22,530 41,197 Fraction in Labor Force 0.360 0.038 0.305 0.045 Growth in Labor Force 0.069 0.324 -0.125 0.201 Fraction High School Graduate 0.361 0.105 0.227 0.073 Capital Expenditure ($millions) 14.857 72.808 1.112 3.129 Fraction Living in Urban Area 0.335 0.287 0.213 0.234 Fraction Black

0.076

0.131

0.271 0.259

2000 Census Data Per Capita Income ($) 18,431 3,813 13,199 1,958 Population 97,237 307,612 27,855 60,596 Fraction in Labor Force 0.478 0.048 0.395 0.039 Growth in Labor Force 0.144 0.166 0.068 0.117 Fraction High School Graduate 0.789 0.074 0.642 0.071 Capital Expenditure ($millions) 38.741 163.346 6.419 23.038 Fraction Living in Urban Area 0.415 0.309 0.272 0.260 Fraction Black

0.067

0.110

0.250

0.249

Observations 2,756 338 Notes: “Persistently poor” counties have poverty rates of at least 20% in 1960, 1970, 1980, 1990, and 2000. “Not persistently poor” are all others. Per capita income and capital expenditures are in real $2000 based on the personal consumption expenditure deflator.

42

Table 2: Summary Statistics of Historical Indicators for Counties by Persistent Poverty Status Not Persistently Poor Persistently Poor Land Tenure in 1890 0.800 0.117 0.766 0.146 Share Churched in 1890 0.294 0.120 0.311 0.157 Share Baptist in 1890 0.064 0.081 0.131 0.099 Share Calvinist in 1890 0.019 0.022 0.009 0.016 Share Catholic in 1890 0.059 0.089 0.034 0.122 Latitude 0.431 0.050 0.382 0.039 Std Dev of Elevation 0.097 0.170 0.060 0.071 Urban Share in 1890 0.132 0.220 0.025 0.105 Share Foreign Born in 1900 0.100 0.107 0.026 0.074 Illiteracy Rate in 1900 0.110 0.134 0.356 0.222 Observations 2,287 245 Notes: “Persistently poor” counties have poverty rates of at least 20% in 1960, 1970, 1980, 1990, and 2000. “Not persistently poor” are all others. The historic Census data include only counties without redefined borders between the relevant year (1890, 1900) and 1960. See the Data Appendix for detailed variable definitions.

43

Table 3: Linear Probability Estimates of the Probability of Being Persistently Poor

Variables (1) (2) (3) (4) (5)

Farm Owner Percentage in 1890

-0.098(0.047)

0.040 (0.048)

0.033(0.050)

-0.157 (0.050)

-0.202(0.050)

Normalized Latitude

-1.746(0.116)

-0.103(0.166)

-0.136(0.167)

-1.753 (0.151)

-1.362(0.159)

Std. Dev of Elevation

-0.084(0.045)

-0.045(0.042)

-0.031(0.042)

-0.067 (0.045)

-0.060(0.044)

Foreign-born Percentage in 1900

0.075(0.071)

-0.021(0.081)

0.107 (0.075)

0.029(0.085)

Urban Share 1890

-0.071(0.028)

-0.081(0.028)

-0.155 (0.029)

-0.138(0.030)

Illiteracy Rate 1900

0.839(0.045)

0.852(0.050)

Church Share 1890

-0.193(0.046)

0.016 (0.047)

Baptist Share 1890

-0.316(0.086)

0.335(0.081)

Calvin Share 1890

-0.623(0.261)

-1.382(0.272)

Catholic Share 1890

-0.021(0.071)

0.163(0.075)

Constant

0.926(0.058)

0.057(0.087)

0.062(0.087)

0.981 (0.077)

0.850(0.078)

Adjusted R2 0.091 0.215 0.213 0.101 0.119Observations 2,498 2,474 2,474 2,474 2,474 Persistently poor 237 237 237 237 237

Notes: Standard errors are in parentheses. The dependent variable is a binary variable equal to one for counties that are persistently poor between 1960 and 2000 and zero otherwise.

44

Table 4: Dynamic Panel Estimates of Income Convergence

(1) (2) (3)

OLS GMM

∆Xit exogenous GMM

∆Xit predetermined Lag Income per Capita

0.442

(0.016) 0.436

(0.018) 0.460

(0.026) Fraction High School 0.364

(0.046) 0.396

(0.047) 0.236

(0.075) Capital Spending (x1000) 0.042

(0.029) 0.069

(0.021) -0.156 (0.185)

Labor Force Growth 0.013 (0.015)

0.026 (0.012)

-0.031 (0.025)

Urban Share 0.093 (0.013)

0.085 (0.013)

0.159 (0.036)

Black Share -0.288 (0.049)

-0.222 (0.045)

-0.191 (0.163)

Constant 0.193 (0.010)

0.187 (0.010)

0.221 (0.020)

Convergence Rate

0.082 (0.003)

0.083 (0.004)

0.077 (0.006)

Hansen's J (p-value) 101.582 [0.000]

9.975 [0.076]

C-statistic 91.607 [0.000]

Observations 3,094 3,090 3,090 Notes: Asymptotic standard errors are reported in parentheses. The p-value for the chi-square distribution is reported in square brackets.

45

Table 5: Dynamic Panel Estimates of Income Convergence, by Persistent Poverty Status Non-Persistent Poverty Persistent Poverty

(1) (2) (3) (4) (5) (6)

OLS

GMM ∆Xit

Exog

GMM ∆Xit

Predet OLS

GMM ∆Xit

Exog

GMM ∆Xit

Predet Lag of Income

0.435

(0.016) 0.395

(0.020) 0.454

(0.030) 0.431

(0.049) 0.433

(0.043) 0.375

(0.052) Fraction High School

0.356 (0.047)

0.440 (0.050)

0.180 (0.070)

0.501 (0.145)

0.449 (0.134)

1.016 (0.277)

Capital Spending (x1000)

0.041 (0.029)

0.064 (0.021)

-0.158 (0.172)

0.247 (0.158)

0.145 (0.140)

0.371 (1.249)

Labor Force Growth 0.010 (0.013)

-0.001 (0.014)

-0.031 (0.024)

0.187 (0.048)

0.210 (0.044)

0.165 (0.087)

Urban Share 0.101 (0.014)

0.096 (0.014)

0.202 (0.035)

0.024 (0.049)

-0.001 (0.047)

-0.076 (0.127)

Black Share -0.321 (0.053)

-0.326 (0.049)

-0.209 (0.131)

-0.123 (0.139)

0.019 (0.112)

0.340 (0.336)

Constant 0.199 (0.011)

0.202 (0.010)

0.238 (0.017)

0.155 (0.037)

0.172 (0.031)

0.034 (0.069)

Convergence 0.083 (0.004)

0.093 (0.005)

0.079 (0.007)

0.084 (0.011)

0.084 (0.009)

0.098 (0.014)

Hansen's J [p-value] 147.290 [0.000]

7.575 [0.181]

17.363 [0.067]

5.762 [0.330]

C-statistic

139.715 [0.000]

11.601 [0.041]

(1) vs (4) (2) vs (5) (3) vs (6) Equal Coefficient Wald Test [p-value]

14.524 [0.024]

26.043 [0.000]

20.988 [0.002]

Equal Convergence Wald Test [p-value]

0.007 [0.931]

0.662 [0.415]

1.710 [0.190]

Observations 2,752 2,752 2,752 338 338 338Notes: Asymptotic standard errors are reported in parentheses. The p-value for the chi-square distribution is reported in square brackets.

46

Table 6: Pooled Regressions, Second-Stage Results (1) (2)

Farm Owner in 1890 -0.243 (0.029)

-0.220 (0.236)

Farm Owner in 1890, squared -0.021 (0.158)

Normalized Latitude 0.070 (0.098)

4.349 (1.214)

Normalized Latitude, squared -4.901 (1.454)

Std. Dev of Elevation 0.197 (0.025)

0.140 (0.046)

Std. Dev to Elevation, squared 0.024

(0.048)

Baptist Share in 1890 0.401 (0.051)

0.542 (0.099)

Baptist Share in 1890, squared -0.491 (0.196)

Calvin Share in 1890 0.338 (0.154)

-0.036 (0.332)

Calvin Share in 1890, squared 4.615

(3.034)

Catholic Share in 1890 0.016 (0.042)

0.158 (0.085)

Catholic Share in 1890, squared -0.327 (0.149)

Foreign-born Share in 1900 0.215 (0.047)

0.607 (0.119)

Foreign-born Share in 1900, squared -0.893 (0.292)

Illiteracy Rate in 1900 0.042 (0.029)

0.468 (0.075)

Illiteracy Rate in 1900, squared -0.606 (0.102)

Urban Share in 1890 0.075 (0.017)

-0.058 (0.043)

Urban Share in 1890, squared 0.165

(0.060)

Constant 9.023 (0.051)

8.053 (0.267)

Observations 2,473 2,473 Adjusted R2 0.108 0.132

Notes: The dependent variable is residual growth from the pooled, pre-determined model (Regression 3 of Table 4). Standard errors are in parenthesis.

47

Table 7: Second-Stage Results by Persistently Poor Status Not Persistently Poor Persistently Poor (1) (2) (3) (4)

Farm Owner in 1890 -0.280 (0.030)

-0.384 (0.249)

-0.110 (0.065)

-0.075 (0.462)

Farm Owner in 1890, squared

0.064 (0.165)

-0.040 (0.327)

Normalized Latitude 0.042 (0.102)

4.346 (1.222)

-0.296 (0.248)

-2.747 (4.357)

Normalized Latitude, squared

-4.961 (1.458)

3.125 (5.390)

Std. Dev of Elevation 0.208 (0.024)

0.166 (0.046)

-0.160 (0.105)

-0.437 (0.276)

Std. Dev of Elevation, squared

-0.004 (0.047)

0.778 (0.705)

Baptist Share in 1890 0.268 (0.054)

0.312 (0.103)

0.309 (0.084)

0.661 (0.224)

Baptist Share in 1890, squared

-0.243 (0.199)

-0.905 (0.503)

Calvin Share in 1890 0.188 (0.153)

-0.669 (0.338)

0.025 (0.486)

-1.336 (0.867)

Calvin Share in 1890, squared

9.638 (3.125)

12.272 (6.392)

Catholic Share in 1890 0.030 (0.045)

-0.012 (0.095)

-0.071 (0.070)

-0.186 (0.188)

Catholic Share in 1890, squared

0.060 (0.193)

0.184 (0.237)

Foreign-born Share in 1900 0.301 (0.049)

0.533 (0.119)

-0.462 (0.123)

-0.746 (0.469)

Foreign-born Share in 1900, squared

-0.482 (0.296)

0.610 (0.964)

Illiteracy Rate in 1900 0.244 (0.035)

0.594 (0.089)

-0.160 (0.052)

-0.203 (0.143)

Illiteracy Rate in 1900, squared

-0.601 (0.143)

0.005 (0.144)

Urban Share in 1890 -0.008 (0.016)

-0.149 (0.043)

0.163 (0.068)

0.111 (0.170)

Urban Share in 1890, squared

0.205 (0.059)

0.119 (0.225)

Constant 9.078 (0.053)

8.181 (0.270)

8.879 (0.121)

9.368 (0.912)

(1) vs (3) (2) vs (4) Equal Coefficient Wald Test [p-value]

1692.543 [0.000]

1673.917 [0.000]

Observations 2,236 2,236 237 237 Adjusted R2 0.133 0.159 0.204 0.212

Notes: The dependent variable in Regressions 1 and 2 is residual growth from the pre-determined model estimated on the non-persistently-poor counties only (Regression 3 of Table 5). The dependent variable in Regressions 3 and 4 is residual growth from the exogenous model estimated on persistently poor counties only (Regression 5 of Table 5). Standard errors are in parenthesis.

on persistent poverty in a rich country · of persistent poverty status at the close of the 20th...

Documents