one-dimensional scaling without apologies · inversion and speci cation tests using these combined...
TRANSCRIPT
One-dimensional scaling without apologies∗
Tasos Kalandrakis†
July 23, 2019
Abstract
I propose a fully non-parametric framework for estimation of and inference on the loca-
tion of political parties in a one-dimensional scale. I derive a balance condition sufficient for
identification and consistent estimation of parties’ order from left to right, and a weaker error
rate condition that partially identifies the order. The method relies only on ordinal data on
individual preference rankings over parties and is robust even to systematic or strategic error.
Capitalizing on recent advances in testing moment inequalities, I build a strong and weak spec-
ification test of the adequacy of the one-dimensional model and an inferential framework to
test hypotheses about the relative location of parties or to probe the validity of one-dimensional
scales produced by other methods. Under stronger cardinality assumptions on preferences, I rely
on arguments from revealed preference theory to execute similar tests efficiently, obviating the
need for confidence set construction using an approximating grid of party locations. I illustrate
these techniques using CSES survey data on the German party system.
∗Earlier versions of this paper have benefited from the audiences at the 2015 APSA and the2016 MPSA meetings, a Political Economy conference at Emory University, and a Riker seminarat the University of Rochester. I especially thank Kei Kawai, Keith Poole, and Sergio Montero fortheir comments. I would also like to thank the Center for Integrated Research Computing (CIRC)at the University of Rochester for providing computational resources for the analysis performed inSection 5.†Department of Political Science and Department of Economics, University of Rochester. E-
mail: [email protected]
1
It is virtually impossible to discuss politics without reference to a liberal-conservative or
left-right scale. Such scales have strong and broad resonance in political discourse and can be
traced back to the origins of modern representative democracy. In addition to their prominence in
both informal and expert political commentary, one-dimensional representations of the policy space
are widely used in scientific work, empirical and theoretical. Of course, despite its omnipresence,
the one-dimensional scale can rarely be defended literally. Any attempt to pin down or elaborate
the content of “left-right” or “liberal-conservative” runs into a multitude of stances and political
positions on diverse and often tenuously connected issue areas both across countries and across
time. Yet, citizens and pundits evidently find it possible to distill opinions on such diverse issue
areas into a powerful summary statistic. Endorsing the statistical nature of the one-dimensional
scale, the question pursued in this paper is whether we can rigorously determine its adequacy?
And, if a one-dimensional summary cannot be rejected (statistically), can it be recovered from the
data and scrutinized through rigorous inference procedures?
In answering the first question, it is important to ensure that evidence to reject the unidi-
mensonal model bears on its statistical inadequacy alone and not on any discrepancy between data
and extra ancillary or convenience assumptions forced into the model. To guard against such false
negatives, the approach pursued in this paper is fully non-parametric. By that I mean that both
the systematic and statistical error components of the model are specified as infinite dimensional
objects disciplined only by the bearest structure consistent with a one-dimensional interpretation.
As a result the preferences of individuals who consume the scale and inform its reconstruction need
not be order-restricted (in the sense explained in Footnote 7) and for the bulk of the analysis are
only ordinal in nature. Similarly, under the null hypothesis that one-dimensionality holds, any
observed discrepancy from these systematic preferences is assumed to originate from error that
follows general, preference-specific distributions, and can enter the data in arbitrary and, within
some tolerance, even systematic form.
The data for this study are assumed to take the form of individual preference rankings of a
set of objects to be scaled. These objects can be political parties, individual candidates, or other
choice items in elections or markets, and to fix ideas I heretofore refer to them as parties. I start with
the assumption that systematic individual preferences over parties can be represented by a quasi-
concave utility function. This is an ordinal assumption weaker than single-peakedness, arguably the
2
weakest assumption on individual preferences consistent with a one-dimensional interpretation of
the scale. Under this assumption, I derive a condition sufficient for identification of the ordering of
political parties from left to right, which I refer to as a balance condition. It requires that sufficient
(relative to the incidence rate of consequential erroneous responses) fractions of respondents least
prefer the left-most and right-most parties in every triplet of parties. The condition can be used to
construct a family of consistent estimators, and I exhibit a simple score estimator to that effect. The
balance condition is a joint restriction on the amount of error in responses and the distribution of
individual ideological leanings across the spectrum. A weaker error rate condition sheds the latter
restrictions thus relinquishing identification of the order of parties while still imposing testable set-
identification restrictions on that order. It amounts to a restriction that is necessary and sufficient
to prevent the fraction of individuals that rank the middle party in every triplet of parties as least
preferred being the maximum such fraction over the parties in the triplet.
A key contribution of this study is to interpret the balance and error rate conditions as
moment inequalities, making this approach amenable to the inferential framework of the burgeoning
literature on testing moment inequalities.1 These methods can be used to test whether the balance
or error rate condition are satisfied for specific orderings of parties. Inversion of these tests can
be used to build confidence sets of party orderings. As proposed in Romano and Shaikh (2008),
Andrews and Guggenberger (2009), and Andrews and Soares (2010), a specification test can then
be based on these confidence sets, rejecting the one-dimensional model when the corresponding
confidence set is empty. I therefore propose a strong test of one-dimensionality as a specification
test of this type based on the balance moment inequalities. A weak test of one-dimensionality
is a specification test built using the error rate moment inequalities. Both specification tests are
asymptotically valid but may be asymptotically conservative. Given the minimal structure on the
data generating process imposed by the latter test, there is very little room to defend the adequacy
of the one-dimensional model if the null is rejected using the weak specification test.
I also explore the additional purchase of cardinality assumptions, strengthening quasi-
concavity to concavity of individual preferences. Paired with an added (aggregate but otherwise
1It is outside the scope of this paper to fully review this literature. Canay and Shaikh (2017)
offer a recent review and I further discuss works most relevant to this study in Sections 2 and 4.
3
standard) assumption of zero expectation of party-specific response error,2 this assumption further
restricts relative party locations for any given ordering of parties. Therefore, when combined with
either the balance or error rate condition, concavity generally narrows the identified set of party
locations but may not be supported by the data. Unfortunately, confidence sets built through test
inversion and specification tests using these combined moment conditions require an approximation
grid on the space of party locations. To avoid the use of such approximation grids, I rely on argu-
ments from revealed preference theory to convert the concavity moment inequalities (expressed as
a function of party locations) to a set of unimodality of score expectations inequalities (expressed
as a function of party orders from left to right). Specification tests of the combined concavity and
error rate or balance moment inequalities can be performed efficiently using this conversion.
I implement these tests using the practical two-step procedure for testing moment inequal-
ities of Romano, Shaikh and Wolf (2014) and illustrate them using German CSES survey data
in Section 5. The data reject the balance identification condition and fail the strong test of one-
dimensionality across all elections. But, the weak test of one-dimensionality is not rejected in a
subset of these elections, and in these cases the corresponding confidence sets of party orders are
narrow and informative. These confidence sets are broadly consistent with a conventional left-right
scale, but in a subset of elections they may represent a Libertarian–Totalitarian or Establishment–
Antiestablishment dimension. The extra assumption of concavity of individual preferences receives
mixed support and the joint error rate and concavity specification test is rejected in half of the
cases when the weak test of one-dimensionality alone is not rejected.
The present work is related to a number of diverse literatures in Political Science, Economics,
and Psychology, besides the works already cited above. Within economics, its reliance on preference
shape restrictions connects it with the broad literature on non-parametric estimation of models of
choice as in, for example, Manski (1975, 1988); Matzkin (1992); Varian (1982). With a more specific
political economy focus, several works employ non-parametric and/or moment inequality methods:
Merlo and de Paula (2017) recover the distribution of preferences in the electorate; Henry and
2This analysis only requires data to convey expected cardinal utility levels. Though still de-
manding a lot of the data, the analysis of Eggers and Vivyan (2018) provides prima facie evidence
that the sympathy score surveys used in the application of Section 5 do contain individual cardinal
information.
4
Mourifie (2013) test and reject the pure Euclidean spatial model; Kawai and Watanabe (2013)
recover the fraction of strategic voters in the electorate. Since its inception, the scaling literature
in Political Science has relied on data similar to that used in this study,3 for example, Brady (1989,
1990); Cahoon and Hinich (1976); Cahoon, Hinich and Ordeshook (1978); Hinich (2005). The
bulk of scaling work in Political Science, though, uses roll-call votes to recover legislator preference
parameters (e.g., Canen, Kendall and Trebbi (2018); Clinton, Jackman and Rivers (2004); Heckman
and Snyder (1997); Iaryczower and Shum (2012); Martin and Quinn (2002); Poole and Rosenthal
(1985, 1997)). Even the non-parametric version of these roll-call models (Poole (2000); Tahk (2018))
imposes a strong order restriction on individual preferences. Poole (2005) expands on the connection
of this literature with the long tradition of non-parametric scaling or unfolding in Psychology, where
seminal are the non-parametric scales of Mokken (1971). His methods and its variants also rely on
certain stochastic order restrictions variably known in this literature as item monotonicity, double
monotonicity, or monotone homogeneity, that bear a connection to order restrictions on individual
respondent preferences or abilities, and are ably reviewed in, among others, Post (1992); Sijtsma
and Molenaar (2002); Sijtsma and Meijer (2007); van Schuur (2011). In the fully nonparametric
approach pursued here the individual respondents cannot be ordered, though their preferences
maintain enough structure essential for a one-dimensional scaling of the parties.
Other methods to construct one-dimensional scales used in Political Science take the con-
ventionally construed left-right dimension as given and attempt to recover party locations on that
dimension either by analyzing the content of parties’ declarations (e.g., Volkens et al. (2018)) or by
eliciting the locations through surveys of experts (e.g., Castles and Mair (1984)) or the electorate
(e.g., Aldrich and McKelvey (1977)). By design, these methods do not preclude the existence of
additional dimensions with different content nor do they probe the statistical significance of extra
dimensions. On the other hand, the data used in this study allow all possible relevant dimensions
to determine the preference rankings or sympathy scores over parties and for that reason do not a
priori constrain the content or number of relevant dimensions.
3Aldrich and McKelvey (1977) level several objections to the use of such data (page 111, second
paragraph) because of the reliance of extant methods on parametric (Euclidean) restrictions on the
preferences of respondents and the absence of specification tests for these assumptions. Both of
these concern are fully addressed in the present framework.
5
1 Setup
Consider J , J ≥ 3, political parties indexed by j = 1, . . . , J . Each party is characterized
by its location in D-dimensional Euclidean space, D ≥ 1, with the location of party j denoted
by xj ∈ RD. The possibility D > 1 is allowed at this point only as an alternative to the main
working hypothesis that parties are located in a one-dimensional space (D = 1). The restriction
to one dimension is only a necessary component to an interpretable one-dimensional scale, as such
an interpretation relies primarily on a certain alignment of preferences of the population over these
parties. Before fully spelling out the uni-dimensionality assumption, I first introduce the necessary
restriction D = 1 jointly with a standard orientation restriction:
D = 1 and xjL < xjR for some known parties jL, jR.(D)
Assumption (D) fixes both the dimensionality of the space and the relative location of two parties
in that space. The latter restriction is necessary in some such form for identification purposes, as
any one-dimensional scale is possibly identified only up to reflection.
By virtue of assumption (D), the parties are naturally ordered from left to right,4 and
this ordering (besides the parties’ actual location) is of central focus in this analysis. We may
summarize this ordering with a bijection π : {1, . . . , J} → {1, . . . , J} that maps from a possible
order r = 1, . . . , J, to a party j that is located in that order, so that xπ(1) < xπ(2) < . . . < xπ(J). I
have precluded weak inequality above, that is, I assume that party locations are pairwise distinct.
This is a simplifying assumption for the purposes of exposition and it ensures there is a unique
ordering π of the party locations x. Because multiplicity of true orderings due to parties sharing a
location is ruled out by a forthcoming identification restriction, all results stated in the main body
of the paper hold (with only minor adjustments to some definitions) if we allow parties to share a
location at this stage of the analysis (see Appendix D).
In general, both the ordering π and the actual location of the parties x = (x1, . . . , xJ),
is (up to assumption (D)) unknown to the analyst. The information that is assumed available
4I emphasize the term “left-right” is used for orientation purposes only. Should a one-
dimensional scaling prove supported by the data, it is a whole other matter whether the produced
scale is indeed interpretable in the traditional left-right fashion.
6
and may allow inferences on these quantities arises from N survey respondents. Each individual
respondent i = 1, . . . , N, reports a vector of scores yi ∈ Y ⊂ RJ where yi = (yi1, . . . , yiJ) are
sympathy/preference scores for the J parties with higher scores indicating preferred (better) parties
to individual i.5 Underlying this notion of preference is an unobserved preference/utility/sympathy
function ui : R→ R corresponding to each individual i. I assume that the score that respondent i
assigns to party j is related to the location of that party, xj , according to:
yij = g(ui(xj), εij), all i, j.
Here g : R2 → R is a function that takes the level of i’s systematic preference function evaluated
at xj and an error εij as inputs. The vector of error terms εi = (εi1, . . . , εiJ) captures all forces that
might push a respondent to report a sympathy score different than what her systematic preference
function dictates. This error could be additive, multiplicative, or take any other form.
To reduce nomenclature, I assume individual respondent preferences u and error terms ε
are jointly drawn independently and identically across respondents according to a distribution Φ.
As a result, each individual respondent is characterized by a pair (ui, εi) ∈ U ×RJ corresponding to
her systematic sympathy or utility function ui, and a vector of error terms (one coordinate for each
party) εi.6 Note that the conditional distribution over the noise terms εi may vary across respondent
preferences, ui. Furthermore, no restriction whatsoever is placed on the distribution of εi across
its coordinates, allowing for dependent and heteroscedastic errors across parties. Throughout, I
assume Y is a compact set, an assumption that is easily met in the case these data are recorded
in discrete survey scales and spares the need for further integrability conditions on the population
5I follow the convention of using superscripts to index respondents and subscripts to index
parties. I omit the superscript when referring to u or y as a random variable.6The analysis could be equivalently carried out by assuming Φ has support on RJ ×RJ , that is,
reducing ui to a vector in RJ containing the values of ui evaluated at the N party locations. By
standard arguments from revealed preference theory, it suffices to impose the shape restrictions (Q)
and (C) introduced later in this section on the restricted function evaluated at the J party locations,
for if these restrictions are met in the function thus restricted, then there exists an extension of
this function to the entire real line that also satisfies these shape restrictions.
7
distributions.
At this level of generality, the structure imposed on the problem permits little if any political
interpretation of the party locations x, certainly not based on familiar notions of ideological position
on a political scale such as a traditional left-right scale. I entertain a pair of related assumptions
on the functions u that do allow such interpretation (of course, the caveat entered in Footnote 4
still applies). The first part of the analysis only relies on the weakest such assumption, namely,
quasi-concavity:
Quasi-concavity: For all u ∈ U , and for all x, x′, x′′ such that x ≤ x′ ≤ x′′,(Q)
u(x′) ≥ min{u(x), u(x′′)}.
Assumption (Q) is consistent with u being discontinuous or having plateaus, that is, open sets
at which it is constant, including around a non-unique peak. When expressed in terms of the
preferences over the underlying set of parties, it is weaker than the single-peaked property of Black
(1948) and Arrow (1951), or the related domain restriction of Sen (1966), as it only requires that the
middle alternative in any triplet cannot be strictly least preferred over the other two alternatives.
In those terms, (Q) amounts to condition (1) of Dummett and Farquharson (1961) (page 35), a
weak sufficient condition for the existence of a majority core point. Assumption (Q) is ordinal in
nature and unburdens us of any need to impute cardinal interpretations on the respondents’ scores,
y. Such relief always comes at a cost. In particular, assumption (Q) will allow us to identify the
ordering of political parties π from the survey responses under certain conditions, but does not
otherwise restrict the location of the parties, x.
We may further restrict party locations by introducing cardinality assumptions on prefer-
ences u. For that purpose, and in keeping with the overall goal of imposing minimal assumptions,
I will employ the following concavity restriction in Section 3:
Concavity: For all u ∈ U , and for all x, x′, x′′ such that x ≤ x′ ≤ x′′,(C)
(x′′ − x′)(u(x′)− u(x)) ≥ (x′ − x)(u(x′′)− u(x′)).
While concavity is a fairly weak condition compared to standard parametric assumptions, it is
certainly stronger than quasi-concavity and as a result may not be supported by the data. Every
8
function u that satisfies (C) also satisfies (Q) but not vice versa. Neither assumption implies in
any way, shape, or form that population preferences are order restricted7 according to the relative
location of the respondents’ bliss point from left to right: Any two respondents may have arbitrary
differently shaped functions within each class, whether they share or not an ideal point (or plateau).
2 Identification & estimation of ordering π
In this section I consider identification and estimation of the ordering of political parties,
π, under the core assumptions (D) and (Q). Some notation is necessary before we proceed. Let all
the possible orderings of party locations be given by set Π and let p denote a generic element of
Π. Shape restriction (Q) is characterized by a property that holds on triplets of alternatives and,
anticipating the use of sample analogues of this condition, define the set of all possible triplets of
parties T := {τ ⊆ {1, . . . , J} | #τ = 3}. For all τ ∈ T and all p ∈ Π, define a certain restriction of
ordering p as a function denoted pτ : {1, 2, 3} → τ that takes the form:
pτ (r) := jr, where τ = {j1, j2, j3} and p−1(j1) < p−1(j2) < p−1(j3).
The function pτ takes a party’s relative (according to ordering p) location in the triplet as an input
(1, 2, or 3), and returns the party that holds this position in triplet τ . Now, for every triplet τ and
all r = 1, 2, 3, define the population quantity
Pr(τ) := P[{πτ (r)} = arg minj∈τ
u(xj)],(1)
that is, for the trio of parties in τ , Pr(τ) is the fraction of respondents in the population that
prefer the r-th ordered party from left to right the least. Because our assumptions do not rule out
indifferences, it is the case that
(2) P1(τ) + P2(τ) + P3(τ) ≤ 1, for all τ.
7 Therefore, both (Q) and (C) allow an individual with ideal x (i.e., such that u(x) ≥ u(x) for
all x) to prefer x over x′, x < x′, while an individual with ideal x′ < x may prefer x′ over x.
9
Figure 1: Identification of order
x1 x2 x3
i=3
i=2 i=1u1(x3)>u1(x2)>u1(x1)u2(x2)>u2(x3)>u2(x1)u3(x1)>u3(x2)>u3(x3)
J = 3, τ = {1, 2, 3}, and jL = 1, jR = 3 according to (D). Without response error, if the populationcomprises only individuals such as i = 1, 2 (that is, P3(τ) = 0) it is impossible to discriminatebetween x1 < x2 < x3 and x1 < x3 < x2. If the population includes respondent i = 3 and at leastone of i = 1, 2, the true ordering x1 < x2 < x3 is identified.
With this notation in place and in order to better understand the nature of the identification
problem at hand, I first discuss the relatively straightforward case when respondent sympathy
scores reflect true preferences without error.
No error in responses: In this case, we can take full advantage of the shape restriction (Q)
and its key manifestation in the data
(3) P2(τ) = 0, for all τ,
that is, the middle party in any triplet τ cannot be uniquely least preferred. Whenever we observe
some party in a triplet that is strictly least preferred by some respondent, we can safely conclude
that this party is not the middle party in that triplet. This observation, when applied to all
triplets, along with the orientation restriction of (D), would appear sufficient for identification.
Unfortunately that is not the case.
To illustrate, consider Figure 1 where for the purposes of this discussion it is assumed
that there are only three parties (J = 3) and at most three possible preference functions u in the
population as depicted for individuals i = 1, 2, and 3. Imposing assumption (D) with jL = 1, jR = 3,
the three parties are located at x1 < x2 < x3 and our estimation problem amounts to locating party
10
2 to either the left of party 1; or to the right of party 3; or in between these two parties as is actually
the case in this example. If the population only contains respondents with the preferences of i = 1, 2,
then even a sample of size one bars party 1 from being in between parties 2 and 3, by (3). Because,
by restriction (D), party 1 cannot be the right-most party of the three, we conclude it must be the
left-most. But, with the population only comprising individuals with the preferences of i = 1, 2,
we cannot further discriminate between orderings x1 < x2 < x3 or x1 < x3 < x2. But, continuing
with the example of Figure 1, if the population comprised both individuals with the preferences of
i = 3 and at least one of i = 1, 2 with positive probability, then we can fully recover the correct
ordering x1 < x2 < x3. The balance condition (B0) of Theorem 1 exactly requires that both the
left-most and the right-most parties in each triplet are least preferred by some positive fraction of
respondents in the population.
Theorem 1. Assume (D) and (Q), and that yij > yij′ ⇔ ui(xj) > ui(xj′) for all i, j, j′. If
(B0) Pr(τ) > 0, for all τ, r = 1, 3,
then the party ordering π is identified.
The sufficiency of condition (B0) follows from more general results established later in this
section, but the preceding discussion makes the argument fairly clear: (B0) coupled with (Q)
allows us to identify the true ordering of parties in each triplet up to orientation using (3), and
then restriction (D) inductively nails the orientation of all triplets. Note that the condition is not
necessary when J > 3. This is illustrated in Figure 2, where there are four parties (J = 4) and only
two preference types in the population. Condition (B0) is not satisfied for the triplet τ = {1, 2, 3}
since party 3 is never ranked below both parties 1 and 2 in the population. Nevertheless, we can
infer that party 1 is between parties 2 and 4 from triplet τ = {1, 2, 4} and that party 3 is between
parties 1 and 4 from triplet τ = {1, 3, 4}. With this information, the true ordering is recovered
using (D) imposed for any pair of parties jL, jR = 1, . . . , 4.
Allowing for response error: Once we admit response error, we cannot infer the exact ordering
with probability one in any finite sample. Nevertheless, a strengthening of the balance condition
(B0) to adjust for response error will suffice to ensure identification. To get to the specifics, I define
11
Figure 2: Balance sufficient but not necessary for identification (J > 3)
x2 x1 x4
i=2i=1
u1(x4)>u1(x3)>u1(x1)>u1(x2)u2(x1)>u2(x3)>u2(x2)>u2(x4)
x3
J = 4, and population comprises only preferences of i = 1, 2, both with positive probability.Without response error, condition (B0) is not met for triplet τ = {1, 2, 3}, yet the ordering π isidentified because the ordering for triplets {1, 2, 4} and {1, 3, 4} is.
certain rates of correct and erroneous survey responses. First, for all τ , define
qc(τ) := minr=1,3
P[{πτ (r)} = arg min
j∈τyj
∣∣∣∣πτ (r) ∈ arg minj∈τ
u(xj)
].
qc(τ) is a highest lower bound (across r = 1, 3) on the probability of responses that ‘correctly’8
report the r-th party in τ as being strictly least preferred, conditional on that party being least
preferred. Thus, conditional on being a respondent who least prefers either of the two extreme
parties in a triplet, there is probability of at least qc(τ) of uniquely assigning the lowest sympathy
score to that actually least preferred party in triplet τ .
On the other end, for all τ we define a consequential error response rate
qe(τ) := P[{πτ (2)} = arg minj∈τ
yj ].
Accordingly, there is error probability qe(τ) that the middle party in the triplet receives a strictly
8‘Correctly’ can be used literally here if the fraction of respondents that are indifferent between
any pair of parties is null in the population. Ansolabehere and Brady (1989) suggest reasons the
incidence of indifferences may not be negligible. If so, qc(τ) encompasses responses that report only
one party (that is extreme in τ) as strictly least preferred in τ even though two or all three parties
in τ are tied at that ranking.
12
minimum sympathy score. The type of error captured by qe(τ) is termed “consequential” as it only
captures errors that violate condition (Q) and its key implication in equation (3). This error rate
is generally much smaller than the probability that respondents report their preferences with some
error. Not only does this rate pertain exclusively to errors with respondents reporting the middle
party as their least preferred party, but also this party must receive a strictly minimum sympathy
score. Thus, error responses in which two or more parties including the middle party receive
the same lowest sympathy score do not count into the calculation of qe(τ). This is particularly
relevant because typically surveys record sympathy scores on a discrete scale (e.g., from 1 to 10)
and the probability of ties in sympathy scores is therefore non-negligible (even if that probability
is null when it comes to actual preferences). Furthermore, one of the most common response error
attributions is to presume that respondents accurately record their sympathy towards a group of
most preferred parties and then lump a large number of their least preferred parties at the bottom
of their sympathy scale.
We obtain the following generalization of Theorem 1 to the case response error is possible:
Theorem 2. Assume (D) and (Q). If
(B) Pr(τ) >qe(τ)
qc(τ), for all τ, r = 1, 3,
then the party ordering π is identified.
Just like condition (B0), condition (B) is a balance condition requiring that there exists a
sufficient fraction of respondents in the population that least prefer the left-most extreme party in
each triplet and a sufficient fraction of respondents in the population9 that least prefer the right-
most extreme party in each triplet. The required fraction depends on the rate of consequential error
response. If there is zero such error (qe(τ) = 0) then as long as qc(τ) > 0, condition (B) reduces
to condition (B0) and any positive fraction is sufficient. The balance requirement becomes tighter
the larger the rate qe(τ) is and the lower the bound on correct bottom responses, qc(τ), is. To give
an example, assuming no indifferences in true preferences, if respondents correctly10 report their
least preferred party at least 60% of the time independent of the actual least preferred party (that
9In the absence of indifferences, these two fractions sum to one by equations (2) and (3).10Responses may still contain error in the ranking of the first and second preferred parties.
13
is, with probability qc(τ) > 0.6) and consequentially err by reporting the middle party as uniquely
least preferred with probability qe(τ) = 0.1, then (B) amounts to P1(τ),P3(τ) > 16 . Condition (B)
is a sufficient condition. It is certainly not necessary because, as already shown for condition (B0),
it need not hold for all triplets τ . It is conservative because it places little other structure on the
nature of response errors, but is by no means guaranteed to hold. It is less likely to hold (even
if one-dimensonality holds) when extreme parties in the scale are part of the triplet, as they are
likely least preferred by a large fraction of respondents, leaving a small complementary fraction of
least sympathizers for the other extreme party in that triplet.
To further parse condition (B) and Theorem 2 observe that (by (2)) a necessary condition
for (B) to hold is
(R) qc(τ) > 2qe(τ) for all τ.
The error rate bound (R) requires that respondents are at least twice as likely to report a least
preferred extreme party as strictly least preferred than to incorrectly report the middle party as
their strictly least preferred party in each triplet τ . This appears to be a minimal requirement on
the data generating process in order for data to provide intelligible information on the ordering of
parties. To substantiate this claim, define a population response analogue of Pr(τ),
Qr(τ) := P[{πτ (r)} = arg minj∈τ
yj ].
Qr(τ) is defined on the basis of reported preferences, while Pr(τ) in terms of true preferences. While
the counterpart of inequality (2) still applies for these population response quantities, equation (3)
need not as it is now possible that Q2(τ) > 0 for any τ . In fact, we have Q2(τ) = qe(τ). Clearly,
responses y provide poor guidance as to the relative location of parties in τ if the response analogue
of unimodality condition (Q) is violated so often so that
(¬Q) Q2(τ) > maxr=1,3{Qr(τ)},
that is, if the middle party is reported as strictly least preferred with higher probability than either
of the two extreme parties in triplet τ . The following Theorem establishes that condition (B)
14
preserves the response analogue of inequality P2(τ) < Pr(τ), r = 1, 3, that ensures identification of
ordering π when there is no response error, while condition (R) implies a weaker inequality and is
necessary and sufficient to rule out (¬Q).
Theorem 3. Assume (D) and (Q).
1. (B) implies (R) and
(MB) Q2(τ) < Qr(τ), for all τ, r = 1, 3.
2. (R) implies
(MR) 2Q2(τ) < Q1(τ) + Q3(τ), for all τ.
3. (¬Q) may hold if qc(τ) < 2qe(τ).
Theorem 3 establishes a hierarchy of decreasingly informative restrictions on expected frac-
tions of strict bottom responses. In the first part, inequalities (MB) underpin the identification
result of Theorem 2 under the balance condition (B). Inequalities (MR) in the second part
of the Theorem are weaker and imply that at least one (but possibly not both) of inequalities
Qr(τ) > Q2(τ), r = 1, 3, holds. Thus, when error rates are bounded according to (R), (MR) at
least ensures that responses y are not woefully misleading as a source of information on party
ordering, though that information may not be sufficient to identify the true ordering π. This is no
longer true if (R) fails, as established in part 3. Very little can be learned from response data that
violate (R).
2.1 Estimation
Under balance condition (B) the ordering π can be recovered consistently by a number of
analogy principle estimators. For example, a score estimator searches over possible rankings p and
recovers that which minimizes violations of unimodality in responses. This estimator takes the
form
π := arg minp∈Π
{QN (p, y) | p−1(jL) < p−1(jR)
},(4)
15
where the criterion function QN (p, y) is defined as the fraction of triplets of parties across respon-
dents that violate the sample analogue of (Q)
QN (p, y) :=∑τ∈T
1#T
N∑i=1
1N I(yipτ (2) < min{yipτ (1), y
ipτ (3)}
),(5)
and parties jL and jR are those specified in assumption (D). Function I that appears in (5) is
a truth function that evaluates to one (1) when its argument is true and evaluates to zero (0)
otherwise. Under the maintained assumptions (D) and (Q), if (B) holds then
limN→+ inf
QN (π, y) =∑τ
Q2(τ) < limN→+ inf
Q(p, yN )
for all p 6= π by (MB). It follows that:
Theorem 4. Assume (D), (Q), and (B). Then π is a consistent estimator of π.
The proof of Theorem 4 also serves as a constructive proof of identification (Theorem 2).
The score estimator π is certainly not unique and has no claim to efficiency in the class of consistent
estimators. In fact, Theorem 3 and the moment inequality conditions it establishes can be directly
employed to define a family of moment inequality estimators. In the terminology of that literature,
either of the inequalities (MB) or (MR) induces an identified set
Π∗M := {p ∈ Π | (MM ) hold, and p−1(jL) < p−1(jR)},M = B,R.
By the arguments above, Π∗B = {π} if (B) holds, while π ∈ Π∗R if (R) holds. Consistency conditions
for estimators ΠM ,M = B,R, of the identified set using moment inequalities have been discussed
among others by Imbens and Manski (2004) and, in greater generality, by Chernozhukov, Hong
and Tamer (2007) and Rosen (2008).11 Because the parameter space in this application is discrete,
some of the more technical of the general consistency conditions they establish are immediately
11Brady (1985) provides an early line of attack for estimators of sets, though not cast in a moment
inequalities framework.
16
verified,12 so that consistent estimation of the respective identified sets using such estimators follows
even in the case the ordering of parties is not point identified, that is, using moment inequalities
(MR). In the case of moment inequalities (MB), there is a natural connection between such moment
estimators and estimator π, particularly when the weighting matrix used in the moment estimator
criterion function is the identity matrix. Because the question of estimation of the identified set is
naturally linked to questions of inference and the construction of confidence sets for the identified
set which I heavily rely on in the ensuing sections, I do not develop additional notation for moment
inequality based estimation of the identified sets Π∗B and Π∗R in this paper.
3 Concavity
The analysis so far has relied only on ordinal assumptions on preferences that are con-
sistent with a one-dimensional party scale. In this section I explore the additional purchase of
concavity restriction (C). Concavity is often imposed through functional families prevalent in the
parametric framework (negative quadratic, piece-wise linear). To start, I introduce for the first
time a-(n otherwise typical) restriction on the distribution of the error terms entering sympathy
scores, namely:
For all j,
∫g(u(xj), εj)Φ(d(u, ε)) =
∫u(xj)Φ(d(u, ε)).(E)
Assumption (E) is easily met in the classic case the error is additive (yj = g(u(xj), εj) = u(xj)+εj)
and the error term has zero expectation. Even in this special case, the restriction allows dependence
across parties and heteroscedasticity across parties and respondents. Under this added restriction
and the stronger concavity assumption, inequalities (C) are directly inherited by expectations of
the sympathy scores:
Theorem 5. Assume (D), (C), and (E). Then
(xπ(r) − xπ(r−1))(E[yπ(r+1)]− E[yπ(r)]
)− (xπ(r+1) − xπ(r))
(E[yπ(r)]− E[yπ(r−1)]
)≤ 0,(MC)
12Specifically condition C.1 of Chernozhukov, Hong and Tamer (2007), or the continuity condi-
tions in Proposition 2 of Rosen (2008).
17
for all r = 2, . . . , J − 1.
Theorem 5 and the ensuing Theorem 6 are straightforward with proofs relegated to Ap-
pendix C. Inequalities (MC) are the first restrictions we have derived that directly involve party
locations, x, and for that reason appear more promising in further identifying these locations, be-
sides the ordering of parties. To further explore the reach of these added restrictions, define the
set of party locations that are consistent with ordering p under assumption (D) as:
Xp := {x ∈ RJ | xp(r) ≤ xp(r+1),r=1,...,J−1, and xj < xj′ for some j, j′}.
The following Theorem helps clarify the point that concavity of score expectations alone imparts
meager restrictions on the ordering of parties (though it does limit which party locations are
consistent with any one such ordering). Specifically, we can show:
Theorem 6. For all p, there exists x ∈ Xp that satisfies (MC) iff there exists r such that
E[yp(r−1)] ≤ E[yp(r)] for all r 1 < r ≤ r,(MU )
E[yp(r−1)] ≥ E[yp(r)] for all r r > r.
The Theorem states that in order for there to exist party locations x consistent with ordering
p and satisfying the concavity of expectations restriction it is necessary and sufficient that that
ordering renders expectations of party sympathy scores unimodal, in the sense that expectation of
party scores are increasing from the left and decreasing on the right of a party receiving the modal
expectation value. This is so under the restriction that at least two party locations inXp are distinct.
Counting the possible orders p that are consistent with (MU ) for any fixed expectations E[yj ], we
conclude there are∑J−1r=0 (J−1
r )2 = 2J−2 party orderings that are consistent with the concavity of
expectations restriction (MC), a large set.
This level of indeterminacy not-withstanding, the impact of joint assumptions (C) and (E)
is non-trivial for two reasons. First, relying on moment inequalities (MC) we can perform a point
test of the hypothesis that party locations x = x0 where x0 may correspond to some existing
scale. These moment conditions alone are sufficient to reject a number of existing scales of the
German party system (Appendix E). Second, because concavity (C) is stronger than the quasi-
18
concavity assumption (Q), either of the assumptions (R) or (B) allow us to combine the moment
restrictions (MC) with (MB) or (MR). These joint moment restrictions bring the full strength
of these assumptions to bear on the problem of estimation and/or inference on party locations:
moment conditions (MB) or (MR) bring information on the ordering of parties from left to right,
while moment conditions (MC) further restrict possible party locations for any given ordering. As
I discuss more extensively in the next section, Theorem 6 allows us to build efficient confidence sets
and specification tests of these joint assumptions by exploiting the uni-modality of expectations
inequalities, (MU ).
4 Inference and Specification Tests
Both the assumption of one-dimensionality embodied in (D) and (Q) or (C), and the balance
and error rate conditions (B) and (R) may or may not be supported by the data and need to be
subjected to the rigors of statistical inference. At the heart of the approach pursued in this study
is a test of the form
E[mM (θ, yi)] ≤ 0,(HMθ )
corresponding to a suitably defined class of moment inequalities represented by moment function
mM , and parameter(s) θ, as developed in Theorems 3, 5, and 6. There is a long and growing
literature on test procedures of that form, and the most recent lines of attack recognize that test
performance is improved when the test procedure incorporates information on the subset of the
moment inequalities that likely bind. Within the class of such “generalized moment selection”
(GMS) test procedures, two recent papers provide practical avenues for the execution of the tests
in finite samples: The recommended GMS procedure of Andrews and Barwick (2012) and the
practical two-step procedure of Romano, Shaikh and Wolf (2014). Because the former relies on
certain data-tuning parameters that are practically available only when the number of inequalities
is relatively small, all tests performed in the remainder of this study use the procedure of Romano,
19
Shaikh and Wolf (2014), where the reader can find the necessary background details.13
In our case, the moment inequalities are some combination of (the weak form of) (MB),
(MR), (MC), and/or (MU ) and the parameter is either the ordering p, possibly paired with the
location of the modal expectation of sympathy scores r, or party locations x. The (not jointly
combined) moment functions mM ,M = B,R,C,U, are reported in Table 6 of Appendix B. Several
tests are possible building on these inequalities. Of primary focus is a test of the balance condition
(B) that guarantees identification of party ordering π. By virtue of the first part of Theorem 3,
(B) can be tested using inequalities (MB), that is by testing
E[mB(p, yi)] ≤ 0.(HBp )
Test (HBp ) is a test of the hypothesis that ordering π = p and (B) holds, along with the maintained
assumptions (D) and (Q). Similarly, invoking the second part of Theorem 3, the error rate condition
(R) can be tested by
E[mR(p, yi)] ≤ 0.(HRp )
Turning to the cardinality assumption (C) along with (E), we can perform a point test
that the location of the parties is at x0 by testing moment inequalities (MC) evaluated at x = x0,
13 This test satisfies a uniform consistency condition
lim supN→+∞
sup(θ,Φ):(MM ) hold
PΦ[(HMθ ) is rejected] ≤ α,
under mild assumptions that are easily met in the present framework (Theorem 2.1 of Romano,
Shaikh and Wolf (2014)). The test procedure of Andrews and Barwick (2012) has the added benefit
of attaining the property with equality, though by remark S.2 in the supplemental Appendix of
Romano, Shaikh and Wolf (2014), the gap between the left-hand-side of this inequality and α is
no larger than α/10 in the recommended implementation of their test as used in the case study of
Section 5.
20
Table 1: Summary of Point Tests
Test Joint Null Hypothesis Hypothesis on x Moment Inequalities†(HB
p ) π = p, (D), (Q), (B) x ∈ Xp (MB)
(HRp ) π = p, (D), (Q), (R) x ∈ Xp (MR)
(HCp,x0) x = x0, (D), (C), (E) x = x0 (MC)
(HBCp,x0) x = x0, (D), (C), (E), (B) x = x0 (MB) & (MC)
(HRCp,x0) x = x0, (D), (C), (E), (R) x = x0 (MR) & (MC)
(HBUp,r0) π = p, r = r0, (D), (C), (E), (B) x ∈ Xp (MB) & (MU )
(HRUp,r0) π = p, r = r0, (D), (C), (E), (R) x ∈ Xp (MR) & (MU )
† All tests are based on the weak version of the indicated moment inequalities. The correspondingmoment functions, mM , are reported in Table 6 of Appendix B.
x0 ∈ Xp, that is by testing
E[mC(p, x0, yi)] ≤ 0.(HC
p,x0)
By Theorem 5, test (HCp,x0) is a joint test of the hypotheses that party locations x = x0 (x0 ∈ Xp)
and (D), (C), and (E) hold. While tests (HBp ) are nested in that inequalities (MB) imply (MR),
(HCp,x0) is a test of distinct hypotheses. But the two sets of assumptions can be combined to perform
a joint test, by virtue of the fact that concavity (C) implies quasi-concavity (Q). Specifically, for
any p and x0 ∈ Xp we can jointly test
E
mM (p, yi)
mC(p, x0, yi)
≤ 0,M = B,R.(HMCp,x0 )
Test (HMCp,x0 ) performs a joint test that party locations are at x = x0 ∈ Xp with expectations of
party scores that respect concavity at these locations and balance condition (B) (when M = B)
or error bound restriction (R) (when M = R) hold. As discussed at the end of Section 3, the
combination of these moment conditions is quite potent, restricting both party orders and possible
party locations for each order.
For ease of reference, Table 1 summarizes the tests introduced in this subsection and the
(implicit or explicit) joint null hypotheses for each. The last row in that Table includes a test which,
while not interesting in and of itself, is instrumental in efficiently constructing certain confidence
21
sets and specification tests in the next subsection. Test (HMUp,r0 )
E
mM (p, yi)
mU (p, r0, yi)
≤ 0,M = B,R,(HMUp,r0 )
is a joint test of the hypothesis that π = p, the party with the maximum score expectation is at
order r = r0, and assumptions (D), (C), (E), and (B) or (R) hold, depending on the value of
M = B,R.
4.1 Confidence sets & Specification tests
The natural next step capitalizing on this inferential framework is to construct confidence
sets of party locations.14 These sets are built by test-inversion, that is, for any parameters θ and
moments M , and any level of significance α, they take the form:
CMθ (α) := {θ | (HMθ ) is not rejected at level α}.
In addition to providing succinct information on possible party locations consistent with the respec-
tive joint hypotheses, a number of authors (Andrews and Soares (2010); Andrews and Guggenberger
(2009); Romano and Shaikh (2008)) have advocated the use of these confidence sets to perform
model specification tests. The idea is to reject the hypothesis that the model is correctly specified
when there is no parameter value that passes the test, that is, when CMθ (α) = ∅. Under the null
14Such confidence sets can either aim to cover the identified set with probability 1−α, or to cover
each point in the identified set with probability 1−α. In the latter vein, we build confidence sets for
points in the identified set following the procedure of Romano, Shaikh and Wolf (2014). By virtue
of the uniform consistency condition of the corresponding test discussed in Footnote 13, confidence
sets CN built through this procedure satisfy a uniform consistency condition (see Theorem 3.1 of
Romano, Shaikh and Wolf (2014))
lim infN→+∞
infΦ∈Φ
infθ∈Θ∗M (Φ)
PΦ[θ ∈ CN ] ≥ 1− α.
22
hypothesis that the model is well-specified, the actual size of these test is likely less than α.15 That
is, these specification tests are (by virtue of the uniform consistency of the test (HMθ )) uniformly
asymptotically valid, but are possibly asymptotically conservative.16
Construction of these confidence sets requires attention to an additional practical issue,
especially if these sets are used in specification tests. If parameters θ take values in a continuum,
the set CMθ (α) must necessarily be approximated by executing test (HMθ ) on a finite number of
parameter values θ. Use of such grids becomes more costly as the dimensionality of the parameter
space increases. In addition, construction of confidence sets using grids invites doubt to their use
in specification tests, as (without substantial additional arguments and costly increases in grid
precision) we cannot take the fact that the set proved empty on an approximating grid to imply
that the non-approximated confidence set is indeed empty.
This last problem can be circumvented entirely for the purposes of model specification by
virtue of the the inferential framework built through Theorems 3, 5, and 6. With the exception of
tests relying directly on moment inequalities (MC) and are parameterized by the party locations
x, all other confidence sets in our context can be built exactly by inverting the corresponding
hypothesis test on the entire (finite) domain of the parameters in question. Even when we wish to
test a joint hypothesis that includes the concavity assumption (C), we can build a confidence set
in the space of pairs p, r instead of party locations x by using test (HMUp,r0 ). This approach obviates
the need for an approximating grid and adds considerable credence to the use of these confidence
sets as specification tests.
15See the explicit discussion in the above references, especially Footnote 9 of Andrews and Soares
(2010), page 138.16More recently, specification tests for moment inequality models have been explored by Bugni,
Canay and Shi (2015) (see also Bugni, Canay and Shi (2017) for a discussion in the context of the
more general problem of sub-vector inference). They term the above test procedure “test BP” for
by-product, and propose two alternative tests which they show are asymptotically more powerful. I
do not entertain these alternative tests in this study mainly because of the difference in asymptotic
framework in Bugni, Canay and Shi (2015) with either that of Andrews and Barwick (2012) or
Romano, Shaikh and Wolf (2014), and because of the leeway the former allows in specifying a
critical tuning parameter.
23
Though not exhaustive, three types of specification tests seem particularly relevant in our
context. I conclude this section by reviewing each in turn, while Table 2 provides a summary of
the resulting specification test procedures.
Table 2: Specification Tests
Joint Null Hypothesis Moment Inequalities† Rejection Procedure
(D), (Q), (B) (MB) CBp (α) = ∅(D), (Q), (R) (MR) CRp (α) = ∅(D), (C), (E), (B) (MB) & (MU ) CBUp,r (α) = ∅(D), (C), (E), (R) (MR) & (MU ) CRUp,r (α) = ∅
† All tests are based on the weak version of the indicated moment inequalities. The correspondingmoment functions, mM , are reported in Table 6 of Appendix B.
Strong test of one-dimensionality: This test relies on a confidence set
CBp (α) := {p | (HBp ) is not rejected at level α}.
When CBp (α) = ∅, we can reject the joint hypothesis that (D), (Q), and (B) hold. This is a
strong test of uni-dimensionality, in the following sense: We may still have data generated from a
one-dimensional scale satisfying (D) and (Q), yet the hypothesis is rejected because the balance
condition (B) is violated for some triplet of parties.
Weak test of one-dimensionality: An alternative line of attack to testing the one-dimensionality
assumption can be pursued analogously to the strong test of one-dimensionality, by constructing a
confidence set
CRp (α) := {p | (HRp ) is not rejected at level α}.
When CRp (α) = ∅ we can reject the joint hypothesis that (D), (Q), and error bound (R) hold. This
is a weak test of the existence of a one dimensional scale, in that we reject the null either if (D)
and (Q) fail to hold or if the data is so noisy that (R) fails to hold. While the test is weak in the
sense that it is easier for data to ‘pass’ the test, rejection of this test provides strong evidence that
the uni-dimensionality assumption does not hold.
24
Joint tests of one-dimensionality & concavity: Finally, to perform a joint test of the null
hypothesis that (D), (Q), (B) or (R), and (C) and (E) hold, we build a confidence set
CMUp,r (α) := {(p, r) | (HMU
p,r0 ) is not rejected at level α},M = B,R.
We reject the null that the model is well-specified under these assumptions when CMUp,r (α) = ∅.
As already discussed, this is an efficient way to test the concavity assumption without using an
approximating grid on RJ , since there is only a finite number of pairs (p, r) on which we need
to invert the test. Paired with either balance condition (B) or error rate bound (R), the test
amounts to a strong and weak test of one-dimensionality and concavity of respondents’ systematic
preferences.
5 Implementation & a case study
In order to illustrate these estimation and inference techniques, I provide a detailed analysis
of CSES survey data on the German party system in this section. The relevant questions in these
surveys ask respondents to evaluate parties on an eleven point integer scale (0 − 10), with higher
values indicating the respondent likes the party more. The surveys included in the analysis were
timed around the federal elections in the period 1998-2013. I analyze each sub-wave of the 2002
data separately.17 All but the 2013 survey included questions for six political parties, namely, the
Left Partie (LP),18 Alliance 90/Greens (A90G), the Social Democratic Party (SPD), the Liberal
Democratic Party (FDP), and the two branches of the Christian Democrats the CDU and CSU. In
2013, the Pirates and AfD parties were added to the survey.
17Besides differences in timing and sampling scheme, the telephone version of the 2002 survey
over-sampled East Germany.18PDS prior to the 2005 elections.
25
Table 3: Manifesto and Chapel Hill left-right scales
LP A90/G SPD FDP CDU CSU Pirates AfDManifesto ’98 -26.816 -16.256 -1.603 14.704 24.868 – – –Manifesto ’02 -21.327 -13.434 4.099 4.591 20.76 – – –Manifesto ’05 -31.239 -10.445 -2.154 16.913 25.611 – – –Manifesto ’09 -24.519 -13.571 -18.297 4.272 8.724 – – –Manifesto ’13 -34.547 -19.595 -23.568 14.036 2.564 – -12.653 -2.74
Chapel Hill ’02 1.64 3.360 4 6.070 5.930 7.360 – –Chapel Hill ’06 1.27 3.090 3.55 6.640 6.360 7.640 – –Chapel Hill ’10 1.313 3.625 3.625 6.6 6.125 7.067 – –Chapel Hill ’14 1.231 3.615 3.769 6.538 5.923 7.231 3.25 8.923
Rows marked Manifesto report RiLe scores (Volkens et al. (2018)). The two branches of theChristian Democrats (CDU, CSU) are not separately scaled. Rows marked Chapel Hill reportleft-right scales constructed from expert surveys (Bakker et al. (2015)).
To provide necessary background information and as a basis of comparison for the ensuing
analysis, Table 3 reports existing conventionally construed left-right scales of these parties in the
period of study by two different but prominent methods, the RiLe scores form the Manifesto project
and the left-right scale produced by expert surveys conducted by the Chapel Hill project. There
are three chief threats to the adequacy of a one-dimensional scale in the period of study: First,
while most green parties typically have a clear left leaning, the German green party (A90/G) either
by virtue of strategic aspiration or coincidental make-up has often defied such placement. Second,
the Liberal Democrats (FDP) endorse conservative economic ideas but take more liberal stances on
social issues. Finally, the two parties that emerged more recently in the German political scene (the
Pirates and the AfD) endorse populist, anti-systemic, and/or nativist ideas. The data of Table 3
reflect this possible tension: the Manifesto data place A90/G to the right of the SPD in the last
two elections (but A90/G is to the left of SPD in all other cases); Chapel Hill expert data place
the FDP to the right of the CDU (but to the left of the CSU) while Manifesto does so only for
the last election; while the Pirates and the AfD seem to confound the Manifesto data that places
them at the center of the policy space (while the experts more conventionally place the AfD to the
far-right).
As a first step, I apply estimator (4) of Section 2, with the results reported in Table 4. Two
orderings emerge as point estimates across all periods. First, the FDP is placed at the far-right
of the spectrum at the beginning and end of the sample period, while the conventional ordering
26
Table 4: Estimated party order
1998 2002a 2002b 2005 2009 2013c 2013π p1 p2 p2 p2 p2 p1 pd1
Observed violationsf 4,452 4,315 1,406 4,421 4,560 2,576 6,443
N e 1,799 1,912 845 1,940 1,903 1,731 1,276Observed triplets 35,980 38,240 16,900 38,800 38,060 34,620 71,456
J 6 6 6 6 6 6 8Possible orders 360 360 360 360 360 360 20,160
Assumption (D) is imposed by setting jL =SPD and jR =CSU. The party orders p1, p2 are givenby ordering parties (from left to right) asp1 : LP A90/G SPD CDU CSU FDP,p2 : LP A90/G SPD FDP CDU CSU.π is estimator (4) of Section 2.a Telephone survey. b Mail-back survey. c Estimation performed excluding two parties added inthat year’s survey (Afd and Pirates). d Party Pirates is left-most, followed by AfD, which is orderedto the left of LP. e Sample only includes respondents that scored all parties. f Number of violationsof uni-modality across sample triplets when parties are ordered according to π.Data Sources: The Comparative Study of Electoral Systems (2015, 2018), Modules 1-4.
placing the FDP in the middle of the spectrum in-between the SPD and CDU emerges in the
intermediate elections. Second, in the 2013 elections with the two added parties (last column of
Table 4) the Pirates and AfD appear at the far left of the spectrum. In all cases the Greens are
placed to the left of the SPD. The fraction of observed violations of uni-modality in the data is
north of 10% of the observed triplets, with some periods performing better than others. This level
provides a rough gauge of the error rate qe(τ) on average, but does not resolve whether either of
the conditions (B) or (R) hold in the data.
For that purpose we turn to hypothesis tests (HBp ) and (HR
p ) which are executed for all
possible party orders to construct confidence sets. In Table 5 I report these confidence sets built
using all three implementations of the two-step procedure of Romano, Shaikh and Wolf (2014).19
The strong test of one-dimensionality is rejected across all elections. There is no support for the
hypothesis that balance condition (B) holds in these data. Since the consistency of the estimator
(4) is premised on condition (B), these results cast doubt on the point estimates of Table 4. Turning
to the error rate condition (R), the picture is more nuanced. First, we can reject the weak test
19For admissible test statistics for these tests see Romano, Shaikh and Wolf (2014), Andrews and
Barwick (2012), and Andrews and Soares (2010).
27
Table 5: Confidence Sets and Specification Tests
1998 2002a 2002b 2005 2009 2013c 2013
StrongTest
CBp (α)–MMM ∅ ∅ ∅ ∅ ∅ ∅ ∅CBp (α)–QLR ∅ ∅ ∅ ∅ ∅ ∅ ∅CBp (α)–MAX ∅ ∅ ∅ ∅ ∅ ∅ ∅
WeakTest
CRp (α)–MMM ∅ ∅ {p2, p3, p5, p6} {p2, p3} {p1, p2, p4} {p1} {pd1}CRp (α)–QLR ∅ ∅ {p3, p5} {p2} {p1} {p1} ∅CRp (α)–MAX ∅ ∅ {p3, p5, p6} {p2} {p1} {p1} ∅
Comb.WeakTeste
CURp,r (α)–MMM ∅ ∅ ∅ ∅ {p1} {p1} {pd1}CURp,r (α)–QLR ∅ ∅ ∅ ∅ {p1} {p1} {pd1}CURp,r (α)–MAX ∅ ∅ ∅ ∅ {p1} {p1} ∅
Nf 1,799 1,912 845 1,940 1,903 1,731 1,276J 6 6 6 6 6 6 8
Possible orders 360 360 360 360 360 360 20,160
All tests carried out using the two-step procedure of Romano, Shaikh and Wolf (2014) with α = 0.05,β = 0.005, B = 10, 000. Assumption (D) is imposed by setting the SPD to the left of CSU. Partyorders p1 − p6 arep1 : LP A90/G SPD CDU CSU FDP,p2 : LP A90/G SPD FDP CDU CSU,p3 : LP SPD A90/G FDP CDU CSU,p4 : LP SPD A90/G CDU CSU FDP,p5 : SPD A90/G FDP CDU CSU LP,p6 : A90/G SPD FDP CDU CSU LP.a Telephone survey. b Mail-back survey. c Analysis performed excluding two parties added in thatyear’s survey (AfD and Pirates). d Party Pirates is left-most, followed by AfD, which is orderedto the left of LP. e Only the ordering coordinate p is reported. f Sample only includes respondentsthat scored all parties.Data Sources: The Comparative Study of Electoral Systems (2015, 2018), Modules 1-4.
28
of one-dimensionality in the 1998 elections. There is very little room to defend a one-dimensional
representation of the German party system in those elections. The same may be true in 2002
and possibly in 2013 when the test is performed using all eight parties of that survey. The 2002
mail-back survey seems to possibly admit a one-dimensional representation but this may be due to
smaller sample size or the different population of respondents in the two surveys. We cannot reject
the weak one-dimensionality test in the elections of 2005, 2009, and 2013 (without the added parties
Pirates and AfD). It is worth noting that the resulting confidence sets are small (even though (R)
does not necessarily point-identify the true ordering). Furthermore, with the exception of the set
built using the MMM statistic, the order that survives in the 2009 confidence sets is different than
the one that minimizes violations of uni-modality, and places the FDP at the far-right. Finally,
turning to tests that incorporate concavity (C), in two elections (2009, 2013) we cannot reject the
joint assumption of concavity combined with the weak one-dimensionality test.20 On the more
mundane but practically important side of things, the MAX test statistic proves a coherent and
inexpensive implementation of the tests proposed by Romano, Shaikh and Wolf (2014) compared
to the QLR and MMM alternatives.
In the subset of elections when these results do not outright reject the one-dimensional
model, the task remains of interpreting the resulting confidence sets in order to understand what
the content of that dimension might be. Of course, the standard statistical caveat still applies
that failure to reject the one-dimensionality test does not confirm that one dimension is indeed
adequate. Still, the non-empty confidence sets in Table 5 may be consistent with a conventional
economic left-right scale, but then the particular ordering is different than that produced by either
the Manifesto or Chapel Hill methods. Notably, the left-right interpretation requires the FDP at
the right-most position in the second half of the period and the Greens (except for the 2002 or
MMM implementations) to the left of the SPD.
Failure of the balance condition in these data warns us that the most extreme parties in
the scale may be harder to place: A larger fraction of the electorate strictly least prefer these
parties, so that when such parties are contained in a triplet the fraction of respondents that least
20Notably, for the 2013 election with all eight parties included, the combined weak test of one-
dimensionality and concavity is now not rejected for both the MMM and the QLR implementation
of the test.
29
prefer the extreme party on the other side of the spectrum in that triplet is more likely to fail (B).
Therefore, one-dimensional data may pass the weak test but the identified set includes party orders
placing the most extreme parties on both ends of the spectrum. This may account for the dual
placement of party LP in the 2002 mail-back survey results. It might appear tempting to attribute
the placement of the AfD and Pirates on the left-most position of the scale to the same effect, but
there is no corresponding placement of these parties to the right-most position.21 Indeed, we ought
to consider this datum with additional attention: There are 21,160 candidate orderings of the eight
parties (after orientation restriction (D) is imposed) and this is the only one that passed test (HRp )
or (HRUp,r0)! One possible interpretation is that if 2013 is consistent with a one-dimensional scale for
all eight parties, then that scale is a libertarian/totalitarian scale starting with the FDP as most
libertarian party, with the AfD and Pirates being most totalitarian parties. Another possibility
is that the world economic crisis of 2008 forced increased emphasis on narrow/populist economic
well-being (that subsumes nativist perspectives through that narrow lense), thus placing both the
AfD and Pirates at the extreme left and the economically conservative FDP at the right-most
position.
6 Conclusions
One dimensional summaries such as liberal, moderate, conservative, etc., are tremendously
efficient in conveying information about political parties, individual candidates, Supreme Court
nominees, and most other political arenas. It therefore behooves us to use one-dimensional scales
when they are adequate but it is also equally important to understand when is it that we may miss
something significant about the politics in question if we commit to a one dimensional summary?
It might be tempting to surmise that one dimension almost never suffices. But when it comes
to statistical sufficiency, such questions are addressed using rigorous test procedures. Because
statistical test procedures always test joint hypotheses, we also need to ensure we are not rejecting
the one dimensional model solely because individual preferences are not quadratic, or symmetric
around an ideal point, or similarly order restricted.
21Recall that the AfD is placed in the right-most position by the Chapel Hlll survey and in a
central position by the Manifesto scores.
30
The methods developed in this paper aim to give maximal leeway to a one-dimensional scale
and provide guidance on its reconstruction if it, and to the degree possible only it, is not rejected
by the data. Identification of the ordering on the scale is established under the balance condition.
The weaker error rate condition still produces narrow confidence sets without fully identifying the
order of parties. Both conditions are testable in the data. Implementation of this framework using
German CSES survey data yielded strong evidence against the balance condition. These data also
outright reject a one-dimensional scale in some but not all federal elections. Furthermore, cases
when a one-dimensional scale is not rejected by the data often require unconventional placement
of the parties and careful interpretation of the empirical content of the scale.
The analysis in this paper has focused exclusively on one-dimensional scales. Given the
results in Kalandrakis (2010), particularly his embedding Theorem 5, it is unlikely that this ap-
proach can be extended to more than one dimensions without added restrictions on preferences. I
leave such questions for future research. Within the one-dimensional framework, it is possible that
the austere setup in this paper leaves room for the development of procedures for oversampling
certain sub-populations in order to restore balance in the sample when the error rate condition is
met. The challenge with such schemes is to properly guard against biasing the results in favor of
one-dimensionality and I also leave this question for future research. In work that is currently in
progress, I use this framework to study more systematically the adequacy of one-dimensional party
scales in a comparative perspective and in Presidential elections in the US.
31
Appendix A Proofs of Theorems 1-4
The logical chain runs in reverse order, that is, Theorem 1 follows from Theorem 2, which
in turn is proved using Theorems 3 and 4.
Proof of Theorem 1. Since yij > yij′ ⇔ ui(xj) > ui(xj′) for all i, j, j′, we have qe(τ) = 0 for all
τ . (B0) further implies that qc(τ) is well-defined and qc(τ) > 0 for all τ . Consequently (B) holds
and the result follows from Theorem 2.
Proof of Theorem 2. Follows from Theorem 4.
Proof of Theorem 3. We proceed to show the three parts in the same order:
1. Assume (B). To show (MB) consider any τ and r = 1, 3. We have
Qr(τ) ≥ qc(τ) · P[πτ (r) ∈ arg maxj∈τ
u(xj)] (by definition of qc(τ))
≥ qc(τ) · Pr(τ) (because P[πτ (r) ∈ arg maxj∈τ
u(xj)] ≥ P[{πτ (r)} = arg maxj∈τ
u(xj)])
> qc(τ) · qe(τ)
qc(τ)(by (B))
≥ qe(τ) = Q2(τ) (by definition of qe(τ)).
Next, to show (R), we have for all τ
qc(τ) ≥ qc(τ)(P1(τ) + P3(τ)) (by (2))
> qc(τ) · 2qe(τ)
qc(τ)= 2qe(τ) (by (B)).
2. Assume (R) and consider any τ . We have
Q1(τ) + Q3(τ) ≥ qc(τ) ·∑r=1,3
P[πτ (r) ∈ arg minj∈τ
u(xj)] (by definition of qc(τ))
≥ qc(τ) (by (3))
> 2qe(τ) (by (R))
= 2Q2(τ) (by definition).
32
3. We will show there exists a distribution Φ satisfying qc(τ) < 2qe(τ) such that (¬Q) holds for
some τ . So fix τ and assume Φ is such that the mass of respondents that are indifferent among
any two parties in τ is null so that∑
r=1,3 Pr(τ) = 1 (by (3)), and that every respondent either
reports her true least preferred party in τ as strictly least preferred (with probability qc(τ))
or reports the middle party as strictly least preferred (with probability qe(τ) = 1 − qc(τ)).
Then, we have
Q1(τ) + Q3(τ) = qc(τ)(P1(τ) + P3(τ)) (by assumption)
= qc(τ) (by assumption)
< 2qe(τ) (because qc(τ) < 2qe(τ))
= 2Q2(τ) (by definition).
Now, if P1(τ) ≈ P3(τ) we have Qr(τ) < Q2(τ) for both r = 1, 3.
Proof of Theorem 4. It suffices to show that
limN→+∞
QN (π, y) < limN→+∞
QN (p, y),
for all p ∈ Π, p 6= π and p−1(jL) < p−1(jR). Then, since Π is finite, π is a consistent estimator
which also provides a constructive proof of identification of π. Observe that for all such p 6= π the
set
Tp := {τ | pτ (2) 6= πτ (2)},
is non-empty. Therefore, for such p 6= π we have
limN→+∞
(QN (p, y)−QN (π, y)) =∑τ∈Tp
(Qπ−1
τ (pτ (2))(τ)−Q2(τ)).
Because (B) implies (MB) (by Theorem 3) and π−1τ (pτ (2)) ∈ {1, 3} for all τ ∈ Tp, we conclude that
limN→+∞ (QN (p, y)−QN (π, y)) > 0 as desired.
33
References
Afriat, Sydney N. 1967. “The construction of utility functions from expenditure data.” International
Economic Review 8:67–77.
Aldrich, J. and R. D. McKelvey. 1977. “A method of scaling with applications to the 1968 and
1972 presidential elections.” American Political Science Review 71(1):111–130.
Andrews, D. W. K. and Panle Jia Barwick. 2012. “Inference for parameters defined by moment
inequalities: a recommended moment selection procedure.” Econometrica 80(6):2805–2826.
Andrews, Donald W. K. and Gustavo Soares. 2010. “Inference for Parameters Defined by Moment
Inequalities Using Generalized Moment Selection.” Econometrica 78(1):119–157.
Andrews, Donald W. K. and Patrik Guggenberger. 2009. “Validity of Subsampling and ”Plug-in
Asymptotic” Inference for Parameters Defined by Moment Inequalities.” Econometric Theory
25:669–709.
Ansolabehere, Stephen and H.E. Brady. 1989. “The Nature of Utility Functions in Mass Publics.”
American Political Science Review 83(1):143–163.
Arrow, Kenneth J. 1951. Social Choice and Individual Values. New York: Wiley.
Bakker, Ryan, Catherine de Vries, Erica Edwards, Liesbet Hooghe, Seth Jolly, Gary Marks,
Jonathan Polk, Jan Rovny, Marco Steenbergen and Milada Vachudova. 2015. “Measuring Party
Positions in Europe: The Chapel Hill Expert Survey Trend File, 1999-2010.” Party Politics
21(1):143–152.
Black, Duncan. 1948. “On the Rationale of Group Decision-Making.” Journal of Political Economy
56:23–34.
Brady, H.E. 1985. “Statistical Consistency and Hypothesis Testing for Nonmetric Multidimensional
Scaling.” Psychometrika 50(4):509–537.
Brady, H.E. 1989. “Factor and Ideal Point Analysis for Interpersonally Incomparable Data.” Psy-
chometrika 54(2):181–202.
34
Brady, H.E. 1990. “Dimensional Analysis of Ranking Data.” American Journal of Political Science
34(4):1017–1048.
Bugni, Federico A., Ivan A. Canay and Xiaoxia Shi. 2015. “Specification tests for partially identified
models defined by moment inequalities.” Journal of Econometrics 185:259–282.
Bugni, Federico A., Ivan A. Canay and Xiaoxia Shi. 2017. “Inference for subvectors and other func-
tions of partially identified parameters in moment inequality models.” Quantitative Economics
8:1–38.
Cahoon, L. and M. J. Hinich. 1976. “A method for locating targets using range only.” IEEE
Transactions on Information Theory 22(2):217–225.
Cahoon, L., M. J. Hinich and P. C. Ordeshook. 1978. A statistical multidimensional unfolding
method based on the spatial theory of voting. In Wang, P. C., ed. Graphical Representation of
Multivariate Data. New York: Academic Press pp. 243–278.
Canay, Ivan A. and A. Shaikh. 2017. Practical and Theoretical Advances for Inference in Partially
Identified Models. In Advances in Economics and Econometrics, ed. M. Piazzesi B. Honore,
A. Pakes and L. Samuelson. Vol. 2 of Econometric Society Monographs Cambridge University
Press pp. 271–306.
Canen, Nathan, Chad Kendall and Francesco Trebbi. 2018. “Unbundling Polarization.” NBER
Working Paper No. w25110 .
Castles, F. and P. Mair. 1984. “Left-Right Political Scales: Some Expert Judgements.” European
Journal of Political Research 12:73–88.
Chernozhukov, V., H. Hong and E. Tamer. 2007. “Estimation and confidence regions for parameter
sets in econometric models.” Econometrica 75(5).
Clinton, J., S. Jackman and D. Rivers. 2004. “The statistical analysis of roll call data.” American
Political Science Review 98:355–370.
Dummett, Michael and Robin Farquharson. 1961. “Stability in Voting.” Econometrica 29(1):33–43.
35
Eggers, Andrew C. and Nick Vivyan. 2018. “Who votes more strategically?” Working paper. Oxford
University .
Heckman, J. and J. Snyder. 1997. “Linear probability models of the demand for attributes with an
empirical application to estimating the preferences of legislators.” RAND Journal of Economics
28:S142–S189.
Henry, Marc and Ismael Mourifie. 2013. “Euclidean Revealed Preferences: Testing the spatial
voting model.” Journal of Applied Econometrics 28:650–666.
Hinich, M. J. 2005. “A New Method for Statistical Multidimensional Unfolding.” Multivariate
Analysis 34:2299–2310.
Iaryczower, Matias and Matthew Shum. 2012. “The Value of Information in the Court: Get It
Right, Keep It Tight.” American Economic Review 102(1):202–237.
Imbens, G. and C. F. Manski. 2004. “Confidence Intervals for Partially Identified Parameters.”
Econometrica 72:1845–1857.
Kalandrakis, T. 2010. “Rationalizable voting.” Theoretical Economics 5(1):93–125.
Kawai, Kei and Yasutora Watanabe. 2013. “Inferring Strategic Voting.” American Economic Review
103(2):624–662.
Manski, C. F. 1975. “Maximum Score Estimation of the Stochastic Utility Model of Choice.”
Journal of Econometrics 3:205–228.
Manski, C. F. 1988. “Identification of Binary Response Models.” Journal of the American Statistical
Association 83:729–738.
Martin, A.D. and K.M. Quinn. 2002. “Dynamic Ideal Point Estimation via Markov Chain Monte
Carlo for the U.S. Supreme Court, 1953-1999.” Political Analysis 10:134–153.
Matzkin, R. L. 1992. “Nonparametric and Distribution-Free Estimation of the Binary Choice and
the Threshold-Crossing Models.” Econometrica 60:239–270.
Merlo, Antonio and Aureo de Paula. 2017. “Identification and Estimation of Preference Distribu-
tions When Voters Are Ideological.” Review of Economic Studies 84:1238–1263.
36
Mokken, R. J. 1971. A Theory and Procedure of Scale Analysis. The Hague: Mouton/Berlin:
DeGrutyer.
Poole, K. T. 2000. “A non-parametric unfolding of binary choice data.” Political Analysis 8(3):211–
237.
Poole, K. T. and H. Rosenthal. 1985. “A spatial model for legislative roll call analysis.” American
Journal of Political Science 29:357–384.
Poole, K. T. and H. Rosenthal. 1997. Congress: A political-economic history of roll call voting.
New York: Oxford University Press.
Poole, Keith T. 2005. Spatial Models of Parliamentary Voting. Cambridge University Press.
Post, Wendy J. 1992. Nonparametric Unfolding Models: A latent structure approach. Number 21
in “M&T Series” Leiden University, The Netherlands: DSWO Press.
Romano, J. P., A. Shaikh and M. Wolf. 2014. “A practical two-step method for testing moment
inequalities.” Econometrica 82(5):1979–2002.
Romano, Joseph P. and Azeem M. Shaikh. 2008. “Inference for identifiable parameters in partially
identified econometric models.” Journal of Statistical Planning and Inference 138:2786–2807.
Rosen, A. M. 2008. “Confidence sets for partially identified parameters that satisfy a finite number
of moment inequalities.” Journal of Econometrics 146:107–117.
Sen, Amartya K. 1966. “A Possibility Theorem on Majority Decisions.” Econometrica 34(2):491–
499.
Sijtsma, Klaas and Ivo W. Molenaar. 2002. Introduction to Nonparametric Item Response Theory.
Vol. 5 of Measurement Methods for the Social Sciences SAGE Publications.
Sijtsma, Klaas and R.R. Meijer. 2007. Nonparametric item response theory and related topics.
In Psychometrics, ed. C. R. Rao and S. Sinharay. Number 26 in “Handbook of statistics”
Amsterdam: Elsevier pp. 719–746.
Tahk, Alexander. 2018. “Nonparametric Ideal-Point Estimation and Inference.” Political Analysis
26:131–146.
37
The Comparative Study of Electoral Systems. 2015. www.cses.org.
The Comparative Study of Electoral Systems. 2018. www.cses.org.
van Schuur, Wijbrandt H. 2011. Ordinal Item Response Theory: Mokken Scale Analysis. Quanti-
tative Applications in the Social Sciences SAGE Publications.
Varian, Hal R. 1982. “The nonparametric approach to demand analysis.” Econometrica 50(4):945–
973.
Volkens, Andrea, Pola Lehmann, Theres Matthiess, Nicolas Merz, Sven Regel and Bernhard Wes-
sels. 2018. “The Manifesto Data Collection. Manifesto Project (MRG/CMP/MARPOR). Berlin:
Wissenschaftszentrum Berlin fur Sozialforschung (WZB).”.
URL: https://doi.org/10.25522/manifesto.mpds.2018a
38
Appendix B – Supplemental: Moment functions
Table 6: Moment functions
θ Θ mM (θ, yi) (MM )
p Π
I(yipτ (2) < min
k=1,3{yipτ (k)}
)−I(yipτ (r) < min
k 6=r{yipτ (k)}
)
τ∈Tr=1,3
(MB)
p Π
2I(yipτ (2) < min
r=1,3{yipτ (r)}
)−∑r=1,3
I(yipτ (r) < min
k 6=r{yipτ (k)}
)τ∈T
(MR)
(p, x) ({p} ×Xp)p∈Π
(xp(r) − xp(r−1))(yip(r+1) − y
ip(r))
−(xp(r+1) − xp(r))(yip(r) − yip(r−1))
J−1
r=2
(MC)
(p, r) Π× {1, . . . , J}(
(I(r′ ≤ r)− I(r′ > r))(yip(r′−1) − yip(r′))
)Jr′=2
(MU )
Function mM (θ, yi) represents the contribution of sample response yi to each moment inequality onthe right column. The parameter input for moment inequalities (MC) involves some redundancyas it specifies both a party order and the actual location of the parties for ease of comparison withthe remaining moment inequalities.
Appendix C – Supplemental: Proof of Theorems 5 & 6
Proof of Theorem 5. For all r = 2, . . . , J − 1 we have
0 ≥ E[(xπ(r) − xπ(r−1))(u(xπ(r+1))− u(xπ(r))
)− (xπ(r+1) − xπ(r))
(u((xπ(r))− u(xπ(r−1))
)]
= (xπ(r) − xπ(r−1))(E[u(xπ(r+1))]− E[u(xπ(r))]
)− (xπ(r+1) − xπ(r))
(E[u(xπ(r))]− E[u(xπ(r−1))]
)]
= (xπ(r) − xπ(r−1))(E[yπ(r+1)]− E[yπ(r)]
)− (xπ(r+1) − xπ(r))
(E[yπ(r)]− E[yπ(r−1)]
).
The first line follows from (C), the second from linearity of expectation, the third from (E).
39
Both directions of Theorem 6 are, barring small details, well-understood. The proof pro-
vided here is rudimentary, but does cover and rely on the fact that at least two party locations are
distinct. If no pair of parties have distinct locations in Xp, then the second direction holds trivially
while the first direction does not hold, in general. The proof of the second direction is performed
by solving a reverse Afriat inequality problem of sorts, in that we are given ‘utility’ levels and seek
alternatives and supergradients that jointly satisfy concavity restrictions.
Proof of Theorem 6. ⇒ Suppose there exists x ∈ Xp that satisfies (MC) but there does not
exist r such that (MU ) holds. Let ¯r be the smallest candidate value such that E[yp(r−1)] ≥ E[yp(r)]
for all r > ¯r. Since (MU ) does not hold, ¯r > 1 and E[yp(¯r−1)] < E[yp(¯r)] (for otherwise ¯r is not the
smallest), and there exists r, 1 < r < ¯r such that E[yp(r−1)] > E[yp(r)] (it cannot be that r = ¯r
because E[yp(¯r−1)] < E[yp(¯r)]). Let this r be the largest such. We thus have
E[yp(r−1)] > E[yp(r)] ≤ E[yp(¯r−1)] < E[y(¯r)].
Then for τ = {p(r − 1), p(r), p(¯r)} we have xp(r−1) ≤ xp(r) ≤ xp(¯r) and (by E[mC(p, x, y)] ≤ 0)
(xp(r) − xp(r−1))(E[yp(¯r)]− E[yp(r)])− (x¯r − xp(r))(E[yp(r)]− E[yp(r−1)]) ≤ 0,
which is possible only if xp(r−1) = xp(r) = xp(¯r) = x. Then, since xj 6= xj′ for some j, j′, x 6= xk for
some k ∈ {j, j′}. If xk < x then for τ = {k, p(r), p(¯r)}
(xp(r)−xk)(E[yp(¯r)]−E[yp(r)])−(xp(¯r)−xp(r))(E[yp(r)]−E[yk]) = (xp(r)−xk)(E[yp(¯r)]−E[yp(r)]) > 0,
contradicting the assumption that (MC) hold. If, on the other hand, xk > x, then we obtain a
similar contradiction for triplet τ = {k, p(r), p(r − 1)}, that is,
(xp(r)−xp(r−1))(E[yk]−E[yp(r)])−(xk−xp(r))(E[yp(r)]−E[yp(r−1)]) = −(xk−xp(r))(E[yp(r)]−E[yp(r−1)]) > 0.
⇐ Fix p and suppose there exists r such that (MU ) holds. We need to show that there exists
x ∈ Xp such that (MC) holds. We must have xj 6= xj′ for some j, j′, so set xp(1) = 0 and xp(J) = 1
40
(hence, xj 6= xj′ for j = p(1), j′ = p(J)). Define
D :=
∑J
r=2 | E[yp(r)]− E[yp(r−1)] | if∑J
r=2 | E[yp(r)]− E[yp(r−1)] |> 0,
1 if∑J
r=2 | E[yp(r)]− E[yp(r−1)] |= 0.
For all r, define a pair of scalars xp(r), dr so that
dr =
D if r < r and E[yp(r)] < E[yp(r)]
0 if E[yp(r)] = E[yp(r)]
−D if r > r and E[yp(r)] < E[yp(r)],
and xp(1) = 0, xp(J) = 1 as above and for r = 2, . . . , J − 1 inductively set
xp(r) = xp(r−1) +| E[yp(r)]− E[yp(r−1)] |
D.
Observe, first that x = (x1, . . . , xJ) ∈ Xp. Furthermore, it is easy to verify22 that the Afriat
inequalities Afriat (1967)
E[yp(r)]− E[yp(r′)]− dr(xp(r) − xp(r′)) ≥ 0, for all r, r′
are satisfied. As a result, we can construct a function E : R→ R that takes the form
E(z) := minr=1,...,J
{E[yp(r)] + dr · (z − xp(r))}.
22By construction when dr = 0, with equality for r, r′ with sign(dr) = sign(dr′) when dr 6= 0,
and by the triangle inequality in the remaining cases.
41
E is concave as the minimum of concave functions and satisfies E(xp(r)) = E[yp(r)] for all r. We
now have:
E[mC(p, x, y)] =((xpτ (r) − xpτ (r−1))(E[ypτ (r+1)]− E[ypτ (r)])− (xpτ (r+1) − xpτ (r))(E[ypτ (r)]− E[ypτ (r−1)])
)J−1
r=2
=(
(xpτ (r) − xpτ (r−1))(E(xpτ (r+1))− E(xpτ (r)))− (xpτ (r+1) − xpτ (r))(E(xpτ (r))− E(xpτ (r−1))))J−1
r=2
≤ 0,
with the last inequality following by the concavity of E.
Appendix D – Supplemental: Ties in party locations
For expositional purposes, the analysis in the main text has been carried out under the
assumption that party locations are pairwise distinct. If we allow parties to share locations23 then
there may be multiple orderings, π, such that
xπ(1) ≤ xπ(2) ≤ . . . ≤ xπ(J).
Denote the set of such true orderings by Π∗ ⊆ Π. The population quantities Pr(τ) are not affected
by this possibility: If two parties j, j′ share a location xj = xj′ , we have u(xj) = u(xj′) for all u,
therefore for all τ = {j, j′, j′′}, we have Pr(τ) = 0 for all π ∈ Π∗ and all r such that πτ (r) = j or
πτ (r) = j′. Some other quantities defined in the main text may take different values depending on
the choice of π ∈ Π∗. To avoid this possible ambiguity, generalize qc(τ) to
qc(τ) := minr=1,3π∈Π∗
P[{πτ (r)} = arg min
j∈τyj | πτ (r) ∈ arg min
j∈τu(xj)
],
and qe(τ) to
qe(τ) := maxπ∈Π∗
P[{πτ (2)} = arg minj∈τ
yj ].
Using these generalized definitions, it easily follows that the weaker of the two condition (B) and
(R) precludes ties between pairs of parties:
23Except of course for parties jL, jR, of (D).
42
Lemma 1. If (R) holds, then Π∗ = {π} is singleton.
Proof. Suppose qc(τ) > 2qe(τ) but there exist parties j, j′ such that xj = xj′ . Consider any j′′
and the triplet τ = {j, j′, j′′}. Assume, without loss of generality, that P[{j} = arg minj∗∈τ yj∗ ] ≥
P[{j′} = arg minj∗∈τ yj∗ ]. Using the generalized definitions of qe, qc we conclude
qe(τ) ≥ P[{j} = arg minj∗∈τ
y∗ ] ≥ P[{j′} = arg minj∗∈τ
y∗ ] ≥ qc(τ),
which is impossible because (R) holds for all τ .
Since (B) implies (R), it follows that when either of the two holds, Qr(τ) as given in the
main text is well-defined.24 Therefore, Theorems 1, 2, 3, and 4 hold as stated using the above
generalized definitions. Theorems 5 and 6 are not affected either way.
Appendix E – Supplemental: Tests of existing scales using (MC)
24Although it is not necessary in any of the numbered results, we could also generalize the
definition of Qr(τ) to Qr(τ, π) := P[{πτ (r)} = arg minj∈τ yj ] to allow for multiple π ∈ Π∗.
43
Table 7: Testing Alternative Scales, Germany 2009
1998 2002a 2002b 2005 2009 2013
Manifesto
MMM 707.91 405.07 114.78 127.34 130.51 990.90critical level 4.58 2.92 2.92 4.49 4.19 5.36
QLR 706.10 400.59 113.48 120.79 153.10 1355.06critical level 4.38 2.86 2.82 66.07† 4.60 6.06
MAX 26.57 20.13 10.71 10.51 10.93 25.19critical level 1.97 1.72 1.71 1.99 1.97 2.18
ChapelHill
MMM – 262.40 38.87 72.05 60.17 1728.61critical level – 4.50 4.49 4.32 4.54 5.53
QLR – 262.31 42.91 74.67 54.70 2086.13critical level – 4.35 4.28 4.28 4.36 5.73
MAX – 14.49 6.23 7.45 5.99 30.46critical level – 2.01 2.01 1.99 2.02 2.19
N c 1,799 1,912 845 1,940 1,903 1,276
Tests (HCp,x0) carried out with x0 set at the scale values of Table 3 for the corresponding (nearest)
election CSES survey data. Tests executed using the two-step procedure of Romano, Shaikh andWolf (2014) on the square sample with α = 0.05, β = 0.005, B = 10, 000. The null hypothesis isrejected in all cases.a Telephone survey. b Mail-back survey. c Sample only includes respondents that scored all parties.† Numerically unstable value.Data Sources: The Comparative Study of Electoral Systems (2015, 2018), Modules 1-4.
44