some counterexamples to causal decision theory1 andy …fitelson.org/few/few_05/egan.pdf · 1 some...

1

Some Counterexamples to Causal Decision Theory1 Andy Egan

Australian National University

Introduction

Many philosophers (myself included) have been converted to causal decision

theory by something like the following line of argument: Evidential decision theory

endorses irrational courses of action in a range of examples, and endorses “an irrational

policy of managing the news”.2 These are fatal problems for evidential decision theory.

Causal decision theory delivers the right results in the troublesome examples, and does

not endorse this kind of irrational news-managing. So we should give up evidential

decision theory, and be causal decision theorists instead.

Unfortunately, causal decision theory has its own family of problematic examples

for which it endorses irrational courses of action, and its own irrational policy that it is

committed to endorsing. These are, I think, fatal problems for causal decision theory. I

wish that I had another theory to offer in its place.

1. The Case against Evidential Decision Theory

Evidential decision theory says that the action that it’s rational to perform

(ignoring the possibility of ties) is the one with the greatest expected utility – the one

such that your expectations for how the world will turn out, conditional on your

1 Special thanks are due to David Braddon-Mitchell for the series of conversations that led to this paper, and for many further conversations as it was in progress. Thanks also to Adam Elga, Brian Weatherson, Karen Bennett, Daniel Stoljar, Alan Hajek, James Joyce, and audiences at MIT, the University of Pittsburgh, Oxford University, and the ANU Philosophical Society for very helpful questions, comments, and objections. 2 Lewis 1981.

2

performing it, are greater than the expectations conditional on performing some other

action.. So the action that it’s rational to perform will also be the one that you (or a

friend with you’re your own interests in mind, and with the same ideas about where your

interests lie as you have) would be happiest to learn that you had performed. The case

against evidential decision theory is based upon examples like the following:

The Smoking Lesion

Susan is debating whether or not to smoke. She knows that smoking is strongly

correlated with lung cancer, but only because there is a common cause – a

condition that tends to cause both smoking and cancer. Once we fix the presence

or absence of this condition, there is no additional correlation between smoking

and cancer. Susan prefers smoking without cancer to not smoking without cancer,

and prefers smoking with cancer to not smoking with cancer. Should Susan

smoke? Is seems clear that she should. (Set aside your theoretical commitments

and put yourself in Susan’s situation. Would you smoke? Would you take

yourself to be irrational for doing so?)3

Causal decision theory distinguishes itself from evidential decision theory by

delivering the right result for The Smoking Lesion, where its competition – evidential

decision theory – does not. The difference between the two theories is in how they

compute the relative value of actions. Roughly: evidential decision theory says to do the

3 This example is a standard medical Newcomb problem, representative of the many to be found in the literature. The original Newcomb’s problem is from Nozick 1969. For some excellent discussions of medical (and other) Newcomb problems, see (among many others) Gibbard and Harper 1976, Eells 1982, Lewis 1979 and Lewis 1981.

3

thing you’d be happiest to learn that you’d done, and causal decision theory tells you to

do the thing most likely to bring about good results.

Evidential decision theory tells Susan not to smoke, roughly because it treats the

fact that her smoking is evidence that she has the lesion, and therefore is evidence that

she is likely to get cancer, as a reason not to smoke. Causal decision theory tells her to

smoke, roughly because it does not treat this sort of common-cause based evidential

connection between an action and a bad outcome as a reason not to perform the action.

Let’s look at how the differences between the formal theories deliver these results:

Following Lewis, let a dependency hypothesis be a proposition which is

maximally specific about how things that the agent cares about depend causally on what

the agent does. Also following Lewis, let us think of such propositions as long

conjunctions of subjunctive conditionals (of the appropriate, non-backtracking kind) of

the form, if I were to do A, then P. (Written, from now on, “AP”.)

The difference between causal and evidential decision theory is that causal

decision theory privileges the agent’s unconditional assignment of credences to

dependency hypotheses in determining the relative values of actions.

If the H’s form a partition of the worlds that the agent assigns non-zero credence,

the value assigned to an action A by evidential decision theory (henceforth EDT) is given

by:

VALEDT = Hc(H|A)v(HA)

(Note a harmless ambiguity: I’m using ‘A’ to name both an action and the proposition

that the agent performs that action.)

4

In particular, in the case of the partition of dependency hypotheses (let these be

the Ks), the value assigned by EDT is given by:

VALEDT = Kc(K|A)v(KA)

The important thing to notice about this formula is that it’s the agent’s conditional

credences in dependency hypotheses that figure in it.

The value assigned by causal decision theory (henceforth CDT) is given by:

VALCDT = Kc(K)v(KA)

The crucial difference is that now the assignments of values to actions are

sensitive only to the agent’s unconditional credences in dependency hypotheses, not her

credences conditional on her performing A. The effect of this is to hold fixed the agent’s

beliefs about the causal structure of the world, and force us to use the same beliefs about

the causal order of things in determining the choiceworthiness of each candidate action.

Rather than the expected payoffs of smoking being determined by reference to how

Susan thinks the causal structure of the world is likely to be, conditional on her smoking,

and the expected payoffs of not smoking determined by reference to how she thinks the

causal structure of the world is likely to be, conditional on her not smoking, the expected

payoffs of both smoking and not smoking are determined by reference to Susan’s

unconditional beliefs about how the causal structure of the world is likely to be.

Cases like The Smoking Lesion motivate the move from EDT to CDT. In The

Smoking Lesion¸ there is a strong correlation between smoking and getting cancer,

despite the fact that smoking has no tendency to cause cancer, due to the fact that

smoking and cancer have a common cause. Still, since Susan’s c(CANCER|SMOKE) is

5

much higher than her c(CANCER|NOT SMOKE), EDT assigns not smoking a higher

value than smoking. And this seems wrong.

So we have an argument against EDT: The correct theory of rational decision

won’t endorse any irrational actions or policies. In The Smoking Lesion, EDT endorses

an irrational course of action: it’s irrational for Susan not to smoke, and EDT endorses

not smoking. EDT also endorses an irrational policy: it endorses a policy of performing

the action with the greatest evidential value, rather than the action with the best expected

causal upshot. So EDT isn’t the correct theory of rational decision.

CDT, on the other hand, uses the agent’s unconditional credences in dependency

hypotheses to assign values to actions. The effect of this is to make our assignments of

values to actions blind to the sort of common-cause correlations that make EDT’s value

assignments in The Smoking Lesion go bad.

Causal decision theory now looks very attractive. It gets the cases that made

trouble for EDT right, and it seems to get them right for the right reasons – by assigning

the agent’s causal beliefs a special role.

3. The Case against Causal Decision Theory

Causal decision theory is supposed to be a formal way of cashing out the slogan,

“do what you expect will bring about the best results”. The way of implementing this

sound advice is to hold fixed the agent’s unconditional credences in dependency

hypotheses. The resulting theory enjoins us to do whatever has the best expected

outcome, holding fixed out initial views about the likely causal structure of the world.

6

The following examples show that these two principles come apart, and that where they

do, causal decision theory endorses irrational courses of action.

(Obviously I think that each of the cases succeeds in showing this. But it’s not

important that you agree with me about both cases. For my purposes, all I need is one

successful case.)

The Murder Lesion

Mary is debating whether to shoot Alfred. If she shoots and hits, things will

be very good for her. If she shoots and misses, things will be very bad. (Alfred

always finds out about unsuccessful assassination attempts, and he is sensitive

about such things.) If she doesn’t shoot, things will go on in the usual, okay-but-

not-great kind of way. She thinks that it is very likely that, if she were to shoot,

then she would hit. So far, so good. But Mary also knows that there is a certain

sort of brain lesion that tends to cause both murder attempts and bad aim at the

critical moment. Happily for most of us (but not so happily for Mary) most

shooters have this lesion, and so most shooters miss. Should Mary shoot? (Set

aside your theoretical commitments and put yourself in Mary’s situation. Would

you shoot? Would you take yourself to be irrational for not doing so?)

The Psychopath Button4

Paul is debating whether to press the ‘kill all psychopaths’ button. It would,

he thinks, be much better to live in a world with no psychopaths. Unfortunately,

Paul is quite confident that only a psychopath would press such a button. Paul 4 This case was suggested to me by David Braddon-Mitchell.

7

very strongly prefers living in a world with psychopaths to dying. Should Paul

press the button? (Set aside your theoretical commitments and put yourself in

Paul’s situation. Would you press the button? Would you take yourself to be

irrational for not doing so?)

It’s irrational for Mary to shoot. It’s irrational for Paul to press.5 In general,

when you are faced with a choice of two options, it’s irrational to choose the one that you

confidently expect will cause the worse outcome.6 Causal decision theory endorses

shooting and pressing. In general, causal decision theory endorses, in these kinds of

cases, an irrational policy of performing the action which one confidently expects will

cause the worse outcome. The correct theory of rational decision will not endorse

irrational actions or policies. So causal decision theory is not the correct theory of

rational decision.

What’s generating the problem is that the very same mechanism that allows

causal decision theory to deliver the right results in cases like The Smoking Lesion leads

it to deliver the wrong results for cases like The Murder Lesion and The Psychopath

Button.

Let’s look at what happens in The Murder Lesion. (The analysis of The

Psychopath Button will be relevantly similar.) Let S be the proposition that Mary shoots,

5 Some people report that they lack the clear intuition of irrationality for the Murder Lesion case. Pretty much everyone seems to have the requisite intuition for The Psychopath Button, though. That’s enough for my purposes. Personally, I think both cases work as counterexamples to causal decision theory. But all I need is that at least one of them does. 6 Whether it’s irrational in a particular case depends, of course, on just what the payoffs are. It can be worth doing something that’s more likely than not to cause a bad outcome if the low-probability good outcome is good enough. But in the cases above (and as spelled out below), it’s better not to do the thing that you expect will cause the worse outcome. See below for some sample numbers.

8

and H the proposition that Mary hits. The relevant partition of dependency hypotheses is

{SH, S¬H}.

Some constraints on Mary’s credences:

c(SH) > .5.

(Because she’s been going to the shooting range, the gun is well-maintained, accurate and

reliable, Alfred is a large, slow-moving target, etc.)

c(SH|S) < .5

(Because if she shoots, it’s very likely because she has the lesion, and if she has the

lesion, she’s very likely to have bad aim when push comes to shove.)7,8

Mary's value assignments:

v(S.H) = 10

v(S.¬H) = -10

v(¬S) = 0

.

If Mary is a causal decision theorist, she must use c(SH), not c(SH|S), when

7 Another reason: We know that Mary’s c(H|S) < .5, since shooting is such good evidence for having the lesion, and her credence that she hits conditional on both shooting and having the lesion is very low. Given that, we can prove that c(SH|S) < .5: By the definition of conditional probability, c(SH|H) = c(S & SH)/c(S) Since every world in which both S and SH are true is a world in which H is true as well, c(S & SH) ≤ c(SH). So we know that: c(SH|H) ≤ c(SH)/c(S). Again by the definition of conditional probability, c(SH)/c(S) = c(H|S). So c(SH|H) ≤ c(H|S) < .5. 8 Note, for future reference, that c(S) must be < .5 for these credences to be coherent.

9

she's determining the relative values of shooting or not. (Since it’s unconditional

credences in dependency hypotheses that feature in CDT’s formula for determining the

choiceworthiness of actions.) So shooting is going to come out better than not shooting.9

But that’s the wrong result. It’s irrational for Mary to shoot. Unfortunately, if

that’s right, then causal decision theory is wrong.

The same phenomenon occurs in a particularly striking way in time travel cases.

Suppose that you have a time machine, and you are convinced that time travel works in

the single-timeline, no-branching way outlined by Lewis (1976). You want to use your

time machine to preserve some document, thought to be lost in the fire at the library of

Alexandria. One option is to attempt to surreptitiously spirit the document out of the

library before the fire. Another is to attempt to prevent the fire from ever happening. If

you don’t have a firm opinion about which course you’ll actually pursue, you’re likely to

be confident that, if you were to attempt to prevent the fire, you would succeed. (After

all, you’re competent and knowledgeable, you have many willing and able accomplices,

access to excellent equipment, plenty of time to plan and train, etc.)

But you know that the fire really did happen. So you know that any attempt you

make to go back and prevent it will fail.10 It’s irrational to pursue this sort of doomed

plan – a plan that you already know will fail, and the failure of which you take to be

9 Because CDT says that Mary should determine the value of smoking by computing:

Kc(K)v(KA), which in this case gives us: VALCDT(S) = c(SH)v(SH & S) + c(S¬H)v(S¬H & S) Assuming that Mary doesn’t care about dependency hypotheses for their own sakes, v(SH & S) = v(S.H), and v(S¬H & S) = v(S.¬H). (The value of shooting while in a Shoot Hit world is the value of shooting and hitting; the value of shooting while in a Shoot Miss world is the value of shooting and missing.) So we get: VALCDT(S) = c(SH)v(S.H) + c(S¬H)v(S.¬H) And since c(SH) > c(S¬H), it will turn out that VALCDT(S) > 0, and so VALCDT(S) > V(¬S). As Lewis (1981) points out, other formulations of causal versions of decision theory deliver the same results. 10 There are complications. Some of these are discussed in Braddon-Mitchell and Egan (MS).

10

worse than the expected result of some alternative plan – and so it’s irrational to try to

prevent the fire. (Similarly, when you go back in time to set up a holding company that

will, when the investments mature, pay a large lump sum into your bank account, you

should arrange for the cash to be deposited in your account after the last time you

checked your balance and saw that there hadn’t been any large deposits.) But CDT

doesn’t deliver these results. Determining the relative choiceworthiness of actions using

only your unconditional credences in dependency hypotheses makes your ranking of

actions insensitive to your knowledge – knowledge to which your decision-making

should be sensitive – that the past-changing plans are doomed.

Oracle cases are relevantly similar. It’s irrational to try to avoid the fate that the

(infallible) oracle predicts for you. The thing to do, faced with an unpleasant oracular

prediction, is to try to ensure that the predicted fate comes about in the best possible way.

If the oracle predicts that you’ll be bitten by a rabid dog, the thing to do is to get

vaccinated and wear thick clothes so that the bite won’t do much harm, not to poison

your neighbors’ dogs in hopes of avoiding the bite.

(It’s worth pointing out that neither the oracle nor the time-travel cases rely on

absolute certainty. What’s really going on is that, the more reliable you take the oracle,

or your information about the past, to be, the worse an idea it is to try to avert the

predicted fate, or change the apparent past.)

I include the time travel and oracle cases because (a) they provide particularly

stark examples of cases where CDT endorses performing an action that one confidently

expects will bring about a worse outcome than some alternative, and (b) they may serve

to make clearer just what’s gone wrong in the other cases. In these cases, just as in cases

11

like The Murder Lesion and The Psychopath Button, the fact that CDT forces us to use

only the agent’s unconditional credences in dependency hypotheses in determining the

choiceworthiness of actions makes its verdicts blind to features of the agent’s beliefs to

which it should be sensitive – namely, the agent’s confidence that a particular course of

action, if undertaken, is doomed to fail, and bring about a worse outcome than the

alternative.

I don’t want to rest very much argumentative weight on the time travel and oracle

cases, since it’s not completely obvious how big a problem it is for CDT to give the

wrong results in these peculiar sorts of situations – perhaps it’s okay to just bite the bullet

here. In fact, I don’t think that this is very attractive, but it doesn’t really matter. Even if

it is okay to bite the bullet in time travel and oracle cases, it’s not okay to bite the bullet

on The Murder Lesion and The Psychopath Button. Or at least – and this is enough for

my purposes – it’s not okay for the causal decision theorist to bite the bullet on The

Murder Lesion and The Psychopath Button if it’s not okay for the evidential decision

theorist to bite the bullet on The Smoking Lesion.

Here is the moral that I think we should draw from all of this: Evidential decision

theory told us to perform the action with the best expected outcome. Examples like The

Smoking Lesion show us that having the best expected outcome comes apart from having

the best expected causal impact on how things are, and that rationality tracks the latter

rather than the former. So, they show us that evidential decision theory is mistaken.

Causal decision theory told us to perform the action which, holding fixed our current

views about the causal structure of the world, has the best expected outcome. Examples

like The Murder Lesion and The Psychopath Button show us that this too comes apart

12

from having the best expected causal impact on how things are. So, they show us that

causal decision theory is mistaken.11

4. Objections, Responses, and Further Problems

There are some responses available to the causal decision theorist. Unfortunately,

I don’t think that any of them work. In fact, the most promising response fails in a way

that shows us that the problem is actually quite a bit worse than I’ve suggested so far, and

that the advocate of evidential decision theory ought to take no comfort in the difficulties

for CDT.

Are the cases too science-fictional and/or morally loaded to make good counterexamples?

One might be concerned that the cases I’ve used against CDT – The Murder

Lesion and The Psychopath Button – are either too science-fictional or too morally loaded

to make good counterexamples, perhaps because our intuitions about such cases are not

to be trusted. I’m inclined to insist on the legitimacy of the cases as given. I’m

particularly inclined to insist in the case of the ‘too science fictional’ objection, because

all that’s needed is a case where the subject believes that there are the relevant sorts of

lesions, buttons, oracles, or what have you – the actual presence of the science fictional

apparatus is not important. But it’s not important that you agree with me about the cases

as given. Once you know where to look, there are many more such cases to be found,

11 What they actually show us so far is that causal decision theory, as stated in Lewis 1979 is mistaken. Lewis argues that other versions of causal decision theory will deliver the same results. If he is correct, then we have a perfectly general problem. (And even if he is wrong, we may well have a perfectly general problem, so long as the differences don’t effect the theory’s endorsements in the cases in question.) As I have neither the space nor the expertise to provide a useful evaluation of Lewis’s arguments that his articulation of CDT really does give the same results as other versions of the view, I will simply take his word for it, and continue on the assumption that we really do have a general problem for CDT here.

13

and many of them are much less exotic, and less fraught with potentially distracting

moral issues.

For example, it’s easy to modify The Smoking Lesion in order to make it a

counterexample to CDT rather than EDT. We just have to change the case in the

following way: Rather than letting Suzy believe that the lesion (a) causes one to smoke,

and (b) causes one to get cancer, let her believe that the lesion (a) causes one to smoke,

and (b) causes one’s lungs to be vulnerable to cigarette smoke, such that smoking causes

cancer in those with the lesion, but not in those without.

In this sort of situation, it is irrational to smoke. But CDT still endorses smoking.

Further, this modified smoking lesion case is certainly not objectionably morally loaded.

Nor is it objectionably science-fictional. At least, it’s not objectionably science-fictional

unless the original Smoking Lesion case is objectionably science-fictional. So as long as

The Smoking Lesion succeeds as a counterexample to EDT, the modified smoking lesion

case will succeed as a counterexample to CDT.

This is an instance of a quite general recipe for generating counterexamples to

CDT: Start with a counterexample to EDT in which some condition is (believed to be) a

common cause both of some action A and of some undesirable outcome O. Change the

case so that, rather than directly causing O, the condition puts in place an enabling

condition which allows A to cause O. Finally, point out to your audience that our

intuitions about what one ought to do switch when we change the causal background in

this way, while CDT’s recommendations remain the same.12

These anti-CDT examples will be no more science-fictional or morally loaded

than the original anti-EDT examples we started with. If the original examples were 12 Thanks to Martin Smith for pointing out this recipe.

14

unacceptable, then CDT is unmotivated – we don’t have a counterexample to EDT. If the

original examples were acceptable, then the modified examples are well, and CDT is

subject to counterexamples. Neither outcome is a good one for the advocate of CDT.

Do the cases put unacceptable constraints on the agents’ credences regarding their own

actions?

Notice that, in order for CDT to endorse shooting in The Murder Lesion, Mary

must start off confident that, if she were to shoot, she would hit. For her to be confident

of this, she must also start off confident that she does not have the lesion. And so she

must start off confident that she will not shoot. Similarly, for CDT to endorse pressing in

The Psychopath Button, Paul to start off confident that, if he were to press the button, he

would live. For him to be confident of this, he must start off confident that he is not a

psychopath. And so he must start off confident that he will not press the button. So in

order for my cases to work, the agents credences about what they are likely to do must be

a certain fairly specific way. Is this a problem?

No. The cases do indeed place some constraints on the agents’ credences

regarding their own future actions. But so too do the examples, like The Smoking Lesion,

that motivate CDT over EDT. For those cases to succeed, the agents mustn’t be certain

of what they’re going to choose. So if the fact that a case places any constraints on the

agent’s credences about their own future actions renders it ineligible to serve as a

counterexample, then the counterexamples to EDT will be ruled out along with the

counterexamples to CDT, and CDT loses its motivation..

15

But perhaps it’s not the fact that a case places some constraints on the agent’s

credences that rules it out as a counterexample, but the fact that it places a certain,

objectionable sort of constraint on the agent’s credences, that rules it out as a

counterexample. And while the counterexamples to EDT impose only innocent

constraints, those imposed by the would-be counterexamples to CDT are objectionable.

I don’t think that there is any plausible way to cash out the distinction between

innocent and objectionable constraints that will deliver this result. Certainly neither the

counterexamples to EDT nor the counterexamples to CDT require the agents to have

credences that violate the constraints of Bayesian rationality. And it’s unclear where else

we might non-arbitrarily draw the line.

(Also, given the availability of the strategy for converting anti-EDT examples into

anti-CDT examples outlined above, it looks as if there won’t be any sort of constraint that

only the anti-CDT examples need to impose.)

We might also be concerned that the putative counterexamples are illegitimate

because they force agents to have credences about their own actions, and that this is

unacceptable – agents don’t, or ought not to, have any credences at all in propositions

about which actions they will freely perform. If A is a proposition stating which action I

will perform, c(A) should not be defined. (Or, alternatively, should not take any value

other than 0 or 1.)13

But in fact the cases don’t require the agents to have such credences. At no point

do we need to appeal directly to Mary’s or Paul’s credences about which action they’re

going to perform when calculating the values that CDT assigns to the candidate actions in

The Murder Lesion or The Psychopath Button. What we do need to appeal to is the 13 See, for example, Levi 1997, Kyburg 1988, Gilboa 1994, and Spohn 1977 for views of this kind.

16

agents’ conditional credences of the form c(P|A), where A is a proposition stating which

action they will perform. (For example, Mary’s credence that she has the lesion

conditional on her shooting, and Paul’s credence that he is a psychopath conditional on

his pressing.) For the cases to work, Mary’s and Paul’s conditional credences of this sort

do need to meet certain constraints. And those constraints are enough, if we accept the

standard formula for conditional probability (that is: c(B|A) = c(AB)/c(A)), to impose

constraints on Mary’s and Paul’s credences about what they’re going to do.

Proponents of the view that we cannot have well-defined credences (or cannot

have well-defined credences other than 0 or 1) in propositions specifying which free

actions we will perform ought not to deny that we can have well-defined conditional

credences for various outcomes, conditional on our various possible choices. What they

ought to do is deny that the standard formula for conditional credences is correct. And in

fact, taking conditional credences to be primitive, or at least separating them to some

extent from the standard formula, is independently well-motivated. (Price (1986),

Edgington (1995), and Hajek (2003), for example, are all advocates of separating, to

some extent, c(B|A) from c(AB)/c(A).)

So even if we don’t want to admit well-defined credences (other than 0 and 1) for

propositions about which free actions I’ll perform, we can still admit well-defined

conditional credences of various outcomes conditional on my various candidate actions.

And it’s these conditional credences, not the unconditional credences in the performance

of the actions, that are actually doing the heavy lifting in the examples. If we do allow

that Mary and Paul have well-defined unconditional credences for propositions like

SHOOT and PRESS, and we accept the standard formula as a definition of conditional

17

credence, then we do get some results about just what their credences in those

propositions have to be. But this conditional result should be unobjectionable.

Finally, notice two things: we also need to appeal to such conditional credences in

order to determine EDT’s endorsements in the cases (like The Smoking Lesion) that are

supposed to provide the motivation to abandon EDT in favor of CDT. So if this reliance

on well defined c(P|A)s undermines my counterexamples to CDT, it undermines the

CDTer’s counterexamples to EDT as well. So this is a bad defense for the advocate of

CDT to appeal to: if it succeeds, CDT is unmotivated. If it fails, CDT is subject to

counterexamples.

Second, giving up even these conditional credences really does seem like it will

lead to very serious trouble. A theory according to which we’re not allowed to have any

views at all about what’s likely or unlikely, conditional our choosing one thing rather

than another, cannot be correct. Certainly it cannot underwrite a theory of rational

decision.

Can we fix everything by going ratificationist?

Consider Paul’s situation as he deliberates about whether or not to press the ‘kill

all psychopaths’ button. Suppose that Paul is an orthodox causal decision theorist.

Pressing will, at the beginning of his deliberations, look better than refraining. Paul

becomes convinced that pressing is the thing to do, and so he becomes convinced that he

will, at the end of his deliberations, choose to press. But as Paul becomes more and more

convinced that he’s going to choose to press, he becomes more and more confident that

he’s a psychopath. And as he becomes more and more confident that he’s a psychopath,

18

pressing starts to look like less and less of a good idea. At a certain point, as Paul

becomes increasingly convinced that he’s going to press, CDT will stop telling him to

press, and start telling him to refrain.

Pressing the psychopath button is unratifiable by the lights of CDT: it’s

impossible for Paul both to be convinced that he will press the button, and also to

rationally endorse doing so. It’s tempting to think that we can exploit this fact in order to

save (a version of) CDT from the apparent counterexamples, by imposing a ratifiability

requirement on rational actions.

Perhaps the simplest way to impose a ratifiability requirement is just to add the

following Maxim of Ratifiability to our original version of EDT:

Maxim of Ratifiability. An agent can rationally perform act A only if A is ratifiable in the sense that there is no alternative B such that VALCDT(B) exceeds VALCDT(A) on the supposition that A is decided upon.

The resulting theory tells us that it’s rational to perform an action A iff:

1) A is ratifiable

2) There is no other ratifiable option with greater (present) VALCDT than A.

(Another way to implement a ratificationist version of CDT is just to say that it’s

rational to perform A iff A is ratifiable, in the sense specified in the Maxim of

Ratifiability above. For our purposes here, we needn’t decide which of these

ratificationist theories is better – the differences between them will not be relevant to the

objections that I’ll make below.)

A version of CDT that includes a ratifiability requirement will not endorse

shooting in The Murder Lesion, or pressing in The Psychopath Button. When Mary

19

becomes convinced that she will choose to shoot, shooting will look bad to her –

VALCDT(SHOOT) will be less than VALCDT(NOT SHOOT). When Paul becomes

convinced that he will choose to press, pressing will look bad to him – VALCDT(PRESS)

will be less than VALCDT(NOT PRESS). So a theory that rules out unratifiable actions as

irrational will not deliver the bad endorsements that we got from the version of CDT that

we considered above, which did not include a ratifiability requirement. Problem solved?

Unfortunately, no. There are two reasons why this response fails. The first is that,

if successful, it does too much: If an appeal to ratifiability succeeds here, then the

EDTer’s appeal to ratifiability in the cases that were supposed to motivate the move to

CDT will succeed as well. (In fact, the appeal to ratifiability was originally a move in

defense of EDT in the face of just such examples - see Jeffrey 1983.) In The Smoking

Lesion, not smoking is unratifiable: once Susan becomes convinced that she will choose

not to smoke, her smoking or not ceases to be evidence one way or the other for her

having the lesion, and smoking looks better, by EDT’s lights, than not. So again, we

have a situation in which, if the CDTer’s defense works, it works for the EDTer as well,

and CDT loses its motivation. (But see Joyce (forthcoming) for an argument that the

appeal to ratifiability is, in fact, only available to causal decision theorists.)

The second difficulty with this response is that it doesn’t do enough. Here are

two constraints on any adequate theory of rational decision:

SOUNDNESS: If it’s irrational to φ, the correct theory of rational decision will not

endorse φing.

20

COMPLETENESS: If it’s rational to φ, the correct theory of rational decision will

endorse φing.

While the imposition of a ratifiability requirement prevents CDT from falling

afoul of the SOUNDNESS requirement, the resulting theory falls afoul of

COMPLETENESS.

In The Psychopath Button, it’s irrational for Paul to press. It’s rational for Paul to

refrain from pressing. Neither action, however, is ratifiable. (When Paul becomes

convinced that he will choose to refrain, he will become quite confident that he is not a

psychopath, and pressing will look better than refraining.)

It’s rational for Paul to refrain. So the correct theory of rational decision will

endorse refraining. Refraining is not ratifiable. So no theory that imposes a ratifiability

requirement will endorse refraining. So no theory that imposes a ratifiability requirement

is the correct theory of rational decision.14

This shows us that imposing a ratifiability requirement will not help us to save

CDT. It also shows us that what fans of EDT should take no comfort in the difficulties

for CDT – what we have here is definitely not an argument for a return to EDT. These

cases are all counterexamples to versions of EDT that impose ratifiability requirements,

as well, and these seem to be the only versions of EDT with the resources to deal with

cases like The Smoking Lesion.

14 This, incidentally, also demonstrates the important difference between cases like The Murder Lesion and The Psychopath Button and cases like Gibbard and Harper’s (1978) Death in Damascus, in which it’s also the case that neither option is ratifiable. In the cases we’re concerned with, unlike in Death in Damascus, we still have clear intuitions about which action it’s rational to perform.

21

So things are actually worse than I’ve been making them out to be – these cases

are trouble not just for CDT, but also for any version of EDT with the resources to avoid

refutation at the hands of common-cause based counterexamples like The Smoking Lesion.

In fact, there are cases where imposing a ratifiability requirement makes things

worse, particularly for CDT. Consider the following modification of the original

Newcomb’s problem:

Newcomb’s Firebomb

There are two boxes before you. Box A definitely contains $1,000,000. Box B definitely

contains $1,000. You have two choices: take only box A (call this one-boxing), or take

both boxes (call this two-boxing). You will signal your choice by pressing one of two

buttons. There is, as usual, an uncannily reliable predictor on the scene. If the predictor

has predicted that you will two-box, he has planted an incendiary bomb in box A, wired

to the two-box button, so that pressing the two-box button will cause the bomb to

detonate, burning up the $1,000,000. If the predictor has predicted that you will one-box,

no bomb has been planted – nothing untoward will happen, whichever button you press.

The predictor, again, is uncannily accurate.

It is, I submit, rational to one-box, and irrational to two-box, in Newcomb’s

Firebomb. (You should expect that, if you press the two-box button, you will be causing

the incineration of your $1,000,000, which is certainly sitting there in Box A just waiting

for you to carry it off to the bank. Crucially, it is your choice will cause its incineration –

22

this is the key difference between Newcomb’s Firebomb and the original Newcomb’s

problem.)

But neither option is ratifiable. A ratificationist theory will not endorse two-

boxing, but it won’t endorse one-boxing either. So if we adopt a ratificationist theory, we

will be forced to say that there is no rational option in this case. And this seems wrong –

one boxing is pretty clearly the rational thing to do here.

The imposition of a ratifiability requirement makes things worse in this case,

because versions of CDT that do not include a ratifiability requirement deliver, on almost

every way of spelling out the case, the verdict that it’s rational to one-box. (The

exceptions are cases in which one starts off extremely confident that one is going to

choose one-boxing, and so starts off extremely confident that there is no bomb in box A.)

Holding fixed any but the most extreme credences about whether or not there’s a bomb in

box A, we get the result that one-boxing has greater VALCDT than two-boxing. It is only

in the cases where one assigns a very, very low unconditional credence to the presence of

the firebomb that CDT will tell us that the possibility of gaining the extra $1,000 is worth

the risk of setting fire to the $1,000,000.

So CDT without a ratifiability requirement almost always tells us, in accordance

with our intuitions about the case, that it is rational to one-box, and irrational to two-box,

in Newcomb’s Firebomb. In fact, the cases where it arguably goes wrong by endorsing

two-boxing are ones where, perhaps, our intuitions are not so clear – after all, the

defender of CDT might say, if you were that certain that there wasn’t a bomb, wouldn’t it

really be worth the risk?

23

Ratificationist versions of CDT, however, can never endorse one-boxing in

Newcomb’s Firebomb. This is still more bad news, I think, for the ratificationist defense

of CDT. Evidentialists cannot rejoice in this, however – the news for the ratificationist

defense of EDT is equally bad, as ratificationist EDT also fails to endorse the unratifiable

option of one-boxing in Newcomb’s Firebomb.

Conclusion

If all of the above is correct, causal decision theory is in a bad way. Either it’s

subject to counterexamples, or there’s no reason to prefer it to EDT. That’s what I hope

to have shown above, and that is what I’m primarily concerned to emphasize in this

concluding section. I will close, though, with some speculation about what’s gone wrong

and the how to fix it.

What conclusions should we draw from all this? I take cases like The Smoking

Lesion to show that EDT is informed by the wrong principle of rational decision. It’s

informed by the principle (roughly), do the thing which would give you the best evidence

that the best things are happening. Where the advice of this principle comes apart from

that of the principle, do what’s most likely to bring about the best results, it delivers

advice that it’s irrational to follow. Enter causal decision theory, which aims to give a

satisfactory formal characterization of the correct, causal principle. What I take cases

like The Murder Lesion and The Psychopath Button to show is that Lewisian CDT’s

formal characterization of the informal principle isn’t satisfactory. The principle that

Lewisian CDT actually endorses, do what has the best expected outcome, holding fixed

24

your current views about the causal structure of the world, isn’t quite the right way of

understanding the original principle, do what’s most likely to bring about the best results.

My hope, then, is that there will be an alternative formal theory which provides a

better understanding of the appealing principle. I regret that I do not have such a theory

to offer.

25

References:

Braddon-Mitchell, D. and Egan, A. (MS) How Ignorance Empowers Time Travelers Edgington, D. (1995) On Conditionals, Mind 104:235-329. Eells, E., (1982) Rational Decision and Causality, Cambridge University Press Gibbard, A. and Harper, W. (1978) Counterfactuals and Two Kinds of Expected Utility,

in Hooker, Leach, and McClennen, eds., Foundations and Applications of Decision Theory, v.1, Dordrecht: Riedel.

Gilboa, I. (1994) Can Free Choice Be Known?, in Bicchieri, Jeffrey, and Skyrms (eds) The Logic of Strategy, Oxford University Press.

Hajek, A. (2003) What Conditional Probability Could Not Be, Synthese 137: 273-323. Jeffrey, R. (1983) The Logic of Decision, 2nd Edition, McGraw-Hill Joyce, J. (forthcoming) Are Newcomb Problems Really Decisions? Kyburg, H. (1988) Powers, in Harper and Skyrms (eds) Causation in Decision, Belief

Change, and Statistics, Kluwer. Levi, I. (1997) The Covenant of Reason: Rationality and the Commitments of Thought,

Cambridge: Cambridge University Press. Lewis, D. (1976) The Paradoxes of Time Travel, American Philosophical Quarterly 13:

145-52. Lewis, D. (1979) Prisoners’ Dilemma Is a Newcomb Problem, Philosophy and Public

Affairs 8: 239-49. Lewis, D. (1981) Causal Decision Theory, Australasian Journal of Philosophy 59: 5-30. Nozick, R. (1969) Newcomb’s Problem and Two Principles of Choice, in Rescher, ed.,

Essays in Honor of Carl G. Hempel (Dordrecht: Riedel). Spohn, W. (1977) Where Luce and Kranz Do Really Generalize Savage’s Decision

Model, Erkenntnis 11: 113-134.

some counterexamples to causal decision theory1 andy …fitelson.org/few/few_05/egan.pdf · 1 some...

Documents