simplicity truth zen

159
Simplicity Truth Zen Kevin T. Kelly Conor Mayo-Wilson Hanti Lin Department of Philosophy Carnegie Mellon University http://www.andrew.cmu.edu/user/kk3n/ockham/ Ockham.htm

Upload: tim

Post on 24-Feb-2016

26 views

Category:

Documents


0 download

DESCRIPTION

Simplicity Truth Zen. Kevin T. Kelly Conor Mayo-Wilson Hanti Lin Department of Philosophy Carnegie Mellon University http://www.andrew.cmu.edu/user/kk3n/ockham/Ockham.htm. Theoretical Underdetermination. I dare you to choose one!. ???. Western Epistemology. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Simplicity Truth Zen

SimplicityTruthZen

Kevin T. Kelly Conor Mayo-Wilson

Hanti LinDepartment of Philosophy

Carnegie Mellon Universityhttp://www.andrew.cmu.edu/user/kk3n/ockham/Ockham.htm

Page 2: Simplicity Truth Zen

Theoretical Underdetermination

???

I dare you to choose one!

Page 3: Simplicity Truth Zen

Western Epistemology The demon undermines science, so

defeat him.

I smite thee with scientific rationality!

R

Page 4: Simplicity Truth Zen

Western Epistemology The demon undermines science, so

defeat him. Portray him as weak.

You can’t fool me!

R

Page 5: Simplicity Truth Zen

Zen Epistemology The demon justifies science with his

strength. Stronger demons justify weaker

inferences!

Page 6: Simplicity Truth Zen

Zen Epistemology

How is this supposed to work, eh?

Page 7: Simplicity Truth Zen

Zen Epistemology• If you could do better, you should. • But because of me, you can’t. • So you are optimal and, hence, justified.

Page 8: Simplicity Truth Zen

Zen EpistemologyIsn’t that how computer scientists routinely evaluate algorithms?

Yes.

Page 9: Simplicity Truth Zen

Zen EpistemologyBut computing is deduction. Isn’t induction a completely different animal?

No. That’s what formal learning theory is about.

Page 10: Simplicity Truth Zen

Zen EpistemologyCool. But how does it apply to science?

Page 11: Simplicity Truth Zen

Ockham’s Razor

Page 12: Simplicity Truth Zen

Planetary retrograde motion

MarsEarth

Sun

Astronomy 1543

Only at solar opposition

Page 13: Simplicity Truth Zen

Ptolemaic Explanation

q

Retrograde path is due to an epicycle.Can happen regardless of the position of the sun

epicycle

Page 14: Simplicity Truth Zen

Copernican Explanation“Lapped” competitor appears to backtrack against the bleachers.

lappingq

epicycle

Page 15: Simplicity Truth Zen

Copernicus VictoriousExplains the data rather than merely accommodating them.

Has fewer adjustable parameters.Is more falsifiable (failure of correlation would rule it out).

Is more unified.Is simpler.

Page 16: Simplicity Truth Zen

More Victories for Simplicity

universal gravitation vs. divided cosmos

wave theory of light vs. particles and aether

oxygen vs. phlogistonnatural selection vs. special creationspecial relativity vs. classical electrodynamics

Dirac’s equationchaos vs. infinite series of random variables

Page 17: Simplicity Truth Zen

Puzzle An indicator must be sensitive to

what it indicates.

simple

Page 18: Simplicity Truth Zen

Puzzle An indicator must be sensitive to

what it indicates.

complex

Page 19: Simplicity Truth Zen

Puzzle But Ockham’s razor always points

at simplicity.

simple

Page 20: Simplicity Truth Zen

Puzzle But Ockham’s razor always points

at simplicity.

complex

Page 21: Simplicity Truth Zen

Puzzle How can a broken compass help

you find something unless you already know where it is?

complex

Page 22: Simplicity Truth Zen

Metaphysicians for Ockham

Somehow, I don’t like the sound of that…

Page 23: Simplicity Truth Zen

Information Channel

Simplicity bias Simple reality

Mystical visionNatural selection??

Page 24: Simplicity Truth Zen

Pre-established Harmony

Simplicity bias Simple reality

G

Page 25: Simplicity Truth Zen

Idealism

Simplicity bias Simple reality

Page 26: Simplicity Truth Zen

Metaphysicians for Ockham

So before you can use your razor to eliminateMetaphysical causes, you have to assume metaphysical causes.

Page 27: Simplicity Truth Zen

Theoretical “Virtues”Simpler theories:1. Are more unified;2. Explain better;3. Are more falsifiable.

But the truth might not be virtuous.Wishful thinking to assume

otherwise.

Page 28: Simplicity Truth Zen

Statistical Explanations1. Prior Simplicity Bias

Bayes, BIC, MDL, MML, etc.

2. Risk MinimizationSRM, AIC, cross-validation, etc.

Page 29: Simplicity Truth Zen

Prior Simplicity Bias

I am inclined to believe the simpler theory because I am inclined to believe that the world is simple.

Page 30: Simplicity Truth Zen

More Subtle VersionSimple data are a miracle in the complex theory but not in the simple theory.

P C

Has to be!

Page 31: Simplicity Truth Zen

However… correlation would not be a miracle given P(q);

Why not this?

P C

q

Page 32: Simplicity Truth Zen

The Real MiracleIgnorance about model: p(C) p(P);

+ Ignorance about parameter setting: p’(P(q) | P) p(P(q’ ) | P).

= Knowledge about worlds.p(P(q)) << p(C).

CP

qqqqqqqq

Ignorance is knowledge.

Page 33: Simplicity Truth Zen

= Paradox of IndifferenceIgnorance of red vs. not-red

+ Ignorance over not-red: = Knowledge about red vs. white.

qq

Knognorance = All the priveleges of knowledgeWith none of the responsibilitiesI’m for it!

Page 34: Simplicity Truth Zen

In Any Event

The coherentist foundations of Bayesianism have nothing to do with short-run truth-conduciveness.Not so loud!

Page 35: Simplicity Truth Zen

Ockham Bayesians Converge

T1

T2

T3

Page 36: Simplicity Truth Zen

Ockham Bayesians Converge

T1

T2

T3

Page 37: Simplicity Truth Zen

Ockham Bayesians Converge

T1

T2

T3

Page 38: Simplicity Truth Zen

Ockham Bayesians Converge

T1

T2

T3

Page 39: Simplicity Truth Zen

Ockham Bayesians Converge

T1

T2

T3

Oops!

Page 40: Simplicity Truth Zen

Ockham Bayesians Converge

T1

T2

T3

Page 41: Simplicity Truth Zen

Ockham Bayesians Converge

T1

T2

T3

Page 42: Simplicity Truth Zen

Ockham Bayesians Converge

T1

T2

T3

Page 43: Simplicity Truth Zen

Ockham Bayesians Converge

T1

T2

T3

Oops!

Page 44: Simplicity Truth Zen

Ockham Bayesians Converge

T1

T2

T3

Oops!

Page 45: Simplicity Truth Zen

But So Would Other Methods

T1

T2

T3

Oops!

Page 46: Simplicity Truth Zen

Summary of Bayesian Approach

Prior-based explanations of Ockham’s razor are circular.

Convergence-based explanations of Ockham’s razor fail to single out Ockham’s razor.

Page 47: Simplicity Truth Zen

2. Risk Minimization Ockham’s razor minimizes

expected distance of empirical estimates from the true value.

Truth

Page 48: Simplicity Truth Zen

Unconstrained Estimates

are Centered on truth but spread around it.

Pop!Pop!Pop!Pop!

Unconstrained aim

Page 49: Simplicity Truth Zen

Off-center but less spread.

Clamped aim

Truth

Constrained Estimates

Page 50: Simplicity Truth Zen

Off-center but less spread Overall improvement in expected

distance from truth…

Truth

Pop!Pop!Pop!Pop!

Constrained Estimates

Clamped aim

Page 51: Simplicity Truth Zen

Doesn’t Find True Theory

False theories that aren’t too false theory typically predict more accurately than the true theory.

Four eyes!

Clamped aim

Page 52: Simplicity Truth Zen

When Theory Doesn’t Matter

Predicting lung cancer from ash trays.

Page 53: Simplicity Truth Zen

When Theory Does Matter

Predicting effectiveness of a ban on ash trays.

Page 54: Simplicity Truth Zen

Correlation does imply causation if there are multiple variables, some of which are common effects. [Pearl, Spirtes, Glymour and Scheines, etc.]

Great News

Protein A

Protein BProtein C Cancer protein

Page 55: Simplicity Truth Zen

Joint distribution p is causally compatible with causal network G iff:

Causal Markov Condition: each variable X is independent of its non-effects given its immediate causes.

Faithfulness Condition: no other conditional independence relations hold in p.

Core assumptions

Page 56: Simplicity Truth Zen

F1 F2

Tell-tale CorrelationsH C

FGiven F, H gives some info about C(Faithfulness)

C

Given C, F1 gives no further info about F2(Markov)

Page 57: Simplicity Truth Zen

Standard Applications Linear Causal Case: each variable

X is a linear function of its parents and a normally distributed hidden variable called an “error term”. The error terms are mutually independent.

Discrete Multinomial Case: each variable X takes on a finite range of values.

Page 58: Simplicity Truth Zen

The CatchProtein A

Protein BProtein C Cancer protein

It’s full of protein C!

English Breakfast?

Page 59: Simplicity Truth Zen

As the Sample Increases…

Protein A

Protein BProtein C Cancer proteinweak

Protein D

I do! Bon apetit!

This situation empiricallyapproximates the last one. So who cares?

Page 60: Simplicity Truth Zen

As the Sample Increases Again…

Protein A

Protein BProtein C Cancer protein

Wasn’t that last approximationto the truth good enough?

weak

Protein D

Aaack!

Protein Eweaker

weaker

Page 61: Simplicity Truth Zen

Causal Flipping Theorem

For each convergent M and standard (G, p), there exists standard (G’, p’ ) such that:

p’ is indistinguishable from p at the current sample size and

consistent M flips the orientation of causal arrow X Y in (G’, p’ ) at least n times as sample size increases.

oops

I meant oops

oopsI meant

I meant

Page 62: Simplicity Truth Zen

Simulation Using CPC Algorithm

Proportion of outputs out of 1000 trials at each sample size.

Page 63: Simplicity Truth Zen

Zen Response Many explanations have been

offered to make sense of the here-today-gone-tomorrow nature of medical wisdom — what we are advised with confidence one year is reversed the next — but the simplest one is that it is the natural rhythm of science.

(Do We Really Know What Makes us Healthy, NY Times Magazine, Sept. 16, 2007).

Page 64: Simplicity Truth Zen

Pursuit of Truth Short-run Tracking

Too strong to be feasible when theory matters.

Page 65: Simplicity Truth Zen

Pursuit of Truth Long-run Convergence

Too weak to single out Ockham’s razor

Page 66: Simplicity Truth Zen

The Middle Way Straightest Convergence

Just right?

Page 67: Simplicity Truth Zen

Ancient Roots"Living in the midst of ignorance and considering themselves intelligent and enlightened, the senseless people go round and round, following crooked courses, just like the blind led by the blind." Katha Upanishad, I. ii. 5, c. 600 BCE.

Page 68: Simplicity Truth Zen

II. Navigation by Broken Compass

simple

Page 69: Simplicity Truth Zen

Asking for Directions

Where’s …

Page 70: Simplicity Truth Zen

Asking for Directions

Turn around. The freeway ramp is on the left.

Page 71: Simplicity Truth Zen

Asking for Directions

Goal

Page 72: Simplicity Truth Zen

Best Route

Goal

Page 73: Simplicity Truth Zen

Best Route to Any Goal

Page 74: Simplicity Truth Zen

Disregarding Advice is Bad

Extra U-turn

Page 75: Simplicity Truth Zen

Helpful A Priori Advice

…so fixed advice can help you reach a hidden goalwithout circles, evasions, or magic.

Page 76: Simplicity Truth Zen

Polynomial Laws Data = open intervals around Y

at rational values of X.

Page 77: Simplicity Truth Zen

Polynomial Laws No effects:

Page 78: Simplicity Truth Zen

Polynomial Laws First-order effect:

Page 79: Simplicity Truth Zen

Polynomial Laws Second-order effect:

Page 80: Simplicity Truth Zen

In Step with the Demon

ConstantLinear

QuadraticCubic

There yet?Maybe.

Page 81: Simplicity Truth Zen

In Step with the Demon

ConstantLinear

QuadraticCubic

There yet?Maybe.

Page 82: Simplicity Truth Zen

In Step with the Demon

ConstantLinear

QuadraticCubic

There yet?Maybe.

Page 83: Simplicity Truth Zen

In Step with the Demon

ConstantLinear

QuadraticCubic

There yet?Maybe.

Page 84: Simplicity Truth Zen

Passing the Demon

ConstantLinear

QuadraticCubic

There yet?Maybe.

Page 85: Simplicity Truth Zen

Passing the Demon

ConstantLinear

QuadraticCubic

I know you’re coming!

Page 86: Simplicity Truth Zen

Passing the Demon

ConstantLinear

QuadraticCubic

Maybe.

Page 87: Simplicity Truth Zen

Passing the Demon

ConstantLinear

QuadraticCubic

!!!

Hmm, it’s quite nice here…

Page 88: Simplicity Truth Zen

Passing the Demon

ConstantLinear

QuadraticCubic

You’re back!Learned your lesson?

Page 89: Simplicity Truth Zen

Ockham Violator’s Path

ConstantLinear

QuadraticCubic

See, you shouldn’t run aheadEven if you are right!

Page 90: Simplicity Truth Zen

Ockham Path

ConstantLinear

QuadraticCubic

Page 91: Simplicity Truth Zen

Empirical Problems

T1 T2 T3

Set K of infinite input sequences. Partition of K into alternative

theories.K

Page 92: Simplicity Truth Zen

Empirical Methods

T1 T2 T3

Map finite input sequences to theories or to “?”.

K

T3

e

Page 93: Simplicity Truth Zen

Method Choice

T1 T2 T3

e1 e2 e3 e4

Input history

Output historyAt each stage, scientist can choose a new method (agreeing with past theory choices).

Page 94: Simplicity Truth Zen

Aim: Converge to the Truth

T1 T2 T3

K

T3 ? T2 ? T1 T1 T1 T1 . . .T1 T1 T1

Page 95: Simplicity Truth Zen

Retraction Choosing T and then not choosing

T next

T’T

?

Page 96: Simplicity Truth Zen

Aim: Eliminate Needless Retractions

Truth

Page 97: Simplicity Truth Zen

Aim: Eliminate Needless Retractions

Truth

Page 98: Simplicity Truth Zen

Aim: Eliminate Needless Delays to Retractions

theory

Page 99: Simplicity Truth Zen

applicationapplicationapplication

applicationcorollary

applicationtheory

applicationapplicationcorollary applicationcorollary

Aim: Eliminate Needless Delays to Retractions

Page 100: Simplicity Truth Zen

An Epistemic Motive1. Future retraction is Gettier

situation even if current belief is true.

2. If Gettier situations are bad, more of them are worse.

Page 101: Simplicity Truth Zen

Easy Retraction Time Comparisons

T1 T1 T2 T2

T1 T1 T2 T2 T3 T3T2 T4 T4

T2 T2

Method 1

Method 2

T4 T4 T4

. . .

. . .

at least as many at least as late

Page 102: Simplicity Truth Zen

Worst-case Retraction Time Bounds

T1 T2

Output sequences

T1 T2

T1 T2

T4

T3

T3

T3

T3

T3 T3

T4

T4

T4

T4 T4

. . .

(1, 2, ∞)

. . .

. . .

. . .. . .. . .

T4

T4

T4

T1 T2 T3 T3 T3 T4T3 . . .

Page 103: Simplicity Truth Zen

IV. Ockham Without Circles, Evasions,

or Magic

Page 104: Simplicity Truth Zen

Empirical Effects

Page 105: Simplicity Truth Zen

Empirical Effects

Page 106: Simplicity Truth Zen

Empirical Effects

May take arbitrarily long to discover

Page 107: Simplicity Truth Zen

Empirical Effects

May take arbitrarily long to discover

Page 108: Simplicity Truth Zen

Empirical Effects

May take arbitrarily long to discover

Page 109: Simplicity Truth Zen

Empirical Effects

May take arbitrarily long to discover

Page 110: Simplicity Truth Zen

Empirical Effects

May take arbitrarily long to discover

Page 111: Simplicity Truth Zen

Empirical Effects

May take arbitrarily long to discover

Page 112: Simplicity Truth Zen

Empirical Effects

May take arbitrarily long to discover

Page 113: Simplicity Truth Zen

Empirical Theories True theory determined by which

effects appear.

Page 114: Simplicity Truth Zen

Empirical Complexity

More complex

Page 115: Simplicity Truth Zen

Assume No Short PathsWeaker results if some path is shorter than some other path.

Page 116: Simplicity Truth Zen

Ockham’s Razor Don’t select a theory unless it is

uniquely simplest in light of experience.

Page 117: Simplicity Truth Zen

Stalwartness Don’t retract your answer while it is

uniquely simplest.

Page 118: Simplicity Truth Zen

Stalwartness Don’t retract your answer while it is

uniquely simplest.

Page 119: Simplicity Truth Zen

Timed Retraction Bounds

r(M, e, n) = the least timed retraction bound covering the total timed retractions of M along input streams of complexity n that extend e

Empirical Complexity 0 1 2 3 . . .

. . .

M

Page 120: Simplicity Truth Zen

Efficiency of Method M at e

M converges to the truth no matter what;

For each convergent M’ that agrees with M up to the end of e, and for each n: r(M, e, n) r(M’, e, n)

Empirical Complexity 0 1 2 3 . . .

. . .

M M’

Page 121: Simplicity Truth Zen

M is Beaten at e There exists convergent M’ that

agrees with M up to the end of e, such that r(M, e, n) > r(M’, e, n), for each n.

Empirical Complexity 0 1 2 3 . . .

. . .

M M’

Page 122: Simplicity Truth Zen

Basic Idea Ockham efficiency: Nature can

force arbitary, convergent M to produce the successive answers down an effect path arbitrarily late, so stalwart, Ockham solutions are efficient.

Page 123: Simplicity Truth Zen

Basic Idea Unique Ockham efficiency: A

violator of Ockham’s razor or stalwartness can be forced into an extra retraction or a late retraction at the time of the violation, so the violator is beaten by each stalwart, Ockham solution.

Page 124: Simplicity Truth Zen

Ockham Efficiency Theorem

Let M be a solution. The following are equivalent:

M is henceforth Ockham and stalwart;

M is henceforth efficient; M is henceforth never beaten.

Page 125: Simplicity Truth Zen

Some Applications Polynomial laws Conservation laws Causal networks

Page 126: Simplicity Truth Zen

Concrete RecommendationsExtra retraction by PC

algorithm

Page 127: Simplicity Truth Zen

Generalizations Generalized Simplicity Bayesian retractions Mixed Strategies

Times of retractions in chance Expected retraction times

Drop the no short paths assumption…

Retractions weighted by content…

Statistical inference…

Page 128: Simplicity Truth Zen

Conclusions Ockham’s razor is necessary for

staying on the straightest path to the true theory but does not point at the true theory.

A priori justification---no evasions or circles are required.

Analogous to computational complexity theory.

Page 129: Simplicity Truth Zen

IV. Simplicity

Page 130: Simplicity Truth Zen

ApproachEmpirical complexity reflects

nested problems of induction posed by the problem.

Hence, simplicity is question-relative but topologically invariant.

Page 131: Simplicity Truth Zen

Empirical Problems

T1 T2 T3

Topological space (W, T) Countable partition Q of W into

alternative theories.

W

Page 132: Simplicity Truth Zen

Why Topology? Topology is the pure mathematics of

(ideal) verifiability.

Proof: The tautology and contradiction are

verifiable; If A, B are verifiable, A & B is verifiable; If A1, A2, … are verifiable, A1 v A2 v… is

verifiable.

Page 133: Simplicity Truth Zen

Simplicity Concepts

c = 0

c = 1

Page 134: Simplicity Truth Zen

Simplicity Concepts

c = 0

c = 1

c = 2

Page 135: Simplicity Truth Zen

Simplicity Concepts

c = 0

c = 1

c = 2

Page 136: Simplicity Truth Zen

Simplicity Concepts A simplicity concept (without short

paths) for (W, T, Q) is just an assignment c of natural numbers to worlds such that:

1. The condition c < n is closed;2. Each answer is clopen given that c =

n;3. the condition A & (c = n) is in the

boundary of the condition not-A & (c = n + 1) , for each answer A and non-terminal complexity n.

Page 137: Simplicity Truth Zen

Virtues Can still prove the Ockham efficiency

theorem. Grounds simplicity in the logic of the

question. Topologically invariant---grue-proof

relative to question. Generalizes dimension---works for

continuous and discrete spaces.

Page 138: Simplicity Truth Zen

AGM Belief Revision No induction without refutation. After refutation, almost anything

goes. Greedy---leaps to new theory

immediately. No concern with truth-

conduciveness. Not much like science or even

standard Bayesians

Page 139: Simplicity Truth Zen

Stalwart Ockham Method Can delay expansion to new

Ockham theory after a surprise, as a Bayesian would.

Converges to truth with optimal efficiency.

Scientifically natural. More like Bayesian with standard

prior.

Page 140: Simplicity Truth Zen

V. Bayesian Efficiency

Page 141: Simplicity Truth Zen

Anti-Simplicity Bayesian

T1

T2

T3

Page 142: Simplicity Truth Zen

Simplicity-biased Bayesian

T1

T2

T3

Page 143: Simplicity Truth Zen

Greedy Ockham

T1

T2

T3

Page 144: Simplicity Truth Zen

Focus on Retractions Euclidean distance measures

retractions + expansions. Charge for retractions only.

Page 145: Simplicity Truth Zen

Not Entropy Increase

T1

T2

T3

Retractions can’tbe increase in ascalar field.

0

Page 146: Simplicity Truth Zen

Not Kullback Leibler Divergence

T1

T2

T3

Still charges forexpansions.

> 0

Page 147: Simplicity Truth Zen

Classical Logic

pq

r

p v q q v rp v r

T

p v q v r

Page 148: Simplicity Truth Zen

Logical Retractions

1

1

Page 149: Simplicity Truth Zen

Code Disjunctions

(1,0,0) (0,0,1)(0,1,0)

(0,0,0)

(0,1,1)(1,1,0) (1,0,1)

(1,1,1)

Page 150: Simplicity Truth Zen

Fuzzy Disjunctions

p v q v r = (1, .8, .2). Valuation in assignment (0, 1, 0) is

just the inner product: (0, 1, 0) . (1, .8, .2) = .8.

Entailment: p entails q iff b . q > b . p, for all Boolean b.

Page 151: Simplicity Truth Zen

Geologic on Euclidean Cube

(1, .8, .2)

Page 152: Simplicity Truth Zen

Geological Retractions

(.5, 1, 0) (0, 1, .5).5 0

Page 153: Simplicity Truth Zen

Probability Lives in Geologic

p(q) = p . qp

q

Page 154: Simplicity Truth Zen

Classical Principle of Indifference= perspective projection of corners

Page 155: Simplicity Truth Zen

General Principle of Indifference= perspective projection in general

f

Page 156: Simplicity Truth Zen

Bayesian Retractions= shadow of geological retractions

f

Page 157: Simplicity Truth Zen

Bayesian Retractions

0

1

1

0

Page 158: Simplicity Truth Zen

“Retraction Cones”

Retraction-free

Pure retractionMixedMixed

Page 159: Simplicity Truth Zen

Efficient Bayesian

0

0

<1

0<1

0