computational cognitive science - mit …...the brain as a generative modeling engine kenneth craik...
TRANSCRIPT
![Page 1: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/1.jpg)
Computational cognitive science
Generative models probabilistic
programs and common sense
Josh Tenenbaum MIT
Brains Minds and Machines Summer School 2015
11
Two notions of intelligence Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
22
Two notions of intelligence Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
Both notions have roles to play but here Irsquoll emphasize explanation because it is at the heart of human intelligence and much of current AI machine learning computational neuroscience is so focused on classification
(Why Building machines that explain and understand is harder than building machines that merely recognize and classify Classification is easier to map to neural networks and neural circuits)
But not only are both probably essential they can interact in powerful probably essential ways Wersquoll talk about how deep neural networks can help model-based methods work more quickly efficiently ndash or how model-based methods can help model-free methods become richer and more flexible
33
Two notions of intelligence Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
bull Compact unifying nonarbitrary rdquohard to varyrdquo bull Generative Output is the world not how we should perform a task bull Causal actionable for an endless range of tasks via planning bull Compositional flexible extensible
Phenomena the motion of objects in the solar system
Contrast Keplerrsquos laws and Newtonrsquos lawshellip copy Wikimedia User Theresa Knott License CC BY-SA This content is
excluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-use
44
bull Keplerrsquos laws
bull Newtonrsquos laws
Law of gravitational force
Courtesy of Wikimedia user Dennis Nilsson License CC BY
Courtesy of Wikimedia user Hankwang License CC BY
55
Two notions of intelligence
Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
bull Compact unifying nonarbitrary rdquohard to varyrdquo bull Generative Output is the world not how we should perform a task bull Causal actionable for an endless range of tasks via planning bull Compositional flexible extensible
Newton but not Kepler explainshellip bull Not just the orbits of planets but other solar-
system objects bull Not just the motion of planets but also the
apple I drop right here on Earth bull Why some things orbit other things but not
others bull How you could get a man to the moon and
back again bull How you could build a rocket or solar sail or
sling shot to escape Earthrsquos gravity
copy Wikimedia User Theresa Knott License CC BY-SA This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-use
6
The brain as a generative modeling engine
Kenneth Craik (1914-1945)
The Nature of Explanation (1943) One of the most fundamental properties of thought is its power of
predicting eventshellip It enables us for instance to design bridges with a sufficient factor of safety instead of building them haphazard and waiting to see whether they collapsehellip If the organism carries a lsquosmall-scale modelrsquo of external reality and of its own possible actions within its head it is able to try out various alternatives conclude which is the best of them react to future situations before they arise utilize the knowledge of past events in dealing with the present and future and in every way to react in a much fuller safer and more competent manner to the emergencies which face it Most of the greatest advances of modern technology have been instruments which extended the scope of our sense-organs our brains or our limbs Such are telescopes and microscopes wireless calculating machines typewriters motor cars ships and aeroplanes Is it not possible therefore that our brains themselves utilize comparable mechanisms to achieve the same ends and that these mechanisms can parallel phenomena in the external world as a calculating machine can parallel the development of strains in a bridge 7
Portrait photo removed due
to copyright restrictions
The big question
How does the mind get so much out of so little
Our minds build rich models of the world and make strong generalizations from input data that is sparse noisy and ambiguous ndash in many ways far too limited to support the inferences we make
How do we do it
88
The big question
How does the mind get so much out of so little so quickly so flexibly on such little energy
99
Visual scene perception
Buthellip look around you
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
10
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 2: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/2.jpg)
Two notions of intelligence Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
22
Two notions of intelligence Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
Both notions have roles to play but here Irsquoll emphasize explanation because it is at the heart of human intelligence and much of current AI machine learning computational neuroscience is so focused on classification
(Why Building machines that explain and understand is harder than building machines that merely recognize and classify Classification is easier to map to neural networks and neural circuits)
But not only are both probably essential they can interact in powerful probably essential ways Wersquoll talk about how deep neural networks can help model-based methods work more quickly efficiently ndash or how model-based methods can help model-free methods become richer and more flexible
33
Two notions of intelligence Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
bull Compact unifying nonarbitrary rdquohard to varyrdquo bull Generative Output is the world not how we should perform a task bull Causal actionable for an endless range of tasks via planning bull Compositional flexible extensible
Phenomena the motion of objects in the solar system
Contrast Keplerrsquos laws and Newtonrsquos lawshellip copy Wikimedia User Theresa Knott License CC BY-SA This content is
excluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-use
44
bull Keplerrsquos laws
bull Newtonrsquos laws
Law of gravitational force
Courtesy of Wikimedia user Dennis Nilsson License CC BY
Courtesy of Wikimedia user Hankwang License CC BY
55
Two notions of intelligence
Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
bull Compact unifying nonarbitrary rdquohard to varyrdquo bull Generative Output is the world not how we should perform a task bull Causal actionable for an endless range of tasks via planning bull Compositional flexible extensible
Newton but not Kepler explainshellip bull Not just the orbits of planets but other solar-
system objects bull Not just the motion of planets but also the
apple I drop right here on Earth bull Why some things orbit other things but not
others bull How you could get a man to the moon and
back again bull How you could build a rocket or solar sail or
sling shot to escape Earthrsquos gravity
copy Wikimedia User Theresa Knott License CC BY-SA This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-use
6
The brain as a generative modeling engine
Kenneth Craik (1914-1945)
The Nature of Explanation (1943) One of the most fundamental properties of thought is its power of
predicting eventshellip It enables us for instance to design bridges with a sufficient factor of safety instead of building them haphazard and waiting to see whether they collapsehellip If the organism carries a lsquosmall-scale modelrsquo of external reality and of its own possible actions within its head it is able to try out various alternatives conclude which is the best of them react to future situations before they arise utilize the knowledge of past events in dealing with the present and future and in every way to react in a much fuller safer and more competent manner to the emergencies which face it Most of the greatest advances of modern technology have been instruments which extended the scope of our sense-organs our brains or our limbs Such are telescopes and microscopes wireless calculating machines typewriters motor cars ships and aeroplanes Is it not possible therefore that our brains themselves utilize comparable mechanisms to achieve the same ends and that these mechanisms can parallel phenomena in the external world as a calculating machine can parallel the development of strains in a bridge 7
Portrait photo removed due
to copyright restrictions
The big question
How does the mind get so much out of so little
Our minds build rich models of the world and make strong generalizations from input data that is sparse noisy and ambiguous ndash in many ways far too limited to support the inferences we make
How do we do it
88
The big question
How does the mind get so much out of so little so quickly so flexibly on such little energy
99
Visual scene perception
Buthellip look around you
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
10
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 3: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/3.jpg)
Two notions of intelligence Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
Both notions have roles to play but here Irsquoll emphasize explanation because it is at the heart of human intelligence and much of current AI machine learning computational neuroscience is so focused on classification
(Why Building machines that explain and understand is harder than building machines that merely recognize and classify Classification is easier to map to neural networks and neural circuits)
But not only are both probably essential they can interact in powerful probably essential ways Wersquoll talk about how deep neural networks can help model-based methods work more quickly efficiently ndash or how model-based methods can help model-free methods become richer and more flexible
33
Two notions of intelligence Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
bull Compact unifying nonarbitrary rdquohard to varyrdquo bull Generative Output is the world not how we should perform a task bull Causal actionable for an endless range of tasks via planning bull Compositional flexible extensible
Phenomena the motion of objects in the solar system
Contrast Keplerrsquos laws and Newtonrsquos lawshellip copy Wikimedia User Theresa Knott License CC BY-SA This content is
excluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-use
44
bull Keplerrsquos laws
bull Newtonrsquos laws
Law of gravitational force
Courtesy of Wikimedia user Dennis Nilsson License CC BY
Courtesy of Wikimedia user Hankwang License CC BY
55
Two notions of intelligence
Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
bull Compact unifying nonarbitrary rdquohard to varyrdquo bull Generative Output is the world not how we should perform a task bull Causal actionable for an endless range of tasks via planning bull Compositional flexible extensible
Newton but not Kepler explainshellip bull Not just the orbits of planets but other solar-
system objects bull Not just the motion of planets but also the
apple I drop right here on Earth bull Why some things orbit other things but not
others bull How you could get a man to the moon and
back again bull How you could build a rocket or solar sail or
sling shot to escape Earthrsquos gravity
copy Wikimedia User Theresa Knott License CC BY-SA This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-use
6
The brain as a generative modeling engine
Kenneth Craik (1914-1945)
The Nature of Explanation (1943) One of the most fundamental properties of thought is its power of
predicting eventshellip It enables us for instance to design bridges with a sufficient factor of safety instead of building them haphazard and waiting to see whether they collapsehellip If the organism carries a lsquosmall-scale modelrsquo of external reality and of its own possible actions within its head it is able to try out various alternatives conclude which is the best of them react to future situations before they arise utilize the knowledge of past events in dealing with the present and future and in every way to react in a much fuller safer and more competent manner to the emergencies which face it Most of the greatest advances of modern technology have been instruments which extended the scope of our sense-organs our brains or our limbs Such are telescopes and microscopes wireless calculating machines typewriters motor cars ships and aeroplanes Is it not possible therefore that our brains themselves utilize comparable mechanisms to achieve the same ends and that these mechanisms can parallel phenomena in the external world as a calculating machine can parallel the development of strains in a bridge 7
Portrait photo removed due
to copyright restrictions
The big question
How does the mind get so much out of so little
Our minds build rich models of the world and make strong generalizations from input data that is sparse noisy and ambiguous ndash in many ways far too limited to support the inferences we make
How do we do it
88
The big question
How does the mind get so much out of so little so quickly so flexibly on such little energy
99
Visual scene perception
Buthellip look around you
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
10
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 4: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/4.jpg)
Two notions of intelligence Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
bull Compact unifying nonarbitrary rdquohard to varyrdquo bull Generative Output is the world not how we should perform a task bull Causal actionable for an endless range of tasks via planning bull Compositional flexible extensible
Phenomena the motion of objects in the solar system
Contrast Keplerrsquos laws and Newtonrsquos lawshellip copy Wikimedia User Theresa Knott License CC BY-SA This content is
excluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-use
44
bull Keplerrsquos laws
bull Newtonrsquos laws
Law of gravitational force
Courtesy of Wikimedia user Dennis Nilsson License CC BY
Courtesy of Wikimedia user Hankwang License CC BY
55
Two notions of intelligence
Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
bull Compact unifying nonarbitrary rdquohard to varyrdquo bull Generative Output is the world not how we should perform a task bull Causal actionable for an endless range of tasks via planning bull Compositional flexible extensible
Newton but not Kepler explainshellip bull Not just the orbits of planets but other solar-
system objects bull Not just the motion of planets but also the
apple I drop right here on Earth bull Why some things orbit other things but not
others bull How you could get a man to the moon and
back again bull How you could build a rocket or solar sail or
sling shot to escape Earthrsquos gravity
copy Wikimedia User Theresa Knott License CC BY-SA This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-use
6
The brain as a generative modeling engine
Kenneth Craik (1914-1945)
The Nature of Explanation (1943) One of the most fundamental properties of thought is its power of
predicting eventshellip It enables us for instance to design bridges with a sufficient factor of safety instead of building them haphazard and waiting to see whether they collapsehellip If the organism carries a lsquosmall-scale modelrsquo of external reality and of its own possible actions within its head it is able to try out various alternatives conclude which is the best of them react to future situations before they arise utilize the knowledge of past events in dealing with the present and future and in every way to react in a much fuller safer and more competent manner to the emergencies which face it Most of the greatest advances of modern technology have been instruments which extended the scope of our sense-organs our brains or our limbs Such are telescopes and microscopes wireless calculating machines typewriters motor cars ships and aeroplanes Is it not possible therefore that our brains themselves utilize comparable mechanisms to achieve the same ends and that these mechanisms can parallel phenomena in the external world as a calculating machine can parallel the development of strains in a bridge 7
Portrait photo removed due
to copyright restrictions
The big question
How does the mind get so much out of so little
Our minds build rich models of the world and make strong generalizations from input data that is sparse noisy and ambiguous ndash in many ways far too limited to support the inferences we make
How do we do it
88
The big question
How does the mind get so much out of so little so quickly so flexibly on such little energy
99
Visual scene perception
Buthellip look around you
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
10
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 5: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/5.jpg)
bull Keplerrsquos laws
bull Newtonrsquos laws
Law of gravitational force
Courtesy of Wikimedia user Dennis Nilsson License CC BY
Courtesy of Wikimedia user Hankwang License CC BY
55
Two notions of intelligence
Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
bull Compact unifying nonarbitrary rdquohard to varyrdquo bull Generative Output is the world not how we should perform a task bull Causal actionable for an endless range of tasks via planning bull Compositional flexible extensible
Newton but not Kepler explainshellip bull Not just the orbits of planets but other solar-
system objects bull Not just the motion of planets but also the
apple I drop right here on Earth bull Why some things orbit other things but not
others bull How you could get a man to the moon and
back again bull How you could build a rocket or solar sail or
sling shot to escape Earthrsquos gravity
copy Wikimedia User Theresa Knott License CC BY-SA This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-use
6
The brain as a generative modeling engine
Kenneth Craik (1914-1945)
The Nature of Explanation (1943) One of the most fundamental properties of thought is its power of
predicting eventshellip It enables us for instance to design bridges with a sufficient factor of safety instead of building them haphazard and waiting to see whether they collapsehellip If the organism carries a lsquosmall-scale modelrsquo of external reality and of its own possible actions within its head it is able to try out various alternatives conclude which is the best of them react to future situations before they arise utilize the knowledge of past events in dealing with the present and future and in every way to react in a much fuller safer and more competent manner to the emergencies which face it Most of the greatest advances of modern technology have been instruments which extended the scope of our sense-organs our brains or our limbs Such are telescopes and microscopes wireless calculating machines typewriters motor cars ships and aeroplanes Is it not possible therefore that our brains themselves utilize comparable mechanisms to achieve the same ends and that these mechanisms can parallel phenomena in the external world as a calculating machine can parallel the development of strains in a bridge 7
Portrait photo removed due
to copyright restrictions
The big question
How does the mind get so much out of so little
Our minds build rich models of the world and make strong generalizations from input data that is sparse noisy and ambiguous ndash in many ways far too limited to support the inferences we make
How do we do it
88
The big question
How does the mind get so much out of so little so quickly so flexibly on such little energy
99
Visual scene perception
Buthellip look around you
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
10
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 6: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/6.jpg)
Two notions of intelligence
Classifyingrecognizingpredicting data vs
Explainingunderstandingmodeling the world
bull Whatrsquos the difference between classification and explanation bull What makes a good explanation
bull Compact unifying nonarbitrary rdquohard to varyrdquo bull Generative Output is the world not how we should perform a task bull Causal actionable for an endless range of tasks via planning bull Compositional flexible extensible
Newton but not Kepler explainshellip bull Not just the orbits of planets but other solar-
system objects bull Not just the motion of planets but also the
apple I drop right here on Earth bull Why some things orbit other things but not
others bull How you could get a man to the moon and
back again bull How you could build a rocket or solar sail or
sling shot to escape Earthrsquos gravity
copy Wikimedia User Theresa Knott License CC BY-SA This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-use
6
The brain as a generative modeling engine
Kenneth Craik (1914-1945)
The Nature of Explanation (1943) One of the most fundamental properties of thought is its power of
predicting eventshellip It enables us for instance to design bridges with a sufficient factor of safety instead of building them haphazard and waiting to see whether they collapsehellip If the organism carries a lsquosmall-scale modelrsquo of external reality and of its own possible actions within its head it is able to try out various alternatives conclude which is the best of them react to future situations before they arise utilize the knowledge of past events in dealing with the present and future and in every way to react in a much fuller safer and more competent manner to the emergencies which face it Most of the greatest advances of modern technology have been instruments which extended the scope of our sense-organs our brains or our limbs Such are telescopes and microscopes wireless calculating machines typewriters motor cars ships and aeroplanes Is it not possible therefore that our brains themselves utilize comparable mechanisms to achieve the same ends and that these mechanisms can parallel phenomena in the external world as a calculating machine can parallel the development of strains in a bridge 7
Portrait photo removed due
to copyright restrictions
The big question
How does the mind get so much out of so little
Our minds build rich models of the world and make strong generalizations from input data that is sparse noisy and ambiguous ndash in many ways far too limited to support the inferences we make
How do we do it
88
The big question
How does the mind get so much out of so little so quickly so flexibly on such little energy
99
Visual scene perception
Buthellip look around you
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
10
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 7: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/7.jpg)
The brain as a generative modeling engine
Kenneth Craik (1914-1945)
The Nature of Explanation (1943) One of the most fundamental properties of thought is its power of
predicting eventshellip It enables us for instance to design bridges with a sufficient factor of safety instead of building them haphazard and waiting to see whether they collapsehellip If the organism carries a lsquosmall-scale modelrsquo of external reality and of its own possible actions within its head it is able to try out various alternatives conclude which is the best of them react to future situations before they arise utilize the knowledge of past events in dealing with the present and future and in every way to react in a much fuller safer and more competent manner to the emergencies which face it Most of the greatest advances of modern technology have been instruments which extended the scope of our sense-organs our brains or our limbs Such are telescopes and microscopes wireless calculating machines typewriters motor cars ships and aeroplanes Is it not possible therefore that our brains themselves utilize comparable mechanisms to achieve the same ends and that these mechanisms can parallel phenomena in the external world as a calculating machine can parallel the development of strains in a bridge 7
Portrait photo removed due
to copyright restrictions
The big question
How does the mind get so much out of so little
Our minds build rich models of the world and make strong generalizations from input data that is sparse noisy and ambiguous ndash in many ways far too limited to support the inferences we make
How do we do it
88
The big question
How does the mind get so much out of so little so quickly so flexibly on such little energy
99
Visual scene perception
Buthellip look around you
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
10
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 8: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/8.jpg)
The big question
How does the mind get so much out of so little
Our minds build rich models of the world and make strong generalizations from input data that is sparse noisy and ambiguous ndash in many ways far too limited to support the inferences we make
How do we do it
88
The big question
How does the mind get so much out of so little so quickly so flexibly on such little energy
99
Visual scene perception
Buthellip look around you
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
10
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 9: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/9.jpg)
The big question
How does the mind get so much out of so little so quickly so flexibly on such little energy
99
Visual scene perception
Buthellip look around you
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
10
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 10: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/10.jpg)
Visual scene perception
Buthellip look around you
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
10
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 11: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/11.jpg)
Where are the people
Photos copy source unknown Al rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1111
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 12: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/12.jpg)
Where are the people
1212
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 13: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/13.jpg)
hellip books hellip glasses
1313
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 14: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/14.jpg)
Learning and generalizing object concepts
ldquotufardquo
1414
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 15: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/15.jpg)
Whatrsquos this
copy Omega Pacific All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1515
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 16: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/16.jpg)
Photo of rock climbing equipment removed due to copyright restrictions See video
1616
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 17: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/17.jpg)
Photo of rock climbing equipment removed due to copyright restrictions See video
1717
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 18: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/18.jpg)
Photos copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
1818
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 19: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/19.jpg)
Concept learning is not simply classification
1919
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 20: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/20.jpg)
Understanding events with common-
sense theories
(Southgate and Csibra) (Heider and Simmel) copy unknown attributed to work of V Southgate and G Csibra All rightsreserved This content is excluded from our Creative Commons license
For more information see httpsocwmiteduhelpfaq-fair-use
Intuitive physics objects forces and massesIntuitive psychology beliefs and desiresIntuitive sociology us and them Intuitive morality good and bad
2020
copy University of Illinois Press All rights reserved This content isexcluded from our Creative Commons license For more informationsee httpsocwmiteduhelpfaq-fair-useSource Heider F amp Simmel M (1944) An experimental study inapparent behaviorldquo The American Journal of Psychology 57 243-259
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 21: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/21.jpg)
Learning to play video games the way people do
Deep Mind 2015 Reprinted by permission from Macmillan Publishers Ltd NatureSource Mnih V et al Human-level control through deep reinforcementlearning Nature 518 no 7540 (2015) 529-533 copy 2015 21
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 22: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/22.jpg)
Learning to play video games the way people really do
Courtesy of sean dreilinger on FlickrLicense CC BY-NC-SA
copy Activision All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
2222
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 23: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/23.jpg)
Learning to play video games the way people really do
(Stadie Levine Abbeel 2015)
Random
copy Bradly Stadie All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Source Stadie Bradly C Sergey Levine and Pieter Abbeel Incentivizing exploration in
reinforcement learning with deep predictive models arXiv preprint arXiv150700814 (2015)
2323
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 24: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/24.jpg)
The big question
How does the mind get so much out of so little ndashRecovering the entire world around you from a glance in a
flash ndashLearning a generalizable concept from just one examplendashDiscovering causal relations from just a single observed event ndashSeeing forces and seeing inside other minds from just the
motion of a few two-dimensional shapes ndashLearning to play games solve problems and act in a whole
new world ndash all in under one minute ndashUnderstanding the words yoursquore reading now
The goal A computational framework for understanding how people make these inferences and how they can be successful expressed in engineering terms
2424
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 25: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/25.jpg)
The problems of induction
Abstract knowledge (Constraints Inductive bias Priors)
1 How does abstract knowledge guide learning and inference from sparse data
2 What form does abstract knowledge take across different domains and tasks
3 How is abstract knowledge itself constructed from some combination of innate specifications and experience
hellip
2525
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 26: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/26.jpg)
The ldquoGenerative modelsrdquo approach1 How does abstract knowledge guide learning and
inference from sparse data Bayesian inference in probabilistic generative models
Hhii
i
hPhdPhPhdPdhP
)()|()()|()|(
2 What form does that knowledge take across different domains and tasks
Probabilities defined richly structured symbolic representations spaces graphs grammars logical predicates schemas Probabilistic Programs
3 How is that knowledge itself constructed Hierarchical models with inference at multiple levels
Learning models as probabilistic inference ldquolearning to learnrdquo transfer learning learning representations and learning inductive biases not fundamentally different
2626
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 27: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/27.jpg)
The approach (contrsquod)4 How can learning and inference proceed efficiently and
accurately even with very complex hypothesis spaces Sampling-based algorithms for approximate inference eg
MCMC sequential Monte Carlo (ldquoparticle filteringrdquo) importance sampling Cost-sensitive sampling (ldquoOne and donerdquo) Fast initialization with bottom-up recognition models (ldquoNeural networksrdquo)
5 How can probabilistic inferences be used to drive action Utility-based frameworks for decision and planning under
uncertainty and risk such as Bayesian decision theory or Markov decision processes (MDPs)
6 How could these computations be implemented in neural hardware or massively parallel computing machines
Probabilistic interpretations of cortical circuitry and neural population codes stochastic digital circuits
2727
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 28: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/28.jpg)
1990s-present Cognition as probabilistic inference
Visual perception [Yuille Weiss Simoncelli Adelson Richards Freeman Feldman Kersten Knill Maloney Olshausen Jacobs Pouget ]
Language processing and acquisition [Brent de Marken Niyogi Klein Manning Jurafsky Chater Keller Levy Hale Johnson Griffiths Perfors Tenenbaum Frank Piantadosi OrsquoDonnell Goodmanhellip]
Motor learning and motor control [Ghahramani Jordan Wolpert Koerding Kawato Doya Todorov Shadmehr Maloney hellip]
Reinforcement learning [Dayan Daw Niv Frank Gershman Gureckis hellip]
Memory [Anderson Schooler Shiffrin Steyvers Griffiths McClelland Gershman hellip]
Attention [Mozer Huber Torralba Oliva Geisler Yu Itti Baldi Vul hellip]
Categorization and concept learning [Anderson Nosfosky Rehder Navarro Griffiths Feldman Tenenbaum Rosseel Goodman Kemp Mansinghka hellip]
Reasoning [Chater Oaksford Sloman McKenzie Heit Tenenbaum Kemp Goodmanhellip]
Causal inference and learning [Waldmann Sloman Steyvers Griffiths Tenenbaum Yuille Lu Holyoak Lagnado hellip ]
2828
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 29: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/29.jpg)
Basic cognitive capacities as intuitive probabilistic inference
bull Similarity (Tenenbaum amp Griffiths BBS 2001 Kemp amp Tenenbaum Cog Sci 2005)
bull Representativeness and evidential support (Tenenbaum amp Griffiths Cog Sci 2001)
bull Causal judgment (Steyvers et al 2003 Griffiths amp Tenenbaum Cog Psych 2005)
bull Coincidences and causal discovery (Griffiths amp Tenenbaum Cog Sci 2001 Cognition 2007 Psych Review in press)
bull Diagnostic inference (Krynski amp Tenenbaum JEP General 2007)
bull Predicting the future (Griffiths amp Tenenbaum Psych Science 2006)
2929
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 30: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/30.jpg)
Causes and coincidences
Mere randomness or a hidden cause
(Griffiths amp Tenenbaum Cognition 2007 Psych Review 2009)
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716
3030
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 31: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/31.jpg)
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
C
x x xx xx x xx x
uniform uniform +
regularity
3131
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 32: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/32.jpg)
C
x x x x x
uniform
+ regularity
Bayesian measure of evidence)|(
)|(lograndomdPlatentdP
Random Latent common cause
x x xx x
uniform
3232
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 33: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/33.jpg)
Cancer clusters Judging the probability of a hidden environmental cause
Courtesy of American Psychological Association Used with permissionSource Griffiths T L and J B Tenenbaum Theory-Based CausalInduction Psychological Review 116 no 4 (2009) 661-716 3333
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 34: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/34.jpg)
3434
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 35: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/35.jpg)
Everyday prediction problems (Griffiths amp Tenenbaum Psych Science 2006)
bull You read about a movie that has made $60 million to date How much money will it make in total
bull You see that something has been baking in the oven for 34 minutes How long until itrsquos ready
bull You meet someone who is 78 years old How long will they live
bull Your friend quotes to you from line 17 of his favorite poem How long is the poem
bull You meet a US congressman who has served for 11 years How long will he serve in total
bull You encounter a phenomenon or event with an unknown extent or duration ttotal at a random time or value of t ltttotal What is the total extent or duration ttotal
3535
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 36: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/36.jpg)
Priors P(ttotal) based on empirically measured durations or magnitudes for many real-world events in each class
Median human judgments of the total duration or magnitude ttotal of events in each class given one random observation at a duration or magnitude t versus Bayesian predictions (median of P(ttotal|t))
See Griffiths and Tenenbaum ldquoOptimal Predictions in Everyday Cognitionrdquo Psychological Science 17 no 9 (2006) 3636
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 37: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/37.jpg)
Learning words for objects
ldquotufardquo
3737
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 38: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/38.jpg)
Word learning as Bayesian inference (Xu amp Tenenbaum Psych Review 2007)
What is the right priorWhat is the right hypothesis spaceHow do learners acquire that background knowledge
3838
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 39: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/39.jpg)
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
3939
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 40: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/40.jpg)
Word learning as Bayesian inference
(Xu amp Tenenbaum Psych Review 2007)
4040
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 41: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/41.jpg)
Property induction
Gorillas have T9 hormones Seals have T9 hormones
Horses have T9 hormones
ldquoSimilarityrdquo ldquoTypicalityrdquo ldquoDiversityrdquo
Gorillas have T9 hormones Seals have T9 hormones
Anteaters have T9 hormones
Gorillas have T9 hormones Chimps have T9 hormones Monkeys have T9 hormones Baboons have T9 hormones
Horses have T9 hormones
4141
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 42: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/42.jpg)
Experiments on property induction (Osherson Smith Wilkie Lopez Shafir 1990)
bull 20 subjects rated the strength of 45 arguments X1 have property P (eg Cows have T4 hormones)X2 have property PX3 have property P
All mammals have property P [General argument]
bull 20 subjects rated the strength of 36 arguments X1 have property PX2 have property P
Horses have property P [Specific argument]4242
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 43: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/43.jpg)
Feature rating data (Osherson and Wilkie)
bull People were given 48 animals 85 features and asked to rate whether each animal had each feature
Eg elephant
gray hairless toughskinbig bulbous longlegtail chewteeth tuskssmelly walks slowstrong musclersquo quadrapedalinactive vegetation grazeroldworld bush jungleground timid smartgrouplsquo hellip
4343
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 44: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/44.jpg)
The computational problem
Horses have T9 hormones Rhinos have T9 hormones
Cows have T9 hormones
Horse
Cow
Chimp
Gorilla
Mouse
Squirrel
Dolphin
Seal
Rhino
Elephant
Features New property
Cf semi-supervised learning sparse matrix completion
4444
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 45: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/45.jpg)
Hierarchical Bayesian Framework (Kemp amp Tenenbaum Psych Review 2009)
P(form)
F form Tree with species at leaf nodes
mouse P(structure | form) squirrel
chimp S structure gorilla
P(data | structure)
F1
F2
F3
F4
Ha
s T
9
ho
rmo
ne
s
mouse squirrel hellip D data chimp
gorilla 4545
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 46: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/46.jpg)
A graph-based prior (cf diffusion model of genetic variation)
Let dij = length of the edge between objects i and j (= if i and j are not connected in S)
fi = value of the feature for object i
ij
T
ij
ji ffd
ffSfp
21)(
41exp)|(
2
A Gaussian prior ~ N(0 S) with (Zhu Lafferty amp Ghahramani 2003)
)(~ 1 SS
4646
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 47: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/47.jpg)
1
2
3
4
5
6
7
8
9
10
Structure S
Species
Species
Species
Species
Species
Species
Species
Species
Species
Species
Features f New property 4747
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 48: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/48.jpg)
Results
Cows have property P Elephants have property P
Horses have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Dolphins have property P Seals have property P
Horses have property P (Osherson et al Smith et al)
4848
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 49: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/49.jpg)
Results Gorillas have property P Mice have property P Seals have property P
All mammals have property P
Dat
a
Model Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models of
Inductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Cows have property P Elephants have property P Horses have property P
All mammals have property P4949
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 50: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/50.jpg)
Hierarchical Bayesian Framework
F form Low-dimensional space of species
gorilla chimp
mouse S structure squirrel
F1
F2
F3
F4
Has T
9
ho
rmo
ne
s
mouse
squirrel D data chimp
gorilla
hellip
5050
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 51: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/51.jpg)
[cf Lawrence Smola amp Kondor]
S ||||1exp
2 2 jiij xx
5151
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 52: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/52.jpg)
Cows have property P Elephants have property P
Horses have property P
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Gorillas have property PMice have property PSeals have property P
All mammals have property P 5252
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 53: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/53.jpg)
Reasoning about spatially varying properties
Geographic inference task eg ldquoGiven that a certain kind of native American artifact has been found in sites near city X how likely is the same artifact to be found near city Yrdquo
Tree
2D
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58 53
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 54: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/54.jpg)
Do people learn explicit structures of different forms
A neural-network alternative (Rogers and McClelland 2004 Saxe McClelland Ganguli 2013)
Species
Features
Emergent structure clustering on hiddenunit activation vectors
Tree diagrams copy source unknown All rights reserved This content is excluded from ourCreative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
5454
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 55: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/55.jpg)
The need for inductive bias
bull Learning from sparse data requires constraints or a prior on the hypothesis space
bull An analogy Learning a smooth probability density by local interpolation (kernel density estimation)
Assuming an appropriately structured form for density (eg Gaussian) leads to better generalization from sparse data
N = 5N = 55555
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 56: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/56.jpg)
Beyond similarity-based induction
bull Reasoning based on dimensional thresholds (Smith
Poodles can bite through wire
German shepherds can bite through wire
et al 1993) Dobermans can bite through wire
German shepherds can bite through wire
bull Reasoning based on causal relations (Medin Salmon carry E Spirus bacteria
et al 2004 Coley amp Shafto 2003)
Grizzly bears carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Salmon carry E Spirus bacteria
5656
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 57: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/57.jpg)
Different priors from different kinds of causes
Chimps have T9 hormones
Gorillas have T9 hormones
Poodles can bite through wire
Dobermans can bite through wire
Salmon carry E Spirus bacteria
Grizzly bears carry E Spirus bacteria
Taxonomic similarity
Jaw strength
Food web relations
5757
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 58: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/58.jpg)
Property type ldquohas T9 hormonesrdquo ldquocan bite through wirerdquo ldquocarry E Spirus bacteriardquo
Causal Structure taxonomic tree directed chain directed network
+ diffusion process + drift process + noisy transmission Class D
Class A
Class B
Class D Class A Class B Class A
Class F Class E Class C Class D Class C Class C Class E Class G Class E Class F Class B Class G
Class F Properties Class G
Class A Class B Class C
Class D Class E Class F Class G
5858
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 59: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/59.jpg)
Reasoning with linear-threshold properties 1D
+dr
ift
Tree
+di
ffusi
on
1D +
diffu
sion
Courtesy of American Psychological Association Used with permissionSource Kemp C and J B Tenenbaum Structured Statistical Models ofInductive Reasoning Psychological Review 116 no 1 (2009) 20-58
Blok et al Blok et al Smith et al Smith et al4 colleges 5 colleges Adapts to dark Thick skin
Elephant
Hippo
Camel Lion
Cat
eg ldquohas skin that is more resistant to penetration than most synthetic fibersrdquo
5959
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 60: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/60.jpg)
Reasoning with two property types
ldquoGiven that X has property P how likely is it that Y doesrdquo
Herring
Tuna Mako shark Sand shark
Dolphin Human
Kelp
Bio
logi
cal
prop
erty
D
isea
sepr
oper
ty
Tree Web
Sand shark
(Shafto Kemp Mako shark Human Herring Tuna KelpBonawitz Coley amp Tenenbaum)
Dolphin 6060
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 61: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/61.jpg)
ldquoCommon-sense reasoningrdquo as sparse matrix completion at the heart of classical associationism probabilistic or connectionist models of semantic cognition contemporary machine-learning approaches to building general AI systems
61
But itrsquos not going to work
61
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 62: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/62.jpg)
Never-Ending
Language
Learning
(NELL)
Lohr Steve ldquoAiming to Learn asWe Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
62
copy The New York Times All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
62
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 63: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/63.jpg)
Never-Ending
Language
Learning
(NELL)
Text excerpt removed due to copyright restrictions See Lohr Steve ldquoAiming to Learn as We Do a Machine Teaches Itselfrdquo The New York Times October 4 2010
6363
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 64: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/64.jpg)
Engineering common sense what and how
6464
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 65: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/65.jpg)
The roots of common sense
Images of children playing removed due to copyright restrictions Please see thevideo or httpprovidencechildrensmuseumblogspotcom201205loosen-uphtml
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Several other similar photos removed due to copyright restrictions See video
6565
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 66: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/66.jpg)
Engineering common sense what and how
What The ldquocommon sense corerdquo Human thought is structured around a basic understanding of
physical objects intentional agents and their interactions ndash intuitive physics (forces masseshellip) and psychology (desires beliefs planshellip) [Spelke Baillargeon Gergeley Csibra Carey Kanwisher Saxe Dehaene Tomasellohellip]
Develops early in infancy Shared to some extent with other species Enriched and extended massively in humans The targets of understanding visual scenes language and action planning
How can these internal models be realized computationally How can they be studied rigorously in behavior How are they instantiated in neural circuits How are they built through evolution development and learning
6666
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 67: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/67.jpg)
The development of object knowledge in infancy
httpwwwbbccomnewstechnology-19637175 httpswwwyoutubecomwatchv=0jaxzURLylc
copy Mike Zathureczky All rights reserved This content isexcluded from our Creative Commons license For moreinformation see httpsocwmiteduhelpfaq-fair-use
6767
copy BBC All rights reserved This content is excluded fromour Creative Commons license For more information seehttpsocwmiteduhelpfaq-fair-use
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 68: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/68.jpg)
The development of object
2-3 monthsknowledge in infancy
(Baillargeon Spelke et al)
3 months
5 months
65 months
12 months
68
4-5 months
8 months
68
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 69: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/69.jpg)
How knowledge grows Learning and abstraction as theory-building (or the ldquochild as scientistrdquo not ldquodata analystrdquo) Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning sociallearning [Carey Karmiloff-Smith Gopnik Schulz Feigenson hellip]
11 months
6969
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 70: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/70.jpg)
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
7070
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 71: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/71.jpg)
Bayesian networks Probabilities on graphs
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7171
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 72: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/72.jpg)
Bayesian networks Probabilities on graphs
Bayes nets (and probabilistic graphical models more generally) brought a potent combination to AI and many fields
1 General-purpose languages for representing the structure of the world
2 General-purpose algorithms for inference and decision under uncertainty
But theyrsquore not enough copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 7272
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 73: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/73.jpg)
Modeling the world with programs
73
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 74: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/74.jpg)
Model building as program learning
74
copy Source Unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
74
copy Wikimedia User Theresa Knott License CC BY-SAThis content is excluded from our Creative Commons license For more information see httpsocwmiteduhelpfaq-fair-use
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 75: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/75.jpg)
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
From httpsprobmodsorg 7575
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 76: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/76.jpg)
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7676
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 77: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/77.jpg)
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7777
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 78: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/78.jpg)
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7878
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 79: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/79.jpg)
Probabilistic programs Church probmodsorg
(Goodman Mansinghka Roy Bonawitz amp Tenenbaum 2008 Goodman amp Tenenbaum 2014)
7979
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 80: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/80.jpg)
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg] Representations the ldquogame engine in your headrdquo
(graphics engine physics engine planning engine)
copy source unknown All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
8080
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 81: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/81.jpg)
Probabilistic programs
Courtesy of National Academy of Sciences U S A Used with permission Beliefs Desires Source Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
physics World state (t) World state (t+1)
graphics Actions
planning
agent Image (t) Image (t+1)
Photo of young students in crosswalk with crossing guard physics removed due to copyright restrictions
World Agent state state
perception
8181
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 82: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/82.jpg)
Engineering common sense what and how
What The ldquocommon sense corerdquo How A modeling engine built on probabilistic programs
(Goodman Mansinghka Roy Freer hellip) [See probmodsorg]
Representations the ldquogame engine in your headrdquo(graphics engine physics engine planning engine)
Algorithms ldquoinference programsrdquoReally fast Bottom-up guesses based on cached experience
(Perception)
Fast Forward simulation (Prediction imagination top-down percepts)
Slower Sampling by reverse simulation (Thinking reasoning)
Slow (amp really slow amp really really slow) Stochastic search (Learning development evolution)
8282
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 83: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/83.jpg)
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
83
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
83
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 84: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/84.jpg)
Vision as inverse graphics
World state (t)
Prob approx rendering
Image (t)
84
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
84
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 85: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/85.jpg)
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings
Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
Time Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical sceneunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
8585
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 86: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/86.jpg)
Vision as inverse graphics
Markov Chain Monte Carlo (MCMC) Metropolis-Hastings Scene
Prob approx rendering
ImageLo
gp
rob
abili
ty
8686
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene Timeunderstanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013National Academy of Sciences USA
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 87: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/87.jpg)
Architecture (Kulkarni et al CVPR 2015)
Random samples hellip
Courtesy of Ilker Yildirim Used with permission
From Yildirim Kulkarni Friewald and Tenenbaum (2015) 8787
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 88: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/88.jpg)
Architecture
Summary statistic (Feature vector)
Observed
Inferred
(Kulkarni et al CVPR 2015)
8888
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 89: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/89.jpg)
Architecture
Observed
Inferred
Convolutional neural network
(Kulkarni et al CVPR 2015)
8989
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 90: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/90.jpg)
Generalizing across (Kulkarni et al CVPR 2015)
viewing conditions
Courtesy of Tejas Kulkarni Used with permission
9090
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 91: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/91.jpg)
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission
(Kulkarni et al CVPR 2015) 9191
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 92: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/92.jpg)
Human body pose estimation
Courtesy of Tejas Kulkarni Used with permission (Kulkarni et al CVPR 2015) 9292
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 93: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/93.jpg)
Inferring generic 3D shape
(Kulkarni et al CVPR 2015) Courtesy of Tejas Kulkarni Used with permission
9393
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 94: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/94.jpg)
Faster (and more brain-like) inference (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Learning to do inference a la Helmholtz machine (Hinton et al 1995) - Initialize inference with recognition
model (a deep neural network) - Trained in a self-supervised way
from fantasies of the generative model
Courtesy of Ilker Yildirim Used with permission9494
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 95: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/95.jpg)
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
Courtesy of Ilker Yildirim Used with permission
9595
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 96: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/96.jpg)
Psychophysics and neural data (Yildirim Kulkarni Freiwald Tenenbaum Cog Sci lsquo15 in prep)
(Freiwald and Tsao 2010)
Mon
key
Mod
el
Courtesy of Ilker Yildirim Used with permission
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 9696
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 97: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/97.jpg)
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
9797
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 98: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/98.jpg)
The intuitive physics engine (Battaglia Hamrick Tenenbaum PNAS 2013)
Prob approxNewton
World state (t) World state (t+1) helliphellip World state (t-1)
Prob approx rendering
Image (t-1) Image (t) Image (t+1)
= state uncertainty f = latent force magnitude
9898
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 99: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/99.jpg)
The intuitive physics engine
(Battaglia Hamrick Tenenbaum PNAS 2013)
Courtesy of National Academy of Sciences U S A Used with permissionSource Battaglia P W et al Simulation as an engine of physical scene understanding PNAS 110 no 45 (2013) 18327-18332 Copyright copy 2013
National Academy of Sciences USA
Ground truth physics
99
= 02f = 02
99
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 100: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/100.jpg)
The intuitive psychology engine
Beliefs Desires
Actions
planning
World state
Agent state
perception
physics
agent
Photos (1) from TV show The Office (2) young students in crosswalk with crossing guard removed due to copyright restrictions
100100
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 101: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/101.jpg)
Goal inference as World Agent
inverse planning state state(Baker Saxe Tenenbaum 2009)
Beliefs Desires
planning
perception
agent
)()()( aCsRsaU
1r = 098
Peo
ple
0 Model 0 05 1
Actions
R(s) large reward for achieving goal
C(a) small cost per step
Mod
el
Peo
ple
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 101101
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 102: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/102.jpg)
physics Joint inference of
World Agent beliefs and desires state state (Baker Saxe Tenenbaum Cog Sci 2011
in prep)
Desires inference Beliefs
planning
perception
Actionsagent
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use 102102
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 103: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/103.jpg)
inference
Joint inference of beliefs and desires (Baker Saxe Tenenbaum Cog Sci 2011 in prep)
Desires Initial Beliefs
copy Christopher Baker All rights reserved This content is excluded from our CreativeCommons license For more information see httpsocwmiteduhelpfaq-fair-use
103103
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 104: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/104.jpg)
104104
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 105: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/105.jpg)
If you bump the tablehellip
Model simulates table ldquobumpsrdquo 100 100 integrating over a range of force
yellow red magnitudes and directions (R = 084)
105105
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 106: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/106.jpg)
Varying objects constraints forces Uncued forces
Cued forces
106106
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 107: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/107.jpg)
Probabilistic programs for model building (ldquoprogram-learningrdquo programs)
World state (t) World state (t+1)
Image (t) Image (t+1)
physics
graphics
hellip
Learning
107
Courtesy of National Academy of Sciences U S A Used with permission
Source Battaglia P W et al Simulation as an engine of physical scene understanding PNAS
110 no 45 (2013) 18327-18332 Copyright copy 2013 National Academy of Sciences USA
107
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 108: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/108.jpg)
The child as scientist Learning as ldquotheory buildingrdquo not ldquodata analysisrdquo Knowledge grows through hypothesis- and explanation-driven interpretations of sparse data causal learning learning theories learning compositional abstractions learning to learn exploratory learning social learning [Carey Karmiloff-Smith Gopnik Schulz Feigensonhellip]
copy AAAS All rights reserved This content is excluded from our Creative Commonslicense For more information see httpsocwmiteduhelpfaq-fair-use 108108
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 109: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/109.jpg)
Explaining the dynamics of development (w T Ullman Spelke others)
9 months
12 months
15 months
109
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
109
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 110: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/110.jpg)
Explaining the dynamics of development (w T Ullman
Spelke others)
9 months
12 months
15 months
Capture different knowledge stages with a sequence of probabilistic programs Explain the trajectory of stages as rational statistical inference in the space of programs
rights reserved This contentis excluded from our CreativeCommons license For moreinformation see httpsocw
miteduhelpfaq-fair-use
Courtesy of Elsevier Inc httpwwwsciencedirectcom Used with permissionSource Sommerville Jessica A Amanda L Woodward and Amy Needham Actionexperience alters 3-month-old infants perception of others actions Cognition 96no 1 (2005) B1-B11
110
3 months
5 months
65 months
125 months
copy Psychology Press All
Courtesy of Elsevier Inc httpwwwsciencedirectcomUsed with permissionSource Baillargeon Reneacutee Infants understanding of thephysical world Journal of the Neurological Sciences 143no 1-2 (1996) 199-199
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 111: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/111.jpg)
Conclusion What makes us so smart 1 How we start Common-sense core theories of intuitive physics and
intuitive psychology 2 How we grow Learning as theory construction revision and
refinement
The tools of probabilistic programs and program induction are beginning to let us reverse-engineer these capacities with languages that arendashProbabilistic ndashGenerative ndashCausally structured ndashCompositionally structured flexible fine-grained dependencies
hierarchical recursive unbounded
We have to view the brain not simply as a pattern-recognition device but as a modeling engine an explanation engine ndash and we have to understand how these views work together
Much promise but huge engineering and scientific challenges remainhellip full of opportunities for bidirectional interactions between cognitive science neuroscience developmental psychology AI and machine learning
111111
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-
![Page 112: Computational Cognitive Science - MIT …...The brain as a generative modeling engine Kenneth Craik (1914-1945) The Nature of Explanation (1943): One of the most fundamental properties](https://reader035.vdocument.in/reader035/viewer/2022070710/5ec69a7425ea1f1b6d46a0ca/html5/thumbnails/112.jpg)
MIT OpenCourseWarehttpsocwmitedu
Resource Brains Minds and Machines Summer CourseTomaso Poggio and Gabriel Kreiman
The following may not correspond to a particular course on MIT OpenCourseWare but has beenprovided by the author as an individual learning resource
For information about citing these materials or our Terms of Use visit httpsocwmiteduterms
- MITRES_9_003SUM15_Lec2-1pdf
-