neurosymbolic aiintrotodeeplearning.com/2020/slides/6s191_mit_deep...narrow ai single task, single...

81
MIT-IBM WATSON AI LAB Neurosymbolic AI David Cox, Ph.D. IBM Director, MIT-IBM Watson AI Lab IBM Research

Upload: others

Post on 03-Jan-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

M I T - I B MW A T S O NA I L A B

Neurosymbolic AI—David Cox, Ph.D.IBM Director, MIT-IBM Watson AI LabIBM Research

Page 2: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

“Artificial Intelligence”

Page 3: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Narrow AIEmerging

Broad AIDisruptive and

Pervasive

General AIRevolutionary

▼ We are here 2050 and beyond 3IBM Research AI © 2018 IBM Corporation

The evolution of AI

Page 4: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Narrow AISingle task, single domainSuperhuman accuracy and

speed for certain tasks

Broad AIMulti-task, multi-domain

Multi-modalDistributed AI

Explainable

General AICross-domain

learning and reasoning

Broad autonomy

4IBM Research AI © 2018 IBM Corporation

The evolution of AI

Page 5: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Narrow AISingle task, single domainSuperhuman accuracy and

speed for certain tasks

Broad AIMulti-task, multi-domain

Multi-modalDistributed AI

Explainable

General AICross-domain

learning and reasoning

Broad autonomy

5IBM Research AI © 2018 IBM Corporation

The evolution of AI

Page 6: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 7: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Narrow AISingle task, single domainSuperhuman accuracy and

speed for certain tasks

Broad AIMulti-task, multi-domain

Multi-modalDistributed AI

Explainable

General AICross-domain

learning and reasoning

Broad autonomy

7IBM Research AI © 2018 IBM Corporation

The evolution of AI

Page 8: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

The path to a “Broad AI” toolbox

+

restaurant

cook Follow recipe

person

sweet

cheesecake

dessert

satisfy hunger oven

bake survive

swallow

eatcake

Learn more from small data

Reasoning

Explainability Security Ethics

Platform for AI Lifecycle

Infrastructure

Learns to transfer

+Physics of AI

Compute Data & Models Applications Workflow

Page 9: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

M I T - I B MW A T S O NA I L A B

Page 10: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Narrow AIEmerging

Broad AIDisruptive and

Pervasive

General AIRevolutionary

▼ We are here 2050 and beyond 14IBM Research AI © 2018 IBM Corporation

The evolution of AI

Page 11: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

So what’s “narrow” about today’s AI toolbox?

Page 12: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 13: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Karpathy and Li, 2015

Page 14: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 15: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Gatys et al. 2015 Brock et al. 2018

Page 16: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

“Teddy Bear”

Meret Oppenheim, Le Déjeuner en fourrure

Page 17: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Wang et al. 2018

Page 18: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Karpathy and Li, 2015

Page 19: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Lake, Ullman, Tenenbaum & Gershman, 2016

Page 20: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 21: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

What’s this?

Page 22: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 23: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 24: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 25: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 26: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

#MITIBM #AI

ObjectNet

Andrei BarbuMIT

Boris KatzMIT

Dan GutfreundIBM

Page 27: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

#MITIBM #AI

ObjectNet

• ~50K images

• ~300 object classes

• 4 different room types

Page 28: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

#MITIBM #AI

Testing ImageNet-trained models on ObjectNet

Page 29: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Chen et al. 2018

Pin-yu ChenIBM

Page 30: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Xu et al. 2019

Page 31: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 32: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 33: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain
Page 34: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

“Apple”

Neural Networks / Deep Learning

Page 35: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

apple

origin structurekind

apple tree body stem fruit

shape size color taste

round hand red green apple

Reproduced from Minksy, 1991

Symbolic AI

Page 36: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Disentangling reasoning from vision and language understanding

Neural-symbolic AI

Joshua TenenbaumChuang GanJiajun Wu

Small bluerubbercube

Small greenrubbercylinder

Large redmetalsphere

Question: Are there an equal number of large things and metal spheres?

Program: equal_number(count(filter_size(S

cene, Large)), count(filter_material(filter_shape(Scene, Sphere), Metal)))

Answer: Yes

MIT-IBM Watson AI Lab

Page 37: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

End-to-End Visual Reasoning

Visual Question AnsweringQ: What’s the shape of the red object?

End-to-EndNeural Network A: Sphere.

NMN [Andreas et al., 2016]IEP [Johnson et al., 2017]FiLM [Perez et al., 2018],MAC [Hudson & Manning, 2018]Stack-NMN [Hu et al., 2018]TbD [Mascharka et al. 2018]

Page 38: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

End-to-EndNeural Network A: Sphere.

Visual Question AnsweringQ: What’s the shape of the red object?

NMN [Andreas et al., 2016]IEP [Johnson et al., 2017]FiLM [Perez et al., 2018],MAC [Hudson & Manning, 2018]Stack-NMN [Hu et al., 2018]TbD [Mascharka et al. 2018]

Concept(e.g., colors, shapes)

Reasoning(e.g., count)

End-to-End Visual Reasoning

Page 39: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

End-to-EndNeural Network A: Sphere.

Visual Question AnsweringQ: What’s the shape of the red object?

NMN [Andreas et al., 2016]IEP [Johnson et al., 2017]FiLM [Perez et al., 2018],MAC [Hudson & Manning, 2018]Stack-NMN [Hu et al., 2018]TbD [Mascharka et al. 2018]

Concept(e.g., colors, shapes)

Reasoning(e.g., count)

Entangled

End-to-End Visual Reasoning

Page 40: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

End-to-EndNeural Network A: Sphere.

Visual Question AnsweringQ: What’s the shape of the red object?

Image CaptioningInstance Retrieval

NMN [Andreas et al., 2016]IEP [Johnson et al., 2017]FiLM [Perez et al., 2018],MAC [Hudson & Manning, 2018]Stack-NMN [Hu et al., 2018]TbD [Mascharka et al. 2018]

Concept(e.g., colors, shapes)

Reasoning(e.g., count)

Entangled

Hard to transfer

End-to-End Visual Reasoning

Page 41: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Task: Visual Reasoning

Question: Are there an equal number of large things and metal spheres?

Page 42: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Question: Are there an equal number of large things and metal spheres?

Task: Visual Reasoning

Page 43: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Question: Are there an equal number of large things and metal spheres?

3 large things!

Task: Visual Reasoning

Page 44: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Question: Are there an equal number of large things and metal spheres?

3 large things!

3 metal spheres!

Task: Visual Reasoning

Page 45: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Question: Are there an equal number of large things and metal spheres?

3 large things!

3 metal spheres!

Equal? Yes!

Task: Visual Reasoning

Page 46: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Question: Are there an equal number of large things and metal spheres?

Visual Perception

Question Understanding

Logic Reasoning

3 large things!

3 metal spheres!

Equal? Yes!

Task: Visual Reasoning

Page 47: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Vision (CNN)

Language (RNN)

Structured Representation

SymbolicProgram

Page 48: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Incorporate Concepts in Visual ReasoningNS-VQA [Yi et al. 2018]

Vision

SceneParsing

Q: What’s the shape ofthe red object?

Language

Page 49: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Incorporate Concepts in Visual Reasoning

Vision

1ID Color Shape Material1 Green Cube Metal

SceneParsing

Q: What’s the shape ofthe red object?

Language

NS-VQA [Yi et al. 2018]

Page 50: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Incorporate Concepts in Visual Reasoning

Vision

ID Color Shape Material1 Green Cube Metal2 Red Sphere Rubber

1

2

Q: What’s the shape ofthe red object?

Language

SceneParsing

SemanticParsing Filter(Red)

Query(Shape)

Program

NS-VQA [Yi et al. 2018]

Page 51: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Incorporate Concepts in Visual Reasoning

Vision

ID Color Shape Material1 Green Cube Metal2 Red Sphere Rubber

1

2

Q: What’s the shape ofthe red object?

LanguageSemanticParsing Filter(Red)

Query(Shape)

Program

SymbolicReasoning

SceneParsing

NS-VQA [Yi et al. 2018]

Page 52: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Q: What’s the shape ofthe red object?

SceneParsing

Incorporate Concepts in Visual Reasoning

Vision

ID Color Shape Material1 Green Cube Metal2 Red Sphere Rubber

1

2

Language

SymbolicReasoning

SemanticParsing Filter(Red)

Query(Shape)

Program

NS-VQA [Yi et al. 2018]

Page 53: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

SceneParsing

Incorporate Concepts in Visual Reasoning

Vision

ID Color Shape Material1 Green Cube Metal2 Red Sphere Rubber

1

2Symbolic

Reasoning

Q: What’s the shape ofthe red object?

LanguageSemanticParsing Filter(Red)

Query(Shape)

Program

NS-VQA [Yi et al. 2018]

Page 54: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

SceneParsing

Incorporate Concepts in Visual Reasoning

Vision

1

2Symbolic

Reasoning

ID Color Shape Material1 Green Cube Metal2 Red Sphere Rubber

Q: What’s the shape ofthe red object?

LanguageSemanticParsing Filter(Red)

Query(Shape)

Program

NS-VQA [Yi et al. 2018]

Sphere

Page 55: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Advantage 1: High Accuracy

Method Accuracy (%)Human 92.6

RN 95.5IEP 96.9

FiLM 97.6MAC 98.9TbD 99.1

NS-VQA (Ours) 99.8

[Yi et al. NeurIPS 2018, Johnson et al. ICCV 2017, Santoro et al. NIPS 2017, Perez et al. AAAI 2018, Hudson et al. ICLR 2018, Mascharka et al. CVPR 2018]

Effectively perfect!

Page 56: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

High accuracy when trained with just 1% the of the data that other methods require

[Yi et al. NeurIPS 2018]

Advantage 2: Data Efficiency

Page 57: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Question: What number of cylinders are gray objects or tiny brown matte objects?

scenefilter_smallfilter_brownfilter_rubber

scenefilter_gray

unionfilter_cylinder

count

filter_smallfilter_brownfilter_largefilter_cyan

...(25 modules)filter_metal

unionfilter_cylinder

count

Ours

Answer: 1

IEP

Answer: 2

Question: Are there more yellow matte things that are right of the gray ball than cyan metallic objects?

scenefilter_cyanfilter_metal

count...(4 modules)

scenefilter_yellowfilter_rubber

countgreater_than

filter_smallfilter_cyan

unionfilter_brown

...(25 modules)filter_smallfilter_yellowfilter_rubber

countgreater_than

Ours

Answer: no

IEP

Answer: no

Advantage 3: Transparency and Interpretability

[Yi et al. NeurIPS 2018, Johnson et al. ICCV 2017]

Question: What number of cylinders are gray objects or tiny brown matte objects?

scenefilter_smallfilter_brownfilter_rubber

scenefilter_gray

unionfilter_cylinder

count

filter_smallfilter_brownfilter_largefilter_cyan

...(25 modules)filter_metal

unionfilter_cylinder

count

Ours

Answer: 1

IEP

Answer: 2

Question: Are there more yellow matte things that are right of the gray ball than cyan metallic objects?

scenefilter_cyanfilter_metal

count...(4 modules)

scenefilter_yellowfilter_rubber

countgreater_than

filter_smallfilter_cyan

unionfilter_brown

...(25 modules)filter_smallfilter_yellowfilter_rubber

countgreater_than

Ours

Answer: no

IEP

Answer: no

Page 58: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

62

NeurIPS 2018: Neurosymbolic VQA: Properties (e.g. “color”) and values (“red”) predefined

ICLR 2019: Neurosymbolic Concept Learner: Properties predefined, can learn new values autonomously

NeurIPS 2019: Neurosymbolic Metaconcept Learner:Autonomously learns new concepts

less predefined, more autonomous →

ICML 2020 (target submission): Real world images

Page 59: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Filter Queryred shape

Neuro-Symbolic Concept Learning

Q: What’s the shapeof the red object?

2

1Visual RepresentationObj 1

Concept Embeddingsred......

Obj 2

Page 60: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Filter Queryred shape

Q: What’s the shapeof the red object?

2

1Visual RepresentationObj 1

Concept Embeddingsred......

Obj 2

Color Space

General Representation SpaceObj 1

Color(Obj 1)

ColorProj.

Neuro-Symbolic Concept Learning

Page 61: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Filter Queryred shape

Q: What’s the shapeof the red object?

2

1Visual RepresentationObj 1

Concept Embeddingsred......

Obj 2

Color Space

General Representation SpaceObj 1

Color(Obj 1)

ColorProj.

Obj 2

Color(Obj 2)

Neuro-Symbolic Concept Learning

Page 62: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Filter Queryred shape

Q: What’s the shapeof the red object?

2

1Visual RepresentationObj 1

Concept Embeddingsred......

Obj 2

Color Space

General Representation SpaceObj 1

Color(Obj 1)

Obj 2

Color(Obj 2)

red

Neuro-Symbolic Concept Learning

Page 63: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

apple

origin structurekind

apple tree body stem fruit

shape size color taste

round hand red green apple

Reproduced from Minksy, 1991

Symbolic AI

Page 64: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Visual reasoning questions + Metaconcept questionsQ: Is red a same kind of concept as green?A: Yes.

Q: Is cube a synonym of block?A: Yes.

Q: Is Laridae a hypernym of Ivory gull?A: Yes.

CLEVR(Johnson et al. 2017)

color:red

color:green

Q: Is there any red cube?A: Yes.

Q: Is there any green block?A: Yes

CUB(Wah et al. 2011)

IvoryGull

LaridaeBlackTern

Q: Is there any Ivory Gull?A: Yes.Q: Is there any Laridae?A: Yes.Q: Is there any Black Tern?A: Yes.Q: Is there any Laridae?A: Yes.

Meta-concept Learning Han et al. NeurIPS 2019

Page 65: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Augmenting VQA with MetaconceptsVisual reasoning questions + Metaconcept questions

Q: Is red a same kind of concept as green?A: Yes.

Q: Is cube a synonym of block?A: Yes.

Q: Is Laridae a hypernym of Ivory gull?A: Yes.

CLEVR(Johnson et al. 2017)

color:red

color:green

Q: Is there any red cube?A: Yes.

Q: Is there any green block?A: Yes

CUB(Wah et al. 2011)

IvoryGull

LaridaeBlackTern

Q: Is there any Ivory Gull?A: Yes.Q: Is there any Laridae?A: Yes.Q: Is there any Black Tern?A: Yes.Q: Is there any Laridae?A: Yes.

Page 66: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Program Execution Animated

Visual reasoning questions

ObjectDetection

FeatureExtraction

Q: Is there any red object? P: Exist( Filter( red ) )

SemanticParsing

red

Metaconcept questions

Q: Is red a same kind ofconcept as yellow?

P: MetaVerify(red, yellow, same-kind

)

SemanticParsing

score=0.9

score=0.1Max score=0.9

score=0.9redsame-kindyellow

MetaVerify

Similarity

Similarity

Answer: Yes

Obj. 1

Obj. 2

Answer: Yes

P: Exist( Filter( red ) )P: Exist( Filter( red ) )P: Exist( Filter( red ) )

P: MetaVerify(red, yellow, same-kind

)

P: MetaVerify(red, yellow, same-kind

)

Page 67: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Generalization

Metaconcept GeneralizationQ: Is there any airplane?A: Yes

Q: Is there any plane?A: Yes

Q: Is there any kid?A: Yes

Q: Is there any child?A: Yes

Q: Is airplane a synonym of plane?A: YesQ: Is kid a synonym of child?A: Yes

synonym?synonym

Training Testing: metaconcepts onunseen pairs of concepts

airplane

plane

kid

child

Page 68: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Metaconcept Generalization: ResultsQ: Is there any airplane?A: Yes

Q: Is there any plane?A: Yes

Q: Is there any kid?A: Yes

Q: Is there any child?A: Yes

Q: Is airplane a synonym of plane?A: YesQ: Is kid a synonym of child?A: Yes

Training Testing

Generalization

Page 69: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

CLEVERER: CoLlision Events for Video REpresentation and Reasoning• Descriptive

Q: What is the material of the last object to collide with the cyan cylinder?

Chuang Gan w/ Kevin Xi, Yunzhu Li, Pushmeet Kohli, Jiajun Wu, Antonio Torralba & Josh Tenenbaum

A: Metal

Page 70: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

• Explanatory

Q: What is responsible for the collision between the rubber and metal cylinder?

A. The presence of the yellow sphereB. The collision between the rubber cylinder and the red rubber sphere

Page 71: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

• Counterfactual

Q: What will happen without the cyan cylinder?

A. The red rubber sphere and the metal sphere collideB. The red rubber sphere and the gray object collide

Page 72: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Why is our database down?

What factors might contribute to better output from Factory A vs. Factory B?

How many employees have over 10 years experience but have moved location in the last year?

Looking Ahead

Page 73: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

NeurosymbolicPlanning

NeurosymbolicSafe ML/RL

NeurosymbolicNLU

NeurosymbolicCode Optimization

NeurosymbolicGenerative Models

NeurosymbolicMachine Common Sense

Srivastava et al. 2020 (submitted)

Asai et al. AAAI 2018 Shi et al. ICLR 2019

Wilcox et al. NAACL 2019Fulton et al AAAI 2018

Smith et al. NeurIPS 2019

Page 74: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Inferring flexible behavioral plans/policies from temporal observation data

Inducing Behavioral Insight

MIT-IBM Watson AI Lab

Christian MuiseIBM

Julie ShahMIT

Page 75: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

(:action pickup

:parameters (?b1 ?b2 - block)

:precondition (and (on ?b1 ?b2)(hand-clear))

:effect (and (not (hand-clear))(not (on ?b1 ?b2))(holding ?b1))

)

Task: Induce the action theory of anenvironment through observations

Page 76: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Mixing symbolic planning with neural networks

LatPlan

MIT-IBM Watson AI Lab

Masataro AsaiIBM

Page 77: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Mixing symbolic planning with neural networks

LatPlan

MIT-IBM Watson AI Lab

Masataro AsaiIBM

Page 78: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Verifiably Safe Reinforcement Learning

82

{accel,brake,turn}

Observe Reward

Safe?Policy

φ

Use a theorem prover to prove:init→[{{accel∪brake};ODEs}*]safe

is correctly monitored by φ.

Nathan FultonIBM

MIT-IBM Watson AI Lab

Page 79: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

apple

origin structurekind

apple tree body stem fruit

shape size color taste

round hand red green apple

+

NEURAL NETWORKS SYMBOLIC AI

Page 80: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

Causal InferenceBeyond Correlation—inferring and testing for causal relationships in complex systems

Caroline UhlerMIT

Guy BreslerMIT

Karthikeyan Shanmugam

IBM

http://tylervigen.com/spurious-correlations

MIT-IBM Watson AI Lab

Page 81: Neurosymbolic AIintrotodeeplearning.com/2020/slides/6S191_MIT_Deep...Narrow AI Single task, single domain Superhuman accuracy and speed for certain tasks Broad AI Multi-task, multi-domain

S2 S7

S1 S3

S4

S6

S5

MIT-IBM Watson AI Lab