march 2016 - wordpress.com · 2016-09-06 · full-sized game of go, a feat previously thought to be...

158

Upload: others

Post on 25-May-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 2: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

March 2016

Page 3: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Mainstream Media

Page 4: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

1997

Page 5: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Ing cup 1985 – 2000(up to 1,400,000$)

(1985-2000)

Page 6: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

5d win 1998

Page 7: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

October 2015

Page 8: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

This is the first time that a computer program has defeated a human professional player in the

full-sized game of Go, a featpreviously thought to be at

least a decade away.

Silver, D. et al., 2016. Mastering the game of Go with deep neural networks and tree search. Nature, 529(7587), p.484-489.

January 2016

Page 9: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

What did AlphaGo do to beat the strongest human Go player?

Tobias [email protected]

Page 10: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Go

Page 11: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Computational Challenge

Page 12: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Monte Carlo Method

Page 13: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Neural Networks

Page 14: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Revolution with Neural Networks

Page 15: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

What did we learn?

Page 16: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Go

Page 17: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 18: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 19: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 20: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 21: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 22: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 23: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 24: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 25: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 26: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 27: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 28: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 29: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 30: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 31: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 32: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 33: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 34: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 35: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 36: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Computational Challenge

Page 37: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Go vs. Chess

Page 38: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Complex vs. Complicated

Page 39: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

„While the Baroque rules of chess could only have been created by humans, the rules of

go are so elegant, organic, and rigorously logical that if intelligent life forms exist

elsewhere in the universe, they almost certainly play go.“

Edward Lasker (chess grandmaster)

Page 40: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Larger board19x19 vs. 8x8

Page 41: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Almost every move is legal

Page 42: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Average branching factor:250 vs 35

Page 43: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

State Space Complexity:10 171 vs 1047

Page 44: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

1080

Page 45: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Global impact of moves

Page 46: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

68957966354765

685766345

857635

563

6

MAX

MIN

MAX

MIN

MAX

Traditional Seach

Page 47: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

68957966354765

685766345

857635

563

6

MAX

MIN

MAX

MIN

MAX

Evaluation Function

Page 48: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 49: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Monte Carlo Method

Page 50: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

What is Pi?

Page 51: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

How do you determine Pi?

Page 52: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 53: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

2006

Page 54: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Browne, Cb, and Edward Powley. 2012. A survey of monte carlo tree search methods. Intelligence and AI 4, no. 1: 1-49

Page 55: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

2/4

1/1 0/1 1/1 0/1

A1D5

F13C7

Page 56: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

2/4

1/1 0/1 1/1 0/1

A1D5

F13C7

Selection

Page 57: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

2/4

1/1 0/1 1/1 0/1

A1D5

F13C7

0/0

B5

Expansion

Page 58: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

2/4

1/1 0/1 1/1 0/1

A1D5

F13C7

0/0

B5

Simulation

Page 59: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Random

Page 60: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Not Human like?

Page 61: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

3/5

2/2 0/1 1/1 0/1

A1D5

F13C7

1/1

B5

Backpropagation

Page 62: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

3/5

2/2 0/1 1/1 0/1

A1D5

F13C7

1/1

B5

Perspective

Page 63: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

2/5

1/2 0/1 1/1 0/1

A1D5

F13C7

1/1

B5

Perspective

Page 64: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Multi Armed Bandit

Page 65: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Multi Armed Bandit

Exploitation vs Exploration

Page 66: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

winsvisits

+explorationFactor √ ln (totalVisits)visits

Page 67: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

15042

86/193

0/1 1/2 0/2

36/1116

2/2

58/151

1/2 0/23/3

Page 68: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

15042

86/193

0/1 1/2 0/2

36/1116

2/2

58/151

1/2 0/23/3

Page 69: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

15042

86/193

0/1 1/2 0/2

36/1116

2/2

58/151

1/2 0/23/3

Page 70: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Generate a valid random move

Page 71: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Who has won?

Page 72: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 73: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

General Game Playing

Page 74: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Anytime

Page 75: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Lazy

Page 76: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 77: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Expert Knowledge

Page 78: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Neural Networks

Page 79: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

2014

Page 80: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

What does this even mean?

Page 81: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Neural Networks

Page 82: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Input“Hidden” Layer

Output

Neural Networks

Page 83: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Weights

Page 84: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Bias/Threshold

Page 85: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

4

2

-3

3.2

Activation

Page 86: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

5.2 >= 4

2

-3

3.2

Activation

Page 87: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

2.2 <= 4

2

-3

3.2

Activation

Page 88: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Activation

Page 89: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Training

Page 90: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Adjust parameters

Page 91: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Supervised Learning

InputExpectedOutput

Page 92: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Backpropagation

Page 93: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Data set

Page 94: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Training data + test data

Page 95: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Training

Page 96: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Verify

Page 97: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Overfitting

Page 98: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Deep Neural Networks

Page 99: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Convolutional Neural Networks

Page 100: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Local Receptive Field

Page 101: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Feature Map

Page 102: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Stride

Page 103: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Shared weights and biases

Page 104: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

19 x 19 3 x 17 x 17

Multiple Feature maps/filters

Page 105: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Architecture

...

Input Features

12 layers with 64 – 192 filters

Output

Page 106: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Architecture

...

Input Features

12 layers with 64 – 192 filters

Output

Page 107: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Architecture

...

Input Features

12 layers with 64 – 192 filters

Output

Page 108: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

2.3 million parameters630 million connections

Page 109: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

● Stone Colour x 3● Liberties x 4● Liberties after move played x 6● Legal Move x 1● Turns since x 5● Capture Size x 7 ● Ladder Move x 1● KGS Rank x 9

Input Features

Page 110: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Training on game data predicting the next move

Page 111: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

55% Accuracy

Page 112: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Mostly beats GnuGo

Page 113: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Combined with MCTS in the Selection

Page 114: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Asynchronous GPU Power

Page 115: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Revolution

Page 116: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Silver, D. et al., 2016. Mastering the game of Go with deep neural networks and tree search. Nature, 529(7587), p.484-489.

Networks in Training

Page 117: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Silver, D. et al., 2016. Mastering the game of Go with deep neural networks and tree search. Nature, 529(7587), p.484-489.

Networks in Training

Page 118: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Silver, D. et al., 2016. Mastering the game of Go with deep neural networks and tree search. Nature, 529(7587), p.484-489.

AlphaGo Search

Page 119: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Selection

Page 120: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Action ValuePrior Probability

Visit Count

Selection

Page 121: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Action ValuePrior Probability

Visit Count

Page 122: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Action ValuePrior Probability

Visit Count

Selection

Page 123: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Action ValuePrior Probability

Visit Count

Selection

Page 124: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

0.8

1.2 0.5 1.1 0.9

Action Value + Bonues

Page 125: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

0.8

1.2 0.5 1.1 0.9

Expansion

Page 126: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

0.8

1.2 0.5 1.1 0.9

Expansion

Page 127: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

0.8

1.2 0.5 1.1 0.9

Prior Probability

Page 128: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

0.8

1.2 0.5 1.1 0.9

Evalution

Page 129: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

0.8

1.2 0.5 1.1 0.9

Evalution

Page 130: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

0.8

1.2 0.5 1.1 0.9

Rollout

Page 131: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

0.8

1.2 0.5 1.1 0.9

Value Network

Page 132: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

0.81

1.3 0.5 1.1 0.9

Backup

1 .6

Page 133: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

1202 CPUs 176 GPUS0.8

1.2 0.5 1.1 0.9

Page 134: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Tensor

Page 135: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Human Instinct Policy NetworkReading Capability SearchPositional Judgement Value Network

3 Strengths of AlphaGo

Page 136: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Human Instinct Policy NetworkReading Capability SearchPositional Judgement Value Network

Most Important Strength

Page 137: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

More Natural

Page 138: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Lee Sedol match

Page 139: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Style

Page 140: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

So when AlphaGo plays a slack looking move,we may regard it as a mistake,

but perhaps it should more accurately be viewedas a declaration of victory?

An Younggil 8p

Page 141: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Game 2

Page 142: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Game 4

Page 143: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with
Page 144: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Game 4

Page 145: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Game 4

Page 146: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

What can we learn?

Page 147: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Making X fastervs

Doing less of X

Page 148: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Benchmark everything

Page 149: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Solving problems the human wayvs

Solving problems the computer way

Page 150: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Don't blindly dismiss approaches as infeasible

Page 151: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

One Approach vs

Combination of Approaches

Page 152: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Joy of Creation

Page 153: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

PragTob/Rubykon

Page 155: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

What did AlphaGo do to beat the strongest human Go player?

Tobias [email protected]

Page 156: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Sources

● Maddison, C.J. et al., 2014. Move Evaluation in Go Using Deep Convolutional Neural Networks.

● Silver, D. et al., 2016. Mastering the game of Go with deep neural networks and tree search. Nature, 529(7587), p.484-489.

● Michael A. Nielsen, "Neural Networks and Deep Learning", Determination Press, 2015 http://neuralnetworksanddeeplearning.com

● Gelly, S. & Silver, D., 2011. Monte-Carlo tree search and rapid action value estimation in computer Go. Artificial Intelligence, 175(11), p.1856-1876.

● I. Althöfer, “On the Laziness of Monte-Carlo Game Tree Search In Non-tight Situations,” Friedrich-Schiller Univ., Jena, Tech. Rep., 2008.

● Browne, C. & Powley, E., 2012. A survey of monte carlo tree search methods. IEEE Transactions on Intelligence and AI in Games, 4(1), p.1-49.

● Gelly, S. & Silver, D., 2007. Combining online and offline knowledge in UCT. Machine Learning, p.273-280.

● https://www.youtube.com/watch?v=LX8Knl0g0LE&index=9&list=WL

Page 157: March 2016 - WordPress.com · 2016-09-06 · full-sized game of Go, a feat previously thought to be at least a decade away. Silver, D. et al., 2016. Mastering the game of Go with

Photo Credit● http://www.computer-go.info/events/ing/2000/images/bigcup.jpg

● https://en.wikipedia.org/wiki/File:Kasparov-29.jpg

● http://www.geforce.com/hardware/desktop-gpus/geforce-gtx-titan-black/product-images

● http://giphy.com/gifs/dark-thread-after-lCP95tGSbMmWI

● https://cloudplatform.googleblog.com/2016/05/Google-supercharges-machine-learning-tasks-with-custom-chip.html

● https://gogameguru.com/i/2016/01/Fan-Hui-vs-AlphaGo-550x364.jpg

● http://makeitstranger.com/

● CC BY 2.0

– https://en.wikipedia.org/wiki/File:Deep_Blue.jpg

– https://www.flickr.com/photos/luisbg/2094497611/

● CC BY-SA 3.0

– https://en.wikipedia.org/wiki/Alpha%E2%80%93beta_pruning#/media/File:AB_pruning.svg

● CC BY-SA 2.0

– https://flic.kr/p/cPUtny

– https://flic.kr/p/dLSKTQ

– https://www.flickr.com/photos/83633410@N07/7658272558/