lecture 1 - nn 2016-17 - ut€¦ · basics foundations lesson title 1 introduction 2 probability...

56
Neural Networks Lecture 1: Introduction

Upload: others

Post on 14-Sep-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Neural NetworksLecture 1: Introduction

Page 2: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Objectives of the course

• to acquire main concepts of the theory and practice of modern neural networks

• to apply deep learning approaches to different research areas

• to develop a critical view of recent literature and news about deep learning technology

Page 3: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Organization of the course

https://courses.cs.ut.ee/2017/nn/fall

Page 4: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Organization of the course

Raul Vicente [email protected] Matiisen [email protected]

Group of Computational NeuroscienceUlikooli 17: room 208

https://courses.cs.ut.ee/2017/nn/fall

Page 5: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Schedule of the course

• Lectures (Monday 4:15 @ room 122 in Liivi 2)

• Practical (Tuesday 2:15 @ room 122 in Liivi, 2)

Page 6: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Selected topics

Advanced

Basics

Foundations

Lesson Title

1 Introduction

2 Probability and Information Theory

3 Basics of Machine Learning

4 Deep Feed-forward networks

5 Back-propagation

6 Optimization and regularization

7 Convolutional Networks

8 Sequence Modeling: recurrent networks

9 Applications

10 Sofware and Practical methods

11 Representational Learning

12 Deep Generative Models

13 Deep Reinforcement Learning

14 Deep Learning and the Brain

15 Conclusions and Future Perspective

16 Project presentations

Page 7: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Lesson Title

1 Introduction

2 Probability and Information Theory

3 Basics of Machine Learning

4 Deep Feed-forward networks

5 Back-propagation

6 Optimization and regularization

7 Convolutional Networks

8 Sequence Modeling: recurrent networks

9 Applications

10 Software and Practical methods

11 Representational Learning

12 Deep Generative Models

13 Deep Reinforcement Learning

14 Deep Learning and the Brain

15 Conclusions and Future Perspective

16 Project presentations

Homework

Project

Page 8: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5
Page 9: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Small exercises/questionnaire (about concepts in lecture)

Practical session (Python notebook)

Homework

http://cs231n.github.io/python-numpy-tutorial/

Page 10: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

1-4 people

small research (data analysis, analytical calculation, replication of a model, essay…)

12-15 pages report

oral presentation (10-15 min)

Project

Page 11: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Project (ANN playing Atari)

Page 12: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Project (self-driving minicar)

Page 13: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Project (synaptic failure)

Page 14: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Grading system

• Homework (60%)

• Project (40%)

Differentiated: A-F

50% for each component to get a final assessment

Bonus exercises (up to 10%)

You can collect up to 110 points, 90+ will give you A, 80+ B, 70+ C and so on.

Page 15: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Pre-requisites

• Programming (e.g. Python)

• Calculus, basic probability & statistics, linear algebra, basic machine learning…

Page 16: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5
Page 17: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5
Page 18: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5
Page 19: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5
Page 20: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

ArtificialIntelligence

Input: Output:Sensors

DataMovementLanguage

Why Artificial Intelligence?

Page 21: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5
Page 22: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5
Page 23: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

What is Deep Learning?

Methods that learn multiple levels of representation to model the relation between input data and output

Page 24: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Deep Learning = Artificial Neural Networks

Page 25: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Artificial neural network

• A collection of simple trainable mathematical units, which collaborate to compute a complicated function

• Compatible with supervised, unsupervised, and reinforcement

• Brain inspired (loosely)

Page 26: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

The neuron

w1 w2 w3

• Different weights compute different functions

yi = F

X

i

wixi

!

F (x) =

1

1 + exp (�x)

x1 x2 x3

Page 27: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Neural network

Input:

Output:

Page 28: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Neural Network

Input:

Output:

“Dog” “Cat”

Page 29: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Neural network

Entrada:

Salida:

Page 30: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Neural network

Input:

Output:

Page 31: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Input:

Output:

Neural network“Dog” “Cat”

Page 32: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Learning algorithm

• modify connection weights to make prediction closer to y

• run neuronal network on input x

• while not done

• pick a random training case (x, y)

Page 33: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

McCulloch & Pitts (1943)A Logical Calculus of the Ideas

Immanent in Nervous Activity

Rosenblatt (1957)Perceptron

Minsky & Papert (1969)Perceptrons: an introduction to computational geometry

Rumelhart, Hinton & Williams (1986)Learning representations by back-propagating errors

Page 34: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Why now?

Page 35: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Why now?

Page 36: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

90%data createdgloballybetween2013-2015

10%datos creadosBefore 2013

Volume of data

Source: IBM

Page 37: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

5000 samples to work reasonably

10000000 samples to rival human level

Page 38: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

x 1000000

Why now?

Page 39: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Why now?

Page 40: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

• Pre-training (weights initialization)

• Efficient descent algorithms

• Dropout

• Domain Prior Knowledge

• Activation

(complex landscape)

(complex landscape)

(vanishing gradient)

(overfitting)

Page 41: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Now that we are deep...

• Powerful function approximator

• Instead of hand-crafted features, let the algorithm build the relevant features for your problem

• More representational power for learning

Page 42: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

With deep networks…

• the network can build blocks made of blocks…

• the key of deep learning is that allows a better representation of the data for each task

Page 43: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

http://www.clarifai.com/

Object recognition

ImageNet competition

1000 categories

Page 44: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

25% 15%

2012 2016

3.6%

Humans

5.1%

Page 45: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Automatic description of images

Page 46: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Playing Atari with Deep Reinforcement Learning

Page 47: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

IA that in few hours learns to play better than humans!

Pong Breakout Seaquest

Space Invaders EnduroBeam Rider

Page 48: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Input Action

Reward

Page 49: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

It does not know the game rules, nor what is a ball, nor that blocks can be broken,...

Its only obsession in life is to increase the reward!

Page 50: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5
Page 51: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5
Page 52: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5
Page 53: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Open questions

• Transfer learning/planning

• Mathematical guarantees

• Why are so efficient?

• Neuroscience inspirations

• …

Page 54: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

• Deep Learning = Neural Networks 3.0

• Feature learning & approximate wild functions

• Pushing ML and AI to unthinkable applications

Take home message

Page 55: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

Selected topics

Advanced

Basics

Foundations

Lesson Title

1 Introduction

2 Probability and Information Theory

3 Basics of Machine Learning

4 Deep Feed-forward networks

5 Back-propagation

6 Optimization and regularization

7 Convolutional Networks

8 Sequence Modeling: recurrent networks

9 Applications

10 Software and Practical methods

11 Representational Learning

12 Deep Generative Models

13 Deep Reinforcement Learning

14 Deep Learning and the Brain

15 Conclusions and Future Perspective

16 Project presentations

Page 56: Lecture 1 - NN 2016-17 - ut€¦ · Basics Foundations Lesson Title 1 Introduction 2 Probability and Information Theory 3 Basics of Machine Learning 4 Deep Feed-forward networks 5

To know more

Deep Learning, Goodfellow, Bengio, Courville, MIT press 2017