
Page 1:

Jürgen Schmidhuber The Swiss AI Lab IDSIA Univ. Lugano & SUPSI http://www.idsia.ch/~juergen

Recursive Self-Improvement

NNAISENSE

Page 2:

Jürgen Schmidhuber You_again Shmidhoobuh

Page 3:

I. J. Good (1965): informal remarks on an "intelligence explosion" through recursive self-improvement (RSI), yielding "super-intelligences".

Next: four concrete algorithms for RSI: 1987, 1993, 1997, 2003…

Page 4:

1987 - First RSI: Genetic Programming (Cramer, 1985) recursively applied to itself, to obtain Meta-GP, Meta-meta-GP, etc.:

J. Schmidhuber. Evolutionary principles in self-referential learning. On learning how to learn: the meta-meta-... hook. Diploma thesis, TU Munich, 1987. http://people.idsia.ch/~juergen/diploma.html
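The two-level idea can be illustrated with a toy sketch. Reducing the mutation operator to a single rate parameter is an illustrative simplification of my own; the 1987 thesis recursively applies full GP to GP's own code, and all names below are invented:

```python
import random

# Toy two-level sketch of the meta-GP idea: a base-level search evolves
# candidate solutions, while a meta level mutates the mutation operator
# itself (here reduced to one rate parameter for illustration).

random.seed(0)

def evolve(target, rate, gens=50):
    """Base level: hill-climb an integer toward `target`, mutating by
    steps of size up to `rate`. Returns the residual error."""
    x = 0
    for _ in range(gens):
        cand = x + random.choice([-1, 1]) * random.randint(1, rate)
        if abs(cand - target) < abs(x - target):
            x = cand
    return abs(x - target)

def meta_evolve(target, meta_gens=20):
    """Meta level: mutate the mutation rate, keeping changes that let
    the base-level search end closer to the target."""
    rate, err = 1, evolve(target, 1)
    for _ in range(meta_gens):
        new_rate = max(1, rate + random.choice([-2, -1, 1, 2]))
        new_err = evolve(target, new_rate)
        if new_err <= err:
            rate, err = new_rate, new_err
    return rate, err

rate, err = meta_evolve(target=1000)
print(rate, err)
```

The same acceptance rule could be applied one level up again (meta-meta), which is the recursion the slide points to.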

Page 5:

1993: Gradient-based meta-RNNs that can learn to run their own weight change algorithm: J. Schmidhuber. A self-referential weight matrix. ICANN 1993

This was before LSTM. In 2001, however, Hochreiter taught a meta-LSTM to learn a learning algorithm for quadratic functions that was faster than backprop

Page 6:

1997: Lifelong meta-learning with self-modifying policies: 2 agents, 2 doors, 2 keys. The first agent at the southeast door wins 5, the other 3. Through recursive self-modifications only: from 300,000 steps per trial down to 5,000.

Page 13:

E.g., Schmidhuber, Zhao, Wiering: MLJ 28:105-130, 1997.

Success-story algorithm (SSA) for self-modifying code. R(t): reward until time t. Stack of past checkpoints v1 < v2 < v3 < … with self-modifications in between. SSA undoes self-modifications after each vi that is not followed by long-term reward acceleration up until t (now):

R(t)/t < [R(t) - R(v1)] / (t - v1) < [R(t) - R(v2)] / (t - v2) < …
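The checkpoint test can be sketched in a few lines (helper names and the toy reward history are hypothetical; the real SSA also restores the program to its state at the popped checkpoint):

```python
# Sketch of the success-story checkpoint test. At each SSA call, pop
# checkpoints v_i that violate the reward-acceleration chain
# R(t)/t < [R(t)-R(v1)]/(t-v1) < [R(t)-R(v2)]/(t-v2) < ...

def reward_per_step(R, start, t):
    """Average reward accrued between checkpoint `start` and now `t`."""
    return (R[t] - R[start]) / (t - start)

def ssa_check(stack, R, t):
    """Pop checkpoints whose later self-modifications were not followed
    by long-term reward acceleration; return the popped checkpoints
    (whose self-modifications would be undone)."""
    undone = []
    while stack:
        v = stack[-1]
        prev = stack[-2] if len(stack) > 1 else 0   # time 0, with R(0) = 0
        if reward_per_step(R, v, t) <= reward_per_step(R, prev, t):
            undone.append(stack.pop())              # undo self-mods after v
        else:
            break
    return undone

# Toy cumulative reward history R[t] and checkpoint stack:
R = {0: 0, 10: 1, 20: 5, 30: 6}
stack = [10, 20]
print(ssa_check(stack, R, 30))   # -> [20]: reward slowed after v2 = 20
```

Because only the top of the stack is ever tested against its predecessor, surviving checkpoints always form a strictly accelerating chain, which is exactly the inequality above.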

Page 14:

1997: Lifelong meta-learning with self-modifying policies: 2 agents, 2 doors, 2 keys. The first agent at the southeast door wins 5, the other 3. Through recursive self-modifications only: from 300,000 steps per trial down to 5,000.

Page 15:

Gödel Machine (2003): an agent-controlling program that reasons about itself, ready to rewrite itself in arbitrary fashion once it has found a proof that the rewrite is useful according to a user-defined utility function.

Theoretically optimal self-improver!
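A heavily simplified sketch of the control flow, with an empirical benchmark check standing in for the formal proof search. This substitution and all names below are mine, made only so the loop is runnable; the real machine proves theorems about its own hardware and software:

```python
# Toy illustration of the Gödel Machine control flow. The proof search
# is replaced by an empirical utility check on a fixed benchmark, which
# is a crude proxy, not the actual construction.

class ToyProver:
    """Stand-in for the proof searcher: 'certifies' a rewrite if it
    strictly raises total utility on a fixed benchmark."""
    def __init__(self, benchmark, utility):
        self.benchmark, self.utility = benchmark, utility

    def step(self, policy, candidate):
        old = sum(self.utility(policy(x)) for x in self.benchmark)
        new = sum(self.utility(candidate(x)) for x in self.benchmark)
        return candidate if new > old else None

def run(policy, candidates, prover):
    # Keep executing the current policy; switch only when the prover
    # certifies that a candidate rewrite improves utility.
    for cand in candidates:
        better = prover.step(policy, cand)
        if better is not None:
            policy = better                     # self-rewrite
    return policy

prover = ToyProver(benchmark=[1, 2, 3], utility=lambda y: y)
best = run(lambda x: x, [lambda x: x * 2, lambda x: x - 1], prover)
print(best(10))   # -> 20: only the certified (doubling) rewrite was kept
```

The essential point the toy does preserve: no rewrite is ever executed without a prior certificate that it helps, and in the real machine the rewrite may target any part of the code, including the prover itself.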

Page 16:

Initialize the GM by the asymptotically fastest method for all well-defined problems:

Given f: X → Y and x ∈ X, search proofs to find a program q that provably computes f(z) for all z ∈ X within time bound tq(z); spend most time on the f(x)-computing q with the best current bound.

Marcus Hutter, IDSIA, 2002, on Schmidhuber's SNF Universal AI grant.

n^3 + 10^10000 = n^3 + O(1)

As fast as the fastest f-computer, save for a factor of 1+ε and an f-specific additive constant independent of x!
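The time-allocation idea (not the proof search) can be sketched as follows. The scheduling constants, program names, and generator convention are toy choices of mine, not Hutter's exact scheme:

```python
# Sketch of the time-allocation idea only: give most of each round's
# compute to the program with the best currently proven bound on input
# x, while still interleaving the others in case a bound is loose.

def fastest_style_schedule(programs, x, budget):
    """programs: list of (bound, program) pairs found so far, where
    bound(z) is a proven time bound and program(x) is a generator
    yielding None until it finishes, then the result."""
    progs = sorted(programs, key=lambda p: p[0](x))   # best bound first
    gens = [prog(x) for _, prog in progs]
    while True:
        for i, g in enumerate(gens):
            steps = budget if i == 0 else max(1, budget // 4)
            for _ in range(steps):
                out = next(g)
                if out is not None:
                    return out

def fast_square(x):          # finishes within its proven bound of 3 steps
    for _ in range(3):
        yield None
    while True:
        yield x * x

def slow_square(x):          # computes the same f, much worse bound
    for _ in range(1000):
        yield None
    while True:
        yield x * x

result = fastest_style_schedule(
    [(lambda z: 1000, slow_square), (lambda z: 3, fast_square)], 7, budget=8)
print(result)   # -> 49, computed by the best-bounded program
```

Because the best-bounded program gets a constant fraction of every round, total time exceeds that program's own running time by only a constant factor, which is the intuition behind the 1+ε overhead on the slide.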
