Monte Carlo Methods
• Learn from complete sample returns
  – Only defined for episodic tasks
• Can work in different settings
  – On-line: no model necessary
  – Simulated: no need for full model
• DP doesn’t need any experience
• MC doesn’t need a full model
  – Learn from actual or simulated experience
• MC expense independent of total # of states
• MC has no bootstrapping
  – Not harmed by Markov violations
• Every-visit MC: average returns every time s is visited in an episode
• First-visit MC: average returns only the first time s is visited in an episode
• Both converge asymptotically to Vπ
1st Visit MC for Vπ
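The update is simple enough to state in code. Below is a minimal Python sketch of first-visit MC prediction for Vπ, assuming a hypothetical environment with `reset()` and `step(action) -> (next_state, reward, done)` and a callable `policy(state)`; none of these names come from the slides.

```python
from collections import defaultdict

def first_visit_mc_prediction(policy, env, num_episodes, gamma=1.0):
    """Estimate V^pi by averaging first-visit sample returns."""
    returns_sum = defaultdict(float)   # sum of first-visit returns per state
    returns_count = defaultdict(int)   # number of first visits per state
    V = defaultdict(float)

    for _ in range(num_episodes):
        # Generate one complete episode following pi (hypothetical env API).
        episode = []                   # list of (state, reward) pairs
        state = env.reset()
        done = False
        while not done:
            next_state, reward, done = env.step(policy(state))
            episode.append((state, reward))
            state = next_state

        # Walk backwards through the episode, accumulating the return G.
        first = {}
        for i, (s, _) in enumerate(episode):
            first.setdefault(s, i)     # index of the first visit to s
        G = 0.0
        for t in reversed(range(len(episode))):
            s, r = episode[t]
            G = gamma * G + r
            if first[s] == t:          # update only on the first visit to s
                returns_sum[s] += G
                returns_count[s] += 1
                V[s] = returns_sum[s] / returns_count[s]
    return V
```

Dropping the `if first[s] == t` test turns this into every-visit MC.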
What about control?
Want to improve the policy by acting greedily with respect to Q
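In symbols, the improvement step is:

```latex
\pi(s) \leftarrow \arg\max_a Q(s, a)
```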
• MC can estimate Q when a model isn’t available
  – Want to learn Q*
• Converges asymptotically if every state–action pair is visited “enough”
  – Need to explore: why?
• Exploring starts: every state–action pair has non-zero probability of being the starting pair
  – Why necessary?
  – Still necessary if π is probabilistic?
MC for Control (Exploring Starts)
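A minimal sketch of the algorithm, reusing the environment interface assumed above plus two further hypothetical helpers, `env.state_action_pairs()` and `env.reset_to(state)`, to implement the exploring starts:

```python
import random
from collections import defaultdict

def mc_control_exploring_starts(env, actions, num_episodes, gamma=1.0):
    """MC control with exploring starts (first-visit updates)."""
    Q = defaultdict(float)
    returns_sum = defaultdict(float)
    returns_count = defaultdict(int)
    pi = {}                                     # deterministic greedy policy

    all_pairs = list(env.state_action_pairs())  # hypothetical helper

    for _ in range(num_episodes):
        # Exploring start: every (s, a) pair has non-zero start probability.
        state, action = random.choice(all_pairs)
        env.reset_to(state)                     # hypothetical helper
        episode = []
        done = False
        while not done:
            next_state, reward, done = env.step(action)
            episode.append((state, action, reward))
            state = next_state
            if not done:                        # follow pi after the start
                action = pi.get(state, random.choice(actions))

        # First-visit updates on Q, then greedy improvement at each state.
        first = {}
        for i, (s, a, _) in enumerate(episode):
            first.setdefault((s, a), i)
        G = 0.0
        for t in reversed(range(len(episode))):
            s, a, r = episode[t]
            G = gamma * G + r
            if first[(s, a)] == t:
                returns_sum[(s, a)] += G
                returns_count[(s, a)] += 1
                Q[(s, a)] = returns_sum[(s, a)] / returns_count[(s, a)]
                pi[s] = max(actions, key=lambda b: Q[(s, b)])
    return pi, Q
```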
• Q0(·,·) = 0
• π0 = ?
• Action: 1 or 2
• State: amount of money
• Transition: based on state, flip outcome, and action
• Reward: amount of money
• Return: money at end of episode
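The slide leaves the exact rules open, so the following environment is an entirely assumed instantiation: bet 1 or 2 units on a fair coin flip, win or lose the bet, and end the episode when the money reaches 0 or a cap. It matches the `reset()`/`step()` interface used in the sketches above.

```python
import random

class CoinFlipGamble:
    """A concrete guess at the gambling example; every rule here is an
    assumption, not something stated on the slide."""

    def __init__(self, start=5, cap=10):
        self.start, self.cap = start, cap

    def reset(self):
        self.money = self.start        # state: amount of money
        return self.money

    def step(self, action):            # action: bet 1 or 2 units
        bet = min(action, self.money)  # can't bet money you don't have
        self.money += bet if random.random() < 0.5 else -bet
        self.money = min(self.money, self.cap)
        done = self.money in (0, self.cap)
        reward = self.money if done else 0   # return = money at episode end
        return self.money, reward, done
```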
MC for Control, On-Policy
• Soft policy:
  – π(s,a) > 0 for all s, a
  – e.g., ε-greedy
• Can prove a version of policy improvement for soft policies
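A minimal sketch of on-policy first-visit MC control with an ε-greedy (hence soft) policy, under the same hypothetical environment interface as above:

```python
import random
from collections import defaultdict

def epsilon_greedy(Q, state, actions, epsilon):
    """Soft policy: every action keeps probability at least epsilon/|A|."""
    if random.random() < epsilon:
        return random.choice(actions)
    return max(actions, key=lambda a: Q[(state, a)])

def on_policy_mc_control(env, actions, num_episodes, gamma=1.0, epsilon=0.1):
    Q = defaultdict(float)
    returns_sum = defaultdict(float)
    returns_count = defaultdict(int)

    for _ in range(num_episodes):
        # Generate an episode with the current epsilon-greedy policy.
        episode = []
        state = env.reset()
        done = False
        while not done:
            action = epsilon_greedy(Q, state, actions, epsilon)
            next_state, reward, done = env.step(action)
            episode.append((state, action, reward))
            state = next_state

        # First-visit MC updates on Q; the policy improves implicitly
        # because epsilon_greedy always reads the latest Q.
        first = {}
        for i, (s, a, _) in enumerate(episode):
            first.setdefault((s, a), i)
        G = 0.0
        for t in reversed(range(len(episode))):
            s, a, r = episode[t]
            G = gamma * G + r
            if first[(s, a)] == t:
                returns_sum[(s, a)] += G
                returns_count[(s, a)] += 1
                Q[(s, a)] = returns_sum[(s, a)] / returns_count[(s, a)]
    return Q
```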
Off-Policy Control
• Why consider off-policy methods?
• Learn about π while following π′
  – Behavior policy (π′) vs. estimation policy (π)
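The standard correction (assuming this deck follows Sutton & Barto here) is importance sampling: a return generated under the behavior policy π′ is reweighted by the relative probability that the estimation policy π would have produced the same action sequence:

```latex
\rho_{t:T-1} = \prod_{k=t}^{T-1} \frac{\pi(a_k \mid s_k)}{\pi'(a_k \mid s_k)},
\qquad
V^{\pi}(s) \approx \frac{1}{n} \sum_{i=1}^{n} \rho^{(i)} G^{(i)}
```

If π is deterministic and greedy, ρ is zero whenever π′ takes a non-greedy action, so only the tail of the episode after the last non-greedy action contributes; that is the question raised on the next slide.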
Off-Policy Learning
• Why only learn from the tail?
• How to choose the policy to follow?