1 monte-carlo methods in ai: overview prasad tadepalli

5

1 Monte-Carlo Methods in AI: Overview Prasad Tadepalli

Upload: steven-sparks

Post on 14-Jan-2016

213 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: 1 Monte-Carlo Methods in AI: Overview Prasad Tadepalli

1

Monte-Carlo Methods in AI: Overview

Prasad Tadepalli

Page 2: 1 Monte-Carlo Methods in AI: Overview Prasad Tadepalli

What is a Monte-Carlo Method? Any method that relies on repeated random

simulations to estimate something

Simplest case: Polling – who wins the election? True probability of a person voting for Obama is Ask N = 1000 random registered voters how they vote. Calculate = #(Obama voters)/1000

Apply Chernoff’s bound

Key idea: Although people are complex and varied, they can be treated as independent samples of an identical distribution for estimation

2Pr ( ) ( ) expP Obama P Obama N

( )P Obama

( )P Obama

Page 3: 1 Monte-Carlo Methods in AI: Overview Prasad Tadepalli

Applications First modern use in simulating nuclear

reactions in 1940’s by Stanislaw Ulam

Predicting the behavior of complex systems – weather, finance, fluid dynamics, markets, …

Planning and optimization - Computer games: Bridge, Go, Solitaire, StarCraft Optimal path planning in time-sensitive networks True model either does not exist or is too

complicated to reason about

Page 4: 1 Monte-Carlo Methods in AI: Overview Prasad Tadepalli

Two Fundamental Problems Prediction/Inference Problem

Given a probabilistic model of how the world operates (a “Bayesian Network”) and some observed evidence, what can we infer about a particular query variable?

Draw samples of the model where the observed evidence is true

Estimate the number of times the query variable is true

Planning/Optimization Problem Given a faithful simulator of an environment, how can we

use it to choose an optimal action? Run lots and lots of trials Combine the evidence in a “smart” way Output the action that yields best results

Page 5: 1 Monte-Carlo Methods in AI: Overview Prasad Tadepalli

Organization

Monday, Tuesday, Wednesday are divided into 2 parts Mornings

Inference/Prediction Problem (Experiments with Genie) Application Talk

Afternoons Planning/Optimization Problem (Experiments with MCP) Project/Lab (Galcon)

Wednesday evening dinner @5:30, McMenamins, Monroe

Thursday 2 talks plus tournament project work Tournament code is due: Friday 9 AM. Friday – Advanced topics, tournaments, student

presentations

The Future of Filmscanning, Sal Prasad | Prasad Group

1 Prasad Tadepalli Intelligent assistive systems Infer the goals of the human users and offer timely help; applications to assistance, tutoring; Learning

Multi-Agent Shared Hierarchy Reinforcement Learning Neville Mehta Prasad Tadepalli School of Electrical Engineering and Computer Science Oregon State University

Oregon State University School of Electrical Engineering and Computer Science End-User Programming of Intelligent Learning Agents Prasad Tadepalli, Ron

Dr. Syama Prasad Mookerjee : A Selfless PatriotDr. Syama Prasad Mookerjee - A Selfless Patriot Dr. Syama Prasad Mookerjee - A Selfless Patriot 3 5 DR. SYAMA PRASAD MOOKERJEE A Selfless

Monte Carlo Simulation of Stochastic Processescacs.usc.edu/education/cs596/09Stochastic.pdf · Monte Carlo Simulation of Stochastic Processes MONTE CARLO METHOD • Monte Carlo

A Reinforcement Learning Approach for Product Delivery by Multiple Vehicles Scott Proper Oregon State University Prasad Tadepalli Hong TangRasaratnam Logendran

Monte Carlo Simulations in Ad-Lift Measurement Using Spark by Prasad Chalasani and Ram Sriharsha

Sai Prasad Residency Brochure Sai Prasad Residency, Kharghar

Monte Carlo approach to uncertainty analyses in forestry ... Monte Carlo... · inventory Monte Carlo Small or large ... To do Monte Carlo simulations, ... Monte Carlo simulation #

Bar & Bench () Prasad @ Lalu Prasad Yadav, S/o- Late Kundan Rai, aged about 70 Years, R/o ... Krishna Kumar Prasad, 12. Lal Mohan Prasad, 13. Lalu Prasad @ Lalu Prasad Yadav

Udbhav SchoolWelcome, Mrs Vasanthi Tadepalli ! Udbhav School warmly welcomes Mrs Vasanthi Tadepalli as the new Head Mistress. Mrs.Vasanthi joins us with 25 years of experience in reputed

(PRASAD (Drury)

YEAR CLUB - Oregon State University · June Padman Petri Pohjanpelto Gregory Rorrer Richard Roseberg Valerie Rosenberg Teresa Sawyer Ted Simpson Steven Smith Prasad Tadepalli Janet

Using Trajectory Data to Improve Bayesian …...Palo Alto, CA 94304 USA Alan Fern [email protected] Prasad Tadepalli [email protected] School of Electrical Engineering

Results of IPC 2008: Learning Track Minh Do Organizers: Alan Fern, Prasad Tadepalli, Roni Khardon

Learning First-Order Probabilistic Models with Combining Rules Sriraam Natarajan Prasad Tadepalli Eric Altendorf Thomas G. Dietterich Alan Fern Angelo

ANOMALY DETECTION Scholar: Andrew Emmott Focus: Machine Learning Advisors: Tom Dietterich, Prasad Tadepalli Donors: Leslie and Mark Workman Acknowledgements:

Monte Carlo and quasi-Monte Carlo integration

Computer Science Orientation Prasad Tadepalli Computer Science Head Graduate Advisor

User-Initiated Learning (UIL) Kshitij Judah, Tom Dietterich, Alan Fern, Jed Irvine, Michael Slater, Prasad Tadepalli, Oliver Brdiczka, Jim Thornton, Jim

prasad vidolkar

VLS Finance Ltd. (CIN:L65910DL1986PLC023129) Statement of … Unpaid Dividend_2016-2017.pdf · 11 IN30102220383802 A N V PRASAD 2 1 33 OPP Z P HIGH SCHOOL A N S BUILDING TADEPALLI

A Decision-Theoretic Model of Assistance - Evaluation, Extension and Open Problems Sriraam Natarajan, Kshitij Judah, Prasad Tadepalli and Alan Fern School

Adaptive Multilevel Monte Carlo Simulation of Stochastic …€¦ · 4 Adaptive multilevel Monte Carlo. Formulation of SDE approximationSingle level Monte CarloMultilevel Monte CarloAdaptive

Monte Carlo Method - Monte Carlo Simulation · Monte Carlo Method Monte Carlo Simulation Peter Frank Perroni December 1, 2015 Peter Frank Perroni Monte Carlo Method