intelligent agents - indian institute of technology...

32
Intelligent Agents CSL 302 ARTIFICIAL INTELLIGENCE SPRING 2014

Upload: others

Post on 29-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Intelligent AgentsCSL 302 ARTIFICIAL INTELLIGENCE

SPRING 2014

Page 2: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Definition of AI

1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 2

Acting Rationally

rational behavior = doing the right thing

Encompasses the other lines of thought.oThinking rationally will help to act rationally, but is not the

only means; Eg: Reflex

Goal: building rational agents

Thinking humanly Thinking rationally

Acting humanly Acting rationally

Page 3: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

AgentEnvironment

Agent

Perc

epti

on A

ction

What should I do next?

1/17/2014 3CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 4: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Agent Functions and ProgramAgent behavior is described by the agent function that maps a percept sequences to actions.

Lookup Table – An action for every possible percept sequence.

Agent Program: realization/concrete implementation of the agent function within some physical system.

1/17/2014 4CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 5: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Vacuum World

1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 5

Page 6: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Rational AgentsA rational agent does the right thing(action)

Without loss of generality, “goals” specifiable by performance measure defining a numerical value for any environment history

Rational Action: that maximizes the expected value of the performance measure given the percept sequence to data and prior knowledge

Rationality ≠ Omniscience

Rationality ≠ Successful

Rationality ≠ Clairvoyant

Rationality ≠ Intentionally no Sensing

1/17/2014 6CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 7: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

PEAS – Specifying the Task EnvironmentMust specify the task environment as fully as possible

oPerformance

oEnvironment

oActuator

oSensors

Task Environment for automated taxi driver?

1/17/2014 7CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 8: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

PEAS – Specifying the Task EnvironmentMust specify the task environment as fully as possible

oPerformance- safe, fast, comfortable

oEnvironment-roads, other traffic, traffic signals

oActuator-steering, accelerator, brake, horn, signal

oSensors-video camera, IR sensor, GPS, odometer

Task Environment for automated taxi driver?

1/17/2014 8CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 9: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

PEAS – Specifying the Task EnvironmentHow does the following affect the complexity of the problem the rational agent faces?

oPerformance – complex goals makes performance harder to achieve?

oEnvironment

oActuator – Lack of effectors makes performance harder to achieve?

oSensors – Lack of percepts makes performance harder to achieve?

1/17/2014 9CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 10: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Intelligent Agents17/1

1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 10

Homework 1 is due on Monday 20-1-2014.

Page 11: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

PEAS – Specifying the Task EnvironmentMust specify the task environment as fully as possible

oPerformance

oEnvironment

oActuator

oSensors

1/17/2014 11CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 12: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Properties of the Task Environment

Environment

Agent

Perc

epti

on A

ction

What should I do next?

Static vs. DynamicPartially vs. Fully Observable

Deterministic vs. Stochastic

Instantaneousvs. Durative

Full vs. Partial Satisfaction

Discrete vs. Continuous

Single vs. Multiple Agents

Episodic vs. Sequential

1/17/2014 12CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 13: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Properties of the Task Environment

1/17/2014 13CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Observable: The agent can “sense” its environment

obest: fully observable worst: unobservable typical: partially observable

Deterministic: The actions have predictable effects

obest: deterministic worst: non-deterministic typical: stochastic

Static: The world evolves only because of agents’ actions

obest: static worst: dynamic typical: quasi-static

Episodic: The performance of the agent is determined episodically

obest: episodic worst: non-episodic

Discrete: The environment evolves through a discrete set of states

obest: discrete worst: continuous typical: hybrid

Agents: # of agents in the environment; are they competing or cooperating?

Page 14: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess

1/17/2014 14CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 15: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

1/17/2014 15CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 16: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker

1/17/2014 16CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 17: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker Partial Stochastic Static Sequential Discrete Multi

1/17/2014 17CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 18: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker Partially Stochastic Static Sequential Discrete Multi

Taxi Driving

1/17/2014 18CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 19: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker Partially Stochastic Static Sequential Discrete Multi

Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi

1/17/2014 19CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 20: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker Partially Stochastic Static Sequential Discrete Multi

Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi

M-Diagnosis

1/17/2014 20CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 21: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker Partially Stochastic Static Sequential Discrete Multi

Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi

M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single

1/17/2014 21CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 22: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker Partially Stochastic Static Sequential Discrete Multi

Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi

M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single

I-Analysis

1/17/2014 22CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 23: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker Partially Stochastic Static Sequential Discrete Multi

Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi

M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single

I-Analysis Fully Deterministic Semi Episodic Continuous Single

1/17/2014 23CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 24: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker Partially Stochastic Static Sequential Discrete Multi

Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi

M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single

I-Analysis Fully Deterministic Semi Episodic Continuous Single

Inter. Tutor

1/17/2014 24CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 25: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker Partially Stochastic Static Sequential Discrete Multi

Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi

M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single

I-Analysis Fully Deterministic Semi Episodic Continuous Single

Inter. Tutor Partially Stochastic Dynamic Sequential Discrete Multi

1/17/2014 25CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

Page 26: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents

Chess Fully Deterministic Semi Sequential Discrete Multi

Poker Partially Stochastic Static Sequential Discrete Multi

Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi

M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single

I-Analysis Fully Deterministic Semi Episodic Continuous Single

Inter. Tutor Partially Stochastic Dynamic Sequential Discrete Multi

1/17/2014 26CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR

The real world is inaccessible, stochastic, dynamic and continuousHow do we handle it then?

Page 27: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Types of AgentsTypes of agents (increasing in generality and ability to handle complex environments)oSimple reflex agents

oModel based reflex agents

oGoal-based agents

oUtility-based agents

oLearning agents

1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 27

Page 28: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Simple Reflex Agents

1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 28

Page 29: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Model Based Reflex Agents

1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 29

State Estimation

Page 30: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Goal Based Agents

1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 30

State Estimation

Search/Planning

Search: process of looking for a sequence of actions that reaches the goal statePlanning: can be viewed as search in a structured environment.

Page 31: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Utility Based Agents

1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 31

• Utility function: internalization of the performance measure• Conflicting goals• Multiple uncertain goals• Decision theoretic planning

Page 32: Intelligent Agents - Indian Institute of Technology Roparcse.iitrpr.ac.in/ckn/courses/s2014/ia.pdf · 2017. 3. 21. · Intelligent Agents 17/1 1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE,

Learning Agents

1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 32