intelligent agents - indian institute of technology...
TRANSCRIPT
Intelligent AgentsCSL 302 ARTIFICIAL INTELLIGENCE
SPRING 2014
Definition of AI
1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 2
Acting Rationally
rational behavior = doing the right thing
Encompasses the other lines of thought.oThinking rationally will help to act rationally, but is not the
only means; Eg: Reflex
Goal: building rational agents
Thinking humanly Thinking rationally
Acting humanly Acting rationally
AgentEnvironment
Agent
Perc
epti
on A
ction
What should I do next?
1/17/2014 3CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Agent Functions and ProgramAgent behavior is described by the agent function that maps a percept sequences to actions.
Lookup Table – An action for every possible percept sequence.
Agent Program: realization/concrete implementation of the agent function within some physical system.
1/17/2014 4CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Vacuum World
1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 5
Rational AgentsA rational agent does the right thing(action)
Without loss of generality, “goals” specifiable by performance measure defining a numerical value for any environment history
Rational Action: that maximizes the expected value of the performance measure given the percept sequence to data and prior knowledge
Rationality ≠ Omniscience
Rationality ≠ Successful
Rationality ≠ Clairvoyant
Rationality ≠ Intentionally no Sensing
1/17/2014 6CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
PEAS – Specifying the Task EnvironmentMust specify the task environment as fully as possible
oPerformance
oEnvironment
oActuator
oSensors
Task Environment for automated taxi driver?
1/17/2014 7CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
PEAS – Specifying the Task EnvironmentMust specify the task environment as fully as possible
oPerformance- safe, fast, comfortable
oEnvironment-roads, other traffic, traffic signals
oActuator-steering, accelerator, brake, horn, signal
oSensors-video camera, IR sensor, GPS, odometer
Task Environment for automated taxi driver?
1/17/2014 8CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
PEAS – Specifying the Task EnvironmentHow does the following affect the complexity of the problem the rational agent faces?
oPerformance – complex goals makes performance harder to achieve?
oEnvironment
oActuator – Lack of effectors makes performance harder to achieve?
oSensors – Lack of percepts makes performance harder to achieve?
1/17/2014 9CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Intelligent Agents17/1
1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 10
Homework 1 is due on Monday 20-1-2014.
PEAS – Specifying the Task EnvironmentMust specify the task environment as fully as possible
oPerformance
oEnvironment
oActuator
oSensors
1/17/2014 11CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Properties of the Task Environment
Environment
Agent
Perc
epti
on A
ction
What should I do next?
Static vs. DynamicPartially vs. Fully Observable
Deterministic vs. Stochastic
Instantaneousvs. Durative
Full vs. Partial Satisfaction
Discrete vs. Continuous
Single vs. Multiple Agents
Episodic vs. Sequential
1/17/2014 12CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Properties of the Task Environment
1/17/2014 13CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Observable: The agent can “sense” its environment
obest: fully observable worst: unobservable typical: partially observable
Deterministic: The actions have predictable effects
obest: deterministic worst: non-deterministic typical: stochastic
Static: The world evolves only because of agents’ actions
obest: static worst: dynamic typical: quasi-static
Episodic: The performance of the agent is determined episodically
obest: episodic worst: non-episodic
Discrete: The environment evolves through a discrete set of states
obest: discrete worst: continuous typical: hybrid
Agents: # of agents in the environment; are they competing or cooperating?
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess
1/17/2014 14CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
1/17/2014 15CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker
1/17/2014 16CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker Partial Stochastic Static Sequential Discrete Multi
1/17/2014 17CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker Partially Stochastic Static Sequential Discrete Multi
Taxi Driving
1/17/2014 18CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker Partially Stochastic Static Sequential Discrete Multi
Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi
1/17/2014 19CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker Partially Stochastic Static Sequential Discrete Multi
Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi
M-Diagnosis
1/17/2014 20CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker Partially Stochastic Static Sequential Discrete Multi
Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi
M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single
1/17/2014 21CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker Partially Stochastic Static Sequential Discrete Multi
Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi
M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single
I-Analysis
1/17/2014 22CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker Partially Stochastic Static Sequential Discrete Multi
Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi
M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single
I-Analysis Fully Deterministic Semi Episodic Continuous Single
1/17/2014 23CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker Partially Stochastic Static Sequential Discrete Multi
Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi
M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single
I-Analysis Fully Deterministic Semi Episodic Continuous Single
Inter. Tutor
1/17/2014 24CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker Partially Stochastic Static Sequential Discrete Multi
Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi
M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single
I-Analysis Fully Deterministic Semi Episodic Continuous Single
Inter. Tutor Partially Stochastic Dynamic Sequential Discrete Multi
1/17/2014 25CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
Task Environment-ExamplesEnvironment Observable Deterministic Static Episodic Discrete # Agents
Chess Fully Deterministic Semi Sequential Discrete Multi
Poker Partially Stochastic Static Sequential Discrete Multi
Taxi Driving Partially Stochastic Dynamic Sequential Continuous Multi
M-Diagnosis Partially Stochastic Dynamic Sequential Continuous Single
I-Analysis Fully Deterministic Semi Episodic Continuous Single
Inter. Tutor Partially Stochastic Dynamic Sequential Discrete Multi
1/17/2014 26CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR
The real world is inaccessible, stochastic, dynamic and continuousHow do we handle it then?
Types of AgentsTypes of agents (increasing in generality and ability to handle complex environments)oSimple reflex agents
oModel based reflex agents
oGoal-based agents
oUtility-based agents
oLearning agents
1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 27
Simple Reflex Agents
1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 28
Model Based Reflex Agents
1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 29
State Estimation
Goal Based Agents
1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 30
State Estimation
Search/Planning
Search: process of looking for a sequence of actions that reaches the goal statePlanning: can be viewed as search in a structured environment.
Utility Based Agents
1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 31
• Utility function: internalization of the performance measure• Conflicting goals• Multiple uncertain goals• Decision theoretic planning
Learning Agents
1/17/2014 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 32