Agent Architectures · 1. Reflex (reactive) Agent 2. Model-based Reflex Agent 3....
![Page 1: AgentArchitectures · 1. Reflex(reactive)Agent 2. Model-basedReflexAgent 3. Model-basedGoal-basedAgent 4. Model-basedUtility-basedAgent 5. Learning-basedAgent (RussellandNorvig)](https://reader035.vdocument.in/reader035/viewer/2022071214/604301bac7f0c2483d62aaf5/html5/thumbnails/1.jpg)
Agent Architectures
BE4M36MAS - Multiagent systems
Organization
Distant Teaching
• Active participation still expected!
• Don’t be afraid to ask
• Un-mute and speak, that’s it.
• You can use the chat if you are shy.
• Preferably, connect with video.
Tutors
David Fiedler (Tutorial 1–3) [email protected]
Dominik Andreas Seitz [email protected]
Michal Šustr [email protected]
Aditya Aradhye (Tutorial 12–14)
Website: https://cw.fel.cvut.cz/wiki/courses/be4m36mas/start
Course Outline
Agent Programming
• 22 Sept: Introduction to multi-agent systems (Pěchouček)
• 29 Sept: Agent Architectures. Belief-Desire-Intention architecture (Pěchouček/Jakob)
Non-cooperative Game Theory
• 6 Oct: Introduction to Game Theory (Bošanský)
• 13 Oct: Solving Normal-form Games (Bošanský)
• 20 Oct: Games in Extensive Form (Bošanský)
• 27 Oct: Solving Extensive-Form Games (Bošanský)
• 3 Nov: Other Game Representations (Bošanský)
Multiagent Resource Allocation
• 10 Nov: Multiagent Resource Allocation (Jakob)
Auctions
• 24 Nov: Auctions 1 (Jakob)
• 1 Dec: Auctions 2 (Jakob)
Coalitional Game Theory
• 8 Dec: Coalitional Game Theory 1 (Kroupa)
• 15 Dec: Coalitional Game Theory 2 (Kroupa)
• 22 Dec: Coalitional Game Theory 3 (Kroupa)
Social Choice, Voting
• 5 Jan: Social Choice, Voting (Kroupa)
Tutorials
Attendance: voluntary (but tracked)
Assessment – 3 assignments:
1. Agent programming (max 14 pts)
2. Game theory (max 14 pts)
3. Coalitional game theory (12 pts)
You have to obtain at least 20 points. Plagiarism is strictly forbidden (severe penalties will be applied).
Agent architectures
Components of agent architectures
Actions (A): Ways for the agent to influence the environment
Percepts (P): Observations about the state of the world
Decision making (d : P∗ → A): Mapping perception history to actions
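The mapping d : P∗ → A can be sketched in Python as a function that receives the whole percept history and returns one action. This is only an illustrative sketch: the percept and action labels ("breeze", "forward", "back") and the helper names `cautious` and `run` are assumptions, not part of the course material.

```python
from typing import Callable, List, Sequence

Percept = str   # hypothetical percept labels, e.g. "breeze"
Action = str    # hypothetical action labels, e.g. "forward"

# d : P* -> A, a decision function over the whole percept history
DecisionFunction = Callable[[Sequence[Percept]], Action]


def cautious(history: Sequence[Percept]) -> Action:
    """Toy decision rule: retreat if a breeze was ever perceived."""
    return "back" if "breeze" in history else "forward"


def run(d: DecisionFunction, percepts: Sequence[Percept]) -> List[Action]:
    """Feed percepts one by one; the agent always sees the growing history."""
    history: List[Percept] = []
    actions: List[Action] = []
    for p in percepts:
        history.append(p)
        actions.append(d(history))
    return actions


print(run(cautious, ["none", "breeze", "none"]))  # ['forward', 'back', 'back']
```

The third action is still "back" because the decision depends on the full history, not just on the latest percept.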
Architecture types
1. Reflex (reactive) Agent
2. Model-based Reflex Agent
3. Model-based Goal-based Agent
4. Model-based Utility-based Agent
5. Learning-based Agent
(Russell and Norvig)
Let’s Play with Agents!
Wumpus’ World
• Grid world environment
• Agent has to find the gold brick and carry it to the bottom left square
• Problem: Entering a square occupied by Wumpus or containing a pit costs the agent his life (Wumpus does not move)
Wumpus’ World: Percepts
• Breeze: whenever the agent stands next to a pit
• Stench: whenever the agent stands next to Wumpus
• Gold: when the agent carries a gold brick
Wumpus’ World: Actions
• Going to any neighboring square (only vertically and horizontally)
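The environment described so far can be sketched as a small Python model. The grid size, the coordinates (with (0, 0) taken as the bottom left square) and the names `World`, `neighbors`, `percepts` and `is_deadly` are assumptions made for illustration; the tutorial's actual framework may look different.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class World:
    size: int          # the grid is size x size; (0, 0) is assumed bottom left
    pits: frozenset    # squares containing a pit
    wumpus: tuple      # Wumpus' square (it never moves)
    gold: tuple        # square holding the gold brick


def neighbors(world, pos):
    """Squares reachable in one move: vertical and horizontal only."""
    x, y = pos
    candidates = [(x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)]
    return [(a, b) for a, b in candidates
            if 0 <= a < world.size and 0 <= b < world.size]


def percepts(world, pos, carrying_gold):
    """Percepts available to the agent standing on pos."""
    near = neighbors(world, pos)
    result = set()
    if any(n in world.pits for n in near):
        result.add("breeze")
    if world.wumpus in near:
        result.add("stench")
    if carrying_gold:
        result.add("gold")
    return result


def is_deadly(world, pos):
    """Entering a pit or Wumpus' square costs the agent his life."""
    return pos in world.pits or pos == world.wumpus


world = World(size=4, pits=frozenset({(2, 0)}), wumpus=(0, 2), gold=(1, 2))
print(percepts(world, (1, 0), False))  # {'breeze'}
print(percepts(world, (0, 1), False))  # {'stench'}
```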
Reflex agent
The agent conditions his decision solely on his current percepts (i.e. on the facts he can currently sense).
Task: Implement a reflex agent for Wumpus’ world. Beware, do not use any kind of memory or smarter reasoning ;-)
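A possible shape of such an agent, as a hedged sketch rather than a reference solution: the function sees only the current percept set and keeps no state between calls. The action names in `MOVES` and the concrete rules are assumptions chosen for illustration.

```python
import random

MOVES = ["up", "down", "left", "right"]  # hypothetical action names


def reflex_agent(percepts, rng=random):
    """Pick a move from the current percepts only; nothing is remembered."""
    if "gold" in percepts:
        # Carrying the gold: drift toward the bottom left square.
        return rng.choice(["down", "left"])
    if "breeze" in percepts or "stench" in percepts:
        # Danger is adjacent, but without memory the agent cannot tell
        # which neighbor is safe; this rule simply heads back toward the start.
        return rng.choice(["down", "left"])
    return rng.choice(MOVES)


print(reflex_agent({"breeze"}))
```

Because the agent cannot remember where it came from, a rule such as the second one is a guess and may still lead into a pit.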
Model-based reflex agent
Agent uses percepts to gradually build a model of the environment.
Decisions are based on the expected state of the world according to his model.
Question: Does this approach allow us to overcome this issue?
Task: Implement a model-based agent and reach the gold!
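One way to approach this task is sketched below, under stated assumptions: the agent knows its own position, the model is just a set of squares believed safe, and a square with neither breeze nor stench proves all of its neighbors safe. The class and method names are hypothetical, and returning `None` when no known-safe move exists is a choice of this sketch.

```python
class ModelBasedReflexAgent:
    def __init__(self, size):
        self.size = size
        self.safe = set()      # model: squares believed to be safe
        self.visited = set()   # squares the agent has already stood on

    def neighbors(self, pos):
        x, y = pos
        candidates = [(x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)]
        return [(a, b) for a, b in candidates
                if 0 <= a < self.size and 0 <= b < self.size]

    def update(self, pos, percepts):
        """Fold the current percepts into the model."""
        self.visited.add(pos)
        self.safe.add(pos)
        if "breeze" not in percepts and "stench" not in percepts:
            # No pit and no Wumpus next to us, so every neighbor is safe.
            self.safe.update(self.neighbors(pos))

    def act(self, pos, percepts):
        """Return the next square: prefer safe unvisited, then any safe one."""
        self.update(pos, percepts)
        options = [n for n in self.neighbors(pos) if n in self.safe]
        unvisited = [n for n in options if n not in self.visited]
        if unvisited:
            return unvisited[0]
        return options[0] if options else None  # None: no known-safe move


agent = ModelBasedReflexAgent(size=4)
print(agent.act((0, 0), set()))        # (1, 0)
print(agent.act((1, 0), {"breeze"}))   # (0, 0)
print(agent.act((0, 0), set()))        # (0, 1)
```

The agent steps back from the breezy square and then explores the other safe neighbor, which a memoryless reflex agent could not do.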
Model-based reflex agent
Question: Is the behaviour of the agent rational?
Definitely not!
Agent just exploits the model to stay alive. He does not intentionally pursue his goal.
Model-based Goal-based agent
Actions are chosen in order to reach a declaratively specified goal.
Techniques:
1. Planning (Planning in AI)
2. Belief-Desire-Intention Architecture (this course)
Question: What does it mean for an agent in Wumpus’ world?
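One possible reading for Wumpus’ world: once the agent holds the gold, the goal "stand on the bottom left square" is stated explicitly and a planner searches for a route through squares the model marks as safe. The sketch below uses breadth-first search as a simple stand-in for a planner; the function name `plan_path` and the example set of safe squares are assumptions.

```python
from collections import deque


def plan_path(start, goal, safe):
    """Shortest path from start to goal using only squares in `safe`."""
    frontier = deque([start])
    parent = {start: None}
    while frontier:
        current = frontier.popleft()
        if current == goal:
            path = []
            while current is not None:   # walk back to the start
                path.append(current)
                current = parent[current]
            return path[::-1]
        x, y = current
        for nxt in [(x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)]:
            if nxt in safe and nxt not in parent:
                parent[nxt] = current
                frontier.append(nxt)
    return None  # the goal is not reachable through known-safe squares


safe = {(0, 0), (0, 1), (1, 1), (1, 2)}
print(plan_path((1, 2), (0, 0), safe))  # [(1, 2), (1, 1), (0, 1), (0, 0)]
```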
Model-based Utility-based agent
Not all ways to reach the goal are equally desirable. Some should be preferred over others (e.g. cheaper or less risky ones).
Utility driven sequential decision making:
• Non-adversarial: MDPs, POMDPs (Planning in AI)
• Adversarial: Sequential games (this course)
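As a small illustration of preferring cheaper or less risky routes, the sketch below scores two candidate routes by expected utility. All numbers (gold reward, step cost, death penalty, route lengths and death probabilities) are made up for the example, and the single-shot formula is a simplification, not a full MDP solution.

```python
def expected_utility(steps, p_death, gold_reward=100.0,
                     step_cost=1.0, death_penalty=-1000.0):
    """Expected utility of a route with the given length and risk."""
    success = gold_reward - step_cost * steps
    return (1 - p_death) * success + p_death * death_penalty


# Hypothetical candidates: (number of steps, probability of dying en route)
routes = {"short_risky": (4, 0.2), "long_safe": (10, 0.0)}
best = max(routes, key=lambda name: expected_utility(*routes[name]))
print(best)  # long_safe
```

With these numbers the longer route wins: the short one saves six steps but carries a 20% chance of a large penalty.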
Learning-based agent
Agent does not fully know the task he is facing (what his actions do, what his goal is, etc.).
He learns the task on the go; a strategy reflecting these findings cannot be fixed in advance.
Learning both model and strategy.
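A minimal sketch of this idea, using the tabular Q-learning update as one well-known technique (the slides do not name a specific algorithm). The four-cell corridor, the reward at cell 3 and the systematic sweep over all state-action pairs instead of random exploration are all assumptions made to keep the example short and deterministic.

```python
from collections import defaultdict

ACTIONS = ["left", "right"]
GOAL = 3  # hypothetical corridor with cells 0..3, reward on reaching cell 3


def step(state, action):
    """Environment dynamics, hidden from the agent."""
    nxt = max(0, state - 1) if action == "left" else state + 1
    return nxt, (1.0 if nxt == GOAL else 0.0)


def learn(sweeps=100, alpha=0.5, gamma=0.9):
    q = defaultdict(float)  # learned strategy: value of each (state, action)
    model = {}              # learned model: (state, action) -> (next, reward)
    for _ in range(sweeps):
        for state in range(GOAL):        # systematic exploration
            for action in ACTIONS:
                nxt, reward = step(state, action)
                model[(state, action)] = (nxt, reward)
                best_next = max(q[(nxt, a)] for a in ACTIONS)
                target = reward + gamma * best_next
                q[(state, action)] += alpha * (target - q[(state, action)])
    return q, model


q, model = learn()
policy = {s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(GOAL)}
print(policy)
```

The agent is never told what "left" and "right" do or where the reward is; both the recorded transitions and the greedy policy come from experience alone.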
Next tutorial
• Belief-Desire-Intention architecture