CS344M Autonomous Multiagent Systems Todd Hester Department of Computer Science The University of Texas at Austin

Good Afternoon, Colleagues

Are there any questions?

Logistics

• Progress reports due in 2 weeks

• Readings for next week

Game Theory Premises

• Simultaneous actions

• No communication

• Outcome depends on combination of actions

• Utility (payoff) encapsulates everything about preferences over outcomes

Solution Concepts

• Dominant strategy

• Nash equilibrium

• Pareto optimality

• Maximum social welfare

• Maximin strategy

Prisoner’s Dilemma

                Column
             C(1)    D(2)
Row   C(1)   3,3     0,5
      D(2)   5,0     1,1
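
The solution concepts listed earlier can be checked mechanically on this matrix. The following is a minimal sketch, not part of the original slides: the names `PD`, `pure_nash`, and `dominant_row_action` are illustrative, "dominant" is taken in the weak sense (a best response to every opponent action), and only pure strategies are considered.

```python
from itertools import product

# Payoffs from the Prisoner's Dilemma slide: (row payoff, column payoff).
# Action index 0 = C, 1 = D.
PD = {
    (0, 0): (3, 3), (0, 1): (0, 5),
    (1, 0): (5, 0), (1, 1): (1, 1),
}

def pure_nash(game, n_actions=2):
    """Return all action pairs where neither player gains by deviating alone."""
    equilibria = []
    for r, c in product(range(n_actions), repeat=2):
        row_ok = all(game[(r, c)][0] >= game[(alt, c)][0] for alt in range(n_actions))
        col_ok = all(game[(r, c)][1] >= game[(r, alt)][1] for alt in range(n_actions))
        if row_ok and col_ok:
            equilibria.append((r, c))
    return equilibria

def dominant_row_action(game, n_actions=2):
    """Return a row action that is a best response to every column action, or None."""
    for r in range(n_actions):
        if all(game[(r, c)][0] >= game[(alt, c)][0]
               for c in range(n_actions) for alt in range(n_actions)):
            return r
    return None

print(pure_nash(PD))            # [(1, 1)], i.e. both players defect
print(dominant_row_action(PD))  # 1, i.e. D
```

Under these assumptions the only pure equilibrium is D,D, even though C,C pays both players more.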

Chicken

                Column
             C(1)    D(2)
Row   C(1)   3,3     1,5
      D(2)   5,1     0,0

Bach/Stravinsky

• My wife and I agree to meet at a concert

• Unfortunately, there are 2: Bach and Stravinsky

• No time to get in touch with each other

• I prefer Stravinsky, she prefers Bach

• But most of all, we want to be together

– If not, so distraught we don’t care what we’re listening to

• Propose a payoff matrix

Bach/Stravinsky

           Wife
         S      B
Me  S    2,1    0,0
    B    0,0    1,2

Nash Equilibrium

• Does every game have a pure strategy Nash equilibrium?

Matching Pennies

• We each put a penny down covered

• If they match, I win, if they don’t, you win

                 Player 2
               H       T
Player 1  H    1,-1    -1,1
          T    -1,1    1,-1

Nash equilibrium?
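
One way to explore this question is to test every cell of the matrix for a profitable deviation, then look at what happens when the opponent randomizes. This is a sketch under stated assumptions: `MP`, `is_pure_nash`, and `p1_payoffs_vs` are illustrative names, and the 50/50 mix is a candidate being checked, not something stated on the slide.

```python
from fractions import Fraction

# Matching Pennies payoffs from the slide: (Player 1, Player 2); index 0 = H, 1 = T.
MP = {
    (0, 0): (1, -1), (0, 1): (-1, 1),
    (1, 0): (-1, 1), (1, 1): (1, -1),
}

def is_pure_nash(game, r, c):
    """True if neither player can gain by switching actions unilaterally."""
    row_ok = all(game[(r, c)][0] >= game[(alt, c)][0] for alt in (0, 1))
    col_ok = all(game[(r, c)][1] >= game[(r, alt)][1] for alt in (0, 1))
    return row_ok and col_ok

pure = [(r, c) for r in (0, 1) for c in (0, 1) if is_pure_nash(MP, r, c)]
print(pure)  # [] : every cell has a player who wants to switch

def p1_payoffs_vs(q_heads):
    """Player 1's expected payoff for H and for T when Player 2 plays H with probability q_heads."""
    return tuple(q_heads * MP[(a, 0)][0] + (1 - q_heads) * MP[(a, 1)][0] for a in (0, 1))

half = Fraction(1, 2)
print(p1_payoffs_vs(half))  # both entries are 0, so Player 1 is indifferent
```

By the symmetry of the matrix the same calculation applies to Player 2, which suggests that both players mixing 50/50 is the equilibrium to look for.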

Nash Equilibrium

• Every finite game has at least one Nash equilibrium, possibly in mixed strategies

– Nobel prize and academy award!

• Finding one is PPAD-complete, even for two players; whether it can be done in polynomial time is not known

Some theory

• Prove that if each player plays a dominant strategy, the result is a Nash equilibrium

• Are all Nash equilibria the result of playing dominant strategies?

• Is the outcome of a Nash equilibrium necessarily Pareto optimal?

• Is a Pareto optimal outcome necessarily the result of Nash equilibrium strategies?

• Is the maximum social welfare outcome necessarily Pareto optimal?

• If both players play maximin, is it necessarily a Nash equilibrium?
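
These questions can be probed by brute force on the Prisoner's Dilemma and Chicken matrices from the earlier slides. The sketch below is an illustration, not a proof: all function names are hypothetical, only pure strategies are enumerated, and social welfare is assumed to mean the sum of the two payoffs.

```python
from itertools import product

ACTIONS = (0, 1)  # 0 = C, 1 = D

# (row payoff, column payoff) tables copied from the earlier slides.
PD = {(0, 0): (3, 3), (0, 1): (0, 5), (1, 0): (5, 0), (1, 1): (1, 1)}
CHICKEN = {(0, 0): (3, 3), (0, 1): (1, 5), (1, 0): (5, 1), (1, 1): (0, 0)}

def pure_nash(game):
    """Action pairs where neither player gains by deviating alone."""
    return [(r, c) for r, c in product(ACTIONS, repeat=2)
            if all(game[(r, c)][0] >= game[(a, c)][0] for a in ACTIONS)
            and all(game[(r, c)][1] >= game[(r, a)][1] for a in ACTIONS)]

def pareto_optimal(game):
    """Outcomes not beaten by another outcome that is no worse for both and better for one."""
    def dominated(x):
        return any(all(game[y][i] >= game[x][i] for i in (0, 1))
                   and any(game[y][i] > game[x][i] for i in (0, 1))
                   for y in game)
    return [x for x in sorted(game) if not dominated(x)]

def max_welfare(game):
    """Outcomes maximizing the sum of payoffs (assumed definition of social welfare)."""
    best = max(sum(v) for v in game.values())
    return [x for x in sorted(game) if sum(game[x]) == best]

def pure_maximin(game):
    """Each player's pure action that maximizes their own worst-case payoff."""
    row = max(ACTIONS, key=lambda r: min(game[(r, c)][0] for c in ACTIONS))
    col = max(ACTIONS, key=lambda c: min(game[(r, c)][1] for r in ACTIONS))
    return (row, col)

print(pure_nash(PD), pareto_optimal(PD))           # D,D is Nash but not Pareto optimal
print(max_welfare(PD))                             # C,C
print(pure_maximin(CHICKEN), pure_nash(CHICKEN))   # maximin gives C,C, which is not Nash
```

A single counterexample settles a "necessarily" question in the negative; the questions whose answer is yes still need an argument, which enumeration on two games cannot supply.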

Activity

                        Player 2
                    Rock    Paper   Scissors
Player 1  Rock      0,0     -1,1    1,-1
          Paper     1,-1    0,0     -1,1
          Scissors  -1,1    1,-1    0,0

Mixed strategy equilibrium

                        Player 2
                    Action 1   Action 2
Player 1  Action 1    4,8        2,0
          Action 2    6,2        0,8

• What if player 2 picks action 1 3/4 of the time?

• What if player 2 picks action 1 1/4 of the time?

• Player 1 must be indifferent between actions 1 and 2

• Player 2 must be indifferent between actions 1 and 2

Do actual numbers matter?
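
The indifference conditions above can be worked through numerically. The following is a minimal sketch using exact fractions: `A`, `B`, `q_star`, and `p_star` are illustrative names, with q denoting the probability that player 2 picks action 1 and p the probability that player 1 picks action 1.

```python
from fractions import Fraction

# Payoffs from the slide, indexed [player 1 action][player 2 action] (index 0 = Action 1).
A = [[4, 2], [6, 0]]  # player 1's payoffs
B = [[8, 0], [2, 8]]  # player 2's payoffs

def p1_expected(q):
    """Player 1's expected payoff for each action when player 2 plays action 1 with probability q."""
    return [q * A[a][0] + (1 - q) * A[a][1] for a in (0, 1)]

def p2_expected(p):
    """Player 2's expected payoff for each action when player 1 plays action 1 with probability p."""
    return [p * B[0][b] + (1 - p) * B[1][b] for b in (0, 1)]

def q_star(a_pay):
    """Solve q*a00 + (1-q)*a01 == q*a10 + (1-q)*a11 for q (player 1 indifferent)."""
    return Fraction(a_pay[1][1] - a_pay[0][1],
                    a_pay[0][0] - a_pay[0][1] - a_pay[1][0] + a_pay[1][1])

def p_star(b_pay):
    """Solve p*b00 + (1-p)*b10 == p*b01 + (1-p)*b11 for p (player 2 indifferent)."""
    return Fraction(b_pay[1][1] - b_pay[1][0],
                    b_pay[0][0] - b_pay[0][1] - b_pay[1][0] + b_pay[1][1])

print(p1_expected(Fraction(3, 4)))  # 7/2 vs 9/2: player 1 prefers action 2
print(p1_expected(Fraction(1, 4)))  # 5/2 vs 3/2: player 1 prefers action 1

q, p = q_star(A), p_star(B)
print(q, p)                         # 1/2 3/7
print(p1_expected(q), p2_expected(p))
```

Each player's mixing probability is determined by the other player's payoffs, and the payoff values enter both formulas directly, which is one angle on the slide's closing question.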

Rock/Paper/Scissors

• Nash equilibrium?

• Why is anything else not an equilibrium?

• Rock Paper Scissors tournament

• Poker
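
For the first two questions, a quick calculation shows why the uniform mix is the natural candidate and why a lopsided mix is exploitable. This is a sketch, not from the slides: `RPS` repeats Player 1's payoffs from the Activity matrix, and the rock-heavy opponent is a made-up example.

```python
from fractions import Fraction

# Player 1's payoffs from the Activity slide; rows and columns ordered Rock, Paper, Scissors.
# The game is zero-sum, so Player 2's payoff is the negation.
RPS = [[0, -1, 1],
       [1, 0, -1],
       [-1, 1, 0]]

def p1_payoffs_against(col_mix):
    """Player 1's expected payoff for each pure action against a mixed Player 2 strategy."""
    return [sum(p * RPS[a][b] for b, p in enumerate(col_mix)) for a in range(3)]

uniform = [Fraction(1, 3)] * 3
print(p1_payoffs_against(uniform))  # all zero: no action does better than any other

# Hypothetical opponent who plays Rock half the time.
biased = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 4)]
print(p1_payoffs_against(biased))   # Paper earns 1/4, so always playing Paper exploits the bias
```

Against the uniform mix no deviation helps, while any bias hands the opponent a strictly better reply. That is the usual argument for why only the uniform mix survives as an equilibrium.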

Discussion

• What is an example game within robot soccer?

                   Goalie
                   Block
               Right    Left
Kicker  Left   1,-1     -1,1
        Right  -1,1     1,-1

• Can we use game theory to devise better strategies?

Correlated Equilibria

Sometimes mixing isn’t enough: Bach/Stravinsky

           Wife
         S      B
Me  S    2,1    0,0
    B    0,0    1,2

Want only S,S or B,B - 50% each
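
To see why mixing falls short here, compare the independent mixed equilibrium with the 50/50 correlated plan. The sketch below makes several assumptions: the mixed probabilities (I play S with probability 2/3, my wife with probability 1/3) come from solving the indifference conditions for this matrix, the shared signal is imagined as something like a public coin flip, and `expected` and `obeys` are illustrative helper names.

```python
from fractions import Fraction

# Bach/Stravinsky payoffs from the slide: (Me, Wife); index 0 = S, 1 = B.
BS = {(0, 0): (2, 1), (0, 1): (0, 0), (1, 0): (0, 0), (1, 1): (1, 2)}

def expected(dist):
    """Expected (Me, Wife) payoffs under a joint distribution over action pairs."""
    return tuple(sum(pr * BS[x][i] for x, pr in dist.items()) for i in (0, 1))

# Independent mixing that leaves each player indifferent (assumed, see lead-in).
p, q = Fraction(2, 3), Fraction(1, 3)
mixed = {(0, 0): p * q, (0, 1): p * (1 - q),
         (1, 0): (1 - p) * q, (1, 1): (1 - p) * (1 - q)}

# Correlated plan from the slide: only S,S or B,B, 50% each.
correlated = {(0, 0): Fraction(1, 2), (1, 1): Fraction(1, 2)}

def obeys(dist):
    """True if no player gains by deviating from any recommended action."""
    for player in (0, 1):
        for rec in (0, 1):
            for dev in (0, 1):
                gain = 0
                for (a, b), pr in dist.items():
                    mine, other = (a, b) if player == 0 else (b, a)
                    if mine != rec:
                        continue
                    follow = BS[(a, b)][player]
                    deviate = BS[(dev, other) if player == 0 else (other, dev)][player]
                    gain += pr * (deviate - follow)
                if gain > 0:
                    return False
    return True

print(expected(mixed))       # 2/3 each: the couple often ends up apart
print(expected(correlated))  # 3/2 each
print(obeys(correlated))     # True: following the shared signal is self-enforcing
```

With independent mixing the two miss each other with probability 5/9, which is why the expected payoff drops below what either pure equilibrium gives the less-favored player.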
