two-person zero-sum games

23
1

Upload: timon-boyer

Post on 03-Jan-2016

73 views

Category:

Documents


12 download

DESCRIPTION

TWO-PERSON ZERO-SUM GAMES. 1.2 2-Person, Zero-Sum Games. Two players Total payoff = 0, thus Payoff to Player I = – Payoff to Player II Information can be summarised nicely in a simple payoff table. Convention. a i = ith action (pure strategy) for Player I - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: TWO-PERSON ZERO-SUM GAMES

1

Page 2: TWO-PERSON ZERO-SUM GAMES

2

1.2 2-Person, Zero-Sum Games

Two players Total payoff = 0, thus Payoff to Player I = – Payoff to Player II

Information can be summarised nicely in a simple payoff table.

Page 3: TWO-PERSON ZERO-SUM GAMES

3

Player II

A1 A2 An

a1 v1,1 v1,2 v1,n

a2 v2,1 v2,2 v2,n

Player I ... ... ... ...

... ... ... ...

am vm,1 vm,n

Convention

Page 4: TWO-PERSON ZERO-SUM GAMES

4

ai = ith action (pure strategy) for Player I Aj = jth action (pure strategy) for Player II

m = # of actions available to Player I n = # of actions available to Player II

vij = Payoff to Player I if they select ai and

Player II selects action Aj . Payoff to Player II = – Payoff to Player I

Page 5: TWO-PERSON ZERO-SUM GAMES

5

Example

Player 1 has options: 2 pure strategies a1 or a2. Player 2 has options: 3 pure strategies A1, A2 or A3. If player 1 plays pure strategy a1 and Player 2 plays

pure strategy A3 then Player 1 wins 25 from Player 2.

A1 A2 A3

a12 -4 25

a26 10 -20

Player 1

Player 2

Page 6: TWO-PERSON ZERO-SUM GAMES

6

Solution?

Player I Player II

Independent, unbiasedanalyst to determine what are the“best” strategies for the two players.

Page 7: TWO-PERSON ZERO-SUM GAMES

7

1.2.1 Example

A1 A2 A3

a12 -4 25

a26 10 -20

What are the best strategies for the players?

Page 8: TWO-PERSON ZERO-SUM GAMES

8

Conflict: Player 1 prefers (a1,A3) with payoff = 25 Player 2 prefers (a2,A3) with payoff = 20 But what would Player 2 do knowing

that Player 1 would go for the 25? What would Player 1 do ........?

What should the players do if they wish to “resolve” the game?

A1 A2 A3

a12 -4 25

a26 10 -20

Page 9: TWO-PERSON ZERO-SUM GAMES

9

Player I Player II

I want the maximum payoff to Player I

I want the maximum payoff to Player II

Page 10: TWO-PERSON ZERO-SUM GAMES

10

Player I Player II

I want the maximum payoff to Player I

I want the maximum payoff to Player II

Be sensible, guys!

Player I Player II

I want the maximum payoff to Player I

I want the maximum payoff to Player II

Page 11: TWO-PERSON ZERO-SUM GAMES

11

Basic Issue

Sensible = ?

Page 12: TWO-PERSON ZERO-SUM GAMES

12

Bottom Line

We need to agree on some fundamental principles to guide us in deciding what is best for the players.

There are two basic issues to be resolved: What kind of payoffs should the players expect? How do you guarantee that the players will

accept that solution?

Page 13: TWO-PERSON ZERO-SUM GAMES

13

Payoff Ideal:

Maximum possible payoff to each player Minimum Expected: Each player gets what she can

“secure”, namely the maximum that she can get regardless of what the other player does.

We look at the worst things that can happen and pick the best of these.

Conservative approach. We seek a strategy such that changing

it could achieve a worse outcome.

Page 14: TWO-PERSON ZERO-SUM GAMES

14

Philosophy

Conservative approach

We wish to be as secure as possible and not take the risk of larger losses in pursuit of larger gains.

Page 15: TWO-PERSON ZERO-SUM GAMES

15

Old example

Ideal payoff :

Player 1 =25, Player 2 = 20 Maximum SECURED Payoff:

Player 1 = –4 ; Player 2 = –6 i.e. Player 1 can make sure of doing no worse than losing

4, (regardless of what Player 2 does) by playing a1.

A1 A2 A3

a12 -4 25

a26 10 -20

James Bond

Ziegfried

Page 16: TWO-PERSON ZERO-SUM GAMES

16

1.2.2 Definition The quantity

si := min {vij: j=1,2,...,n} , i=1,2,...,m

is called the security level for Player I associated with strategy ai .

Similarly, the quantity

Sj := max {vij: i=1,2,...,m} , j=1,2,...,n

is called the security level for Player II associated with strategy Aj .

Page 17: TWO-PERSON ZERO-SUM GAMES

17

Principle I

Player I acts to maximize their security level

Player II acts to minimize their security level

Page 18: TWO-PERSON ZERO-SUM GAMES

18

1.2.3 Example

How do we compute the security levels?

A1 A2 A3

a1 1 10 3

a2 -2 5 1

a3 1 -8 1

Page 19: TWO-PERSON ZERO-SUM GAMES

19

A1 A2 A3 si

a1 1 10 3 1

a2 -2 5 1 -2

a3 1 -8 1 -8

Sj 1 10 3

Page 20: TWO-PERSON ZERO-SUM GAMES

20

A1 A2 A3 si

a1 1 10 3 1

a2 -2 5 1 -2

a3 1 -8 1 -8

Sj 1 10 3

Will the players accept this solution? We apply the same idea to another example:

Maximum security level for Player I = 1. Minimum security level for Player II = 1. Suggested solution (a1, A1) .

Page 21: TWO-PERSON ZERO-SUM GAMES

21

1.2.4 Example

The suggested solution (a2, A2) is not stable.

A1 A2 si

a1 1 5 1

a2 6 2 2

S j 6 5

Page 22: TWO-PERSON ZERO-SUM GAMES

22

1.2.2 Definition (Page 10)

A solution (ai, Aj) to a zero-sum two-person game is stable (or in equilibrium) if Player I expecting Player II to Play Aj has nothing to gain by deviating from ai

AND

Player II expecting Player I to Play ai has nothing to gain by deviating from playing Aj.

Page 23: TWO-PERSON ZERO-SUM GAMES

23

Principle IIThe players tend to strategy pairs that

are in equilibrium, i.e. stable

An optimal solution is said to be reached if neither player finds it beneficial to change their strategy.