218_1_lecture_3_part1

50
Lecture 3 Ongoing interactions: Dynamic games of complete information - Part 1

Upload: monil-joshi

Post on 03-May-2017

214 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: 218_1_lecture_3_part1

Lecture 3

Ongoing interactions: Dynamic games of

complete information - Part 1

Page 2: 218_1_lecture_3_part1

Outline • Extensive-form games

– Sub-game perfect equilibrium, one-step deviation principle – Stackelberg equilibrium

• Interventions: Policing, treats, punishments • Repeated games – formalism, equilibria, automata • Direct vs. Indirect Reciprocity

– Tit-for-Tat etc. – Ratings, Reputations – Tokens/fiat-money

• Bayesian Games • Illustrative examples and comparisons

Questions/comments/observations are always encouraged, at any point during the lecture!!

Page 3: 218_1_lecture_3_part1

Dynamic games

• players • who moves at each turn • knowledge at each turn • choices at each turn • outcomes • appropriate notion of solution

Page 4: 218_1_lecture_3_part1

Include • Hidden actions (normal form games) • Hidden information (Bayesian games)

Framework + solution notion(s) – complicated Begin with • Complete information: no hidden actions • Perfect information: no hidden information

Page 5: 218_1_lecture_3_part1

Extensive form games

• The normal form game representation does not incorporate any notion of sequence, or time, of the actions of the players

• The extensive form is an alternative representation that makes this temporal structure explicit.

• Two variants: – perfect information extensive-form games (= a game

where every player knows all the moves played by all the players before)

– imperfect-information extensive-form games

Page 6: 218_1_lecture_3_part1

Extensive form games - definition

Page 7: 218_1_lecture_3_part1

Extensive form games - definition

Page 8: 218_1_lecture_3_part1

Extensive form games - definition

Page 9: 218_1_lecture_3_part1

Game-tree representation

Page 10: 218_1_lecture_3_part1

Example

Page 11: 218_1_lecture_3_part1

Example

Page 12: 218_1_lecture_3_part1

Game-tree representation

Page 13: 218_1_lecture_3_part1

Example: the sharing game

Page 14: 218_1_lecture_3_part1

Pure-strategies in perfect-information games

Page 15: 218_1_lecture_3_part1

Pure-strategies example

Page 16: 218_1_lecture_3_part1

Nash equilibria

Idea: the game – finite => has a set of penultimate nodes => players moving at these nodes choose strategies leading to the terminal node(s) with max payoff. The players That have successors the penultimate nodes, choose actions that max their payoffs, Given the choice of penultimate nodes etc. ……. resulting strategy is PSNE

Page 17: 218_1_lecture_3_part1

Induced normal form

Page 18: 218_1_lecture_3_part1

Induced normal form

Page 19: 218_1_lecture_3_part1

Induced normal form

Page 20: 218_1_lecture_3_part1

New (stronger) equilibrium concepts needed?

Page 21: 218_1_lecture_3_part1

MAD game

On the board

Page 22: 218_1_lecture_3_part1

Chain-store game

We now define an equilibrium refinement concept that does not suffer from this threat credibility problem

Page 23: 218_1_lecture_3_part1

Subgame

Simplified: A subgame is determined by assuming that a certain history has already happened and considering the game from that point onward

Page 24: 218_1_lecture_3_part1

Subgame example

Page 25: 218_1_lecture_3_part1

Subgame perfect equilibrium

Backwards induction algorithm • Identify all terminal subgames • Determine the Nash eq. for these subgames • Modify the original game tree by replacing the terminal subgames with the Nash equilibrium payoffs • Repeat until the three is reduced to one stage game, and then determine the Nash equilibrium.

Page 26: 218_1_lecture_3_part1

Example backwards induction algorithm

What about the MAD and CS games?

Page 27: 218_1_lecture_3_part1

MAD Game

Page 28: 218_1_lecture_3_part1

Chain Store Game

Page 29: 218_1_lecture_3_part1

Subgame perfect equilibrium - example

Page 30: 218_1_lecture_3_part1

Bargaining Game of Alternating Offers (Extensive Form Game)

Player 1

(1), (1)x M x−

Player 2 (1), (1)x M x−

accept reject

Player 2

(2), (2)M y y−

Round 1

Player 1

accept reject

(2), (2)M y y−Player 1 Round 2

Round 3

Page 31: 218_1_lecture_3_part1

Another example: Alternative offers (Bargaining)

Page 32: 218_1_lecture_3_part1

Another example: Alternative offers (Bargaining)

Page 33: 218_1_lecture_3_part1

Another example: Alternative offers (Bargaining)

Page 34: 218_1_lecture_3_part1
Page 35: 218_1_lecture_3_part1
Page 36: 218_1_lecture_3_part1

Another example: Stackelberg competition

Page 37: 218_1_lecture_3_part1
Page 38: 218_1_lecture_3_part1

Theorem (Zermelo) • Every finite extensive form game of complete

and perfect information has a sub-game perfect equilibrium (SGPE). If there are no ties, SGPE is unique.

• Proof: Backwards induction “Pruning the tree”

Page 39: 218_1_lecture_3_part1

Remarks

Page 40: 218_1_lecture_3_part1

Imperfect information

Page 41: 218_1_lecture_3_part1

Example

Page 42: 218_1_lecture_3_part1

Recall:

Page 43: 218_1_lecture_3_part1

Example

Information sets for player 1: { } and {( , ),( , )}

4 pure strategies for player 1, corresponding to the

information sets { } and {( , ),( , )}:

and , and , and and and

L A L B

L A L B

L l L r R l R r

Page 44: 218_1_lecture_3_part1

Normal form game

Page 45: 218_1_lecture_3_part1

Induced normal form

Page 46: 218_1_lecture_3_part1

Mixed and behavioral strategies

• It turns out there are two meaningfully different kinds of randomized strategies in imperfect information extensive form games

– mixed strategies – behavioral strategies

• Mixed strategy: randomize over pure strategies • Behavioral strategy: independent coin toss every time an

information set is encountered • A mixed strategy is a distribution of vectors (each vector

describing a pure strategy) • A behavioral strategy is a vector of distributions

Page 47: 218_1_lecture_3_part1

Mixed and behavioral strategies

• Two types of randomization: • Mixed strategy: user randomizes over pure strategies • Behavioral strategy: users plans a collection of

randomizations, one for each point at which it needs to take an action

• Mixed strategies and behavioral strategies are

equivalent for games of perfect recall – No player ever forgets any information it knew

• Nash eq. in behavioral strategies = profile such that no player can increase its expected payoff by using a different behavioral strategy.

Page 48: 218_1_lecture_3_part1

• Give an example of a behavioral strategy: – A with probability .5 and G with probability .3

• Give an example of a mixed strategy that is not a behavioral strategy:

– (.6(A,G), .4(B,H)) (Why not? Because the choices made by him at the two nodes are not independent)

• In this game every behavioral strategy corresponds to a mixed strategy..

• What about imperfect-recall games? Examples next

Page 49: 218_1_lecture_3_part1

Games of imperfect recall

Page 50: 218_1_lecture_3_part1

Games of imperfect recall