computational game theory amos fiat modified slides prepared for yishay mansour’s class

Computational Game Theory

Amos Fiat

Modified Slides prepared for Yishay Mansour’s class

Lecture 1 - Introduction

Agenda
Introduction to Game Theory
Examples
Matrix form Games
Utility
Solution concepts
◦ Dominant Strategies
◦ Nash Equilibria
Complexity
Mechanism Design: reverse game theory

The study of Game Theory in the context of Computer Science, in order to reason about problems from the perspective of computability and algorithm design.

Computational Game Theory

Computing involves many different selfish entities, and therefore involves game theory.

The Internet, Intranet, etc.
◦ Many players (end-users, ISVs, Infrastructure Providers)
◦ Players wish to maximize their own benefit and act accordingly
◦ The trick is to design a system where it’s beneficial for the player to follow the rules

CGT in Computer Science

Theory
◦ Algorithm design
◦ Complexity
◦ Quality of game states (Equilibrium states in particular)
◦ Study of dynamics
Industry
◦ Sponsored search
◦ Other auctions

CGT in Computer Science

Rational Player
◦ Prioritizes possible actions according to utility or cost
◦ Strives to maximize utility or to minimize cost
Competitive Environment
◦ More than one player at the same time
Game Theory analyzes how rational players behave in competitive environments

Game Theory

Matrix representation of the game

The Prisoner’s Dilemma

                         Column Player
                     Thieves honor   Defect
Row Player
  Thieves honor          3,3          6,2
  Defect                 2,6          5,5

It is a dominant strategy to confess (defect)
A dominant strategy is a “solution concept”

The Prisoner’s Dilemma


Internet Service Providers (ISP) often share their physical networks for free

In some cases an ISP can either choose to route traffic in its own network or via a partner network

ISP Routing

ISP 1 needs to route traffic from s1 to t1

ISP 2 needs to route traffic from s2 to t2

The cost of routing along each edge is one

ISP Routing

ISP1 routes via B:
◦ Cost for ISP1: 1
◦ Cost for ISP2: 4

ISP Routing

Cost matrix for the game:

ISP Routing

               ISP 2
             A       B
ISP 1   A   3,3     6,2
        B   2,6     5,5

B, A: s1 to t1
B, A: s2 to t2

Prisoner’s Dilemma Again

The game consists of only one ‘turn’

All the players play simultaneously and are unaware of what the other players do

Players are selfish and seek to maximize their own benefit

Strategic Games

$N = \{1, \dots, n\}$ players
Player $i$ has actions $A_i$
We will say “action” or “strategy”
The space of all possible action vectors is $A = A_1 \times \dots \times A_n$
A joint action is the vector $a \in A$
Player $i$ has a utility function $u_i : A \to \mathbb{R}$
If utility is negative we may call it cost

Strategic Games – Formal Model

A strategic game: $\langle N, (A_i)_{i \in N}, (u_i)_{i \in N} \rangle$
◦ $N$: Players
◦ $(A_i)$: Actions of each player
◦ $(u_i)$: Utility of each player

Strategic Games – Formal Model

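Action $a_i$ of player $i$ is a weakly dominant strategy if, for every action vector $a_{-i}$ of the other players and every alternative action $b_i$ of player $i$:
$u_i(a_i, a_{-i}) \ge u_i(b_i, a_{-i})$

Dominant Strategies

Action $a_i$ of player $i$ is a strongly dominant strategy if, for every action vector $a_{-i}$ of the other players and every alternative action $b_i \ne a_i$ of player $i$:
$u_i(a_i, a_{-i}) > u_i(b_i, a_{-i})$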
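The dominance conditions can be checked mechanically on a small matrix game. The sketch below uses the Prisoner's Dilemma numbers from the earlier slide, under the assumption that the entries are costs (so utility is the negated cost). The names `COST`, `utility` and `is_dominant` are illustrative and not part of the slides.

```python
# Assumption: matrix entries are costs, listed as (row player cost, column player cost).
ACTIONS = ["Thieves honor", "Defect"]
COST = {
    ("Thieves honor", "Thieves honor"): (3, 3),
    ("Thieves honor", "Defect"): (6, 2),
    ("Defect", "Thieves honor"): (2, 6),
    ("Defect", "Defect"): (5, 5),
}


def utility(player, own_action, other_action):
    """Utility of `player` (0 = row, 1 = column) as the negated cost."""
    if player == 0:
        return -COST[(own_action, other_action)][0]
    return -COST[(other_action, own_action)][1]


def is_dominant(player, action, strict=False):
    """True if `action` is weakly (or, with strict=True, strongly) dominant."""
    for other_action in ACTIONS:
        for alternative in ACTIONS:
            if alternative == action:
                continue
            mine = utility(player, action, other_action)
            alt = utility(player, alternative, other_action)
            # Weak dominance fails if the alternative is strictly better;
            # strong dominance also fails on a tie.
            if mine < alt or (strict and mine == alt):
                return False
    return True


for player in (0, 1):
    print(player, [a for a in ACTIONS if is_dominant(player, a, strict=True)])
```

Under the cost reading, "Defect" comes out as strongly dominant for both players, which matches the slide's statement that confessing is a dominant strategy.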

An outcome a of a game is Pareto optimal if for every other outcome b, some player will lose by changing to b

Pareto Optimality

Vilfredo Pareto

St. Petersburg Paradox:
◦ Toss a coin until tails, I pay $2^k$, where $k$ is the number of tosses
◦ What will you pay me to play?

Bernoulli Utility

“Utility of Money”, “Bernoulli Utility”
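A small numeric sketch of why the paradox arises. It assumes the standard payoff of 2^k when the first tails appears on toss k (the payoff formula is not legible in the slide text), and uses a logarithmic utility of money as in Bernoulli's resolution. The function names are illustrative.

```python
import math


def truncated_expected_payoff(max_tosses):
    """Expected payoff when the game is cut off after `max_tosses` tosses.

    Assumption: first tails on toss k (probability 1/2^k) pays 2^k.
    """
    return sum((0.5 ** k) * (2 ** k) for k in range(1, max_tosses + 1))


def truncated_expected_log_utility(max_tosses):
    """Expected natural-log utility of the payoff under the same truncation."""
    return sum((0.5 ** k) * math.log(2 ** k) for k in range(1, max_tosses + 1))


for n in (10, 50, 200):
    print(n, truncated_expected_payoff(n), truncated_expected_log_utility(n))
```

Each extra toss adds 1 to the expected payoff, so it grows without bound, while the expected log utility converges to 2·ln 2. Under these assumptions the gamble is worth about as much as a sure payment of 4.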

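Von Neumann–Morgenstern Rationality Axioms (1944)
Preferences over lotteries
◦ Completeness: for any lotteries $L, M$: $L \succeq M$ or $M \succeq L$
◦ Transitivity: if $L \succeq M$ and $M \succeq N$ then $L \succeq N$
◦ Continuity: if $L \succeq M \succeq N$ then there is $p \in [0,1]$ with $pL + (1-p)N \sim M$
◦ Independence: for any $N$ and $p \in (0,1]$: $L \succeq M$ iff $pL + (1-p)N \succeq pM + (1-p)N$

Rationality Axioms

Utility function over lotteries, real valued, expected utility maximization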

Gamble A: 100% € 1,000,000
Gamble B: 10% € 5,000,000, 89% € 1,000,000, 1% Nothing
Gamble C: 11% € 1,000,000, 89% Nothing
Gamble D: 10% € 5,000,000, 90% Nothing

Allais Paradox (1953)

Gamble A or B?
Gamble C or D?

Experimental “Fact”: most people prefer A to B
Experimental “Fact”: most people prefer D to C


Allais Paradox

“Fact”:
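One way to see the conflict with expected utility: for any utility function over the three outcomes, the expected-utility gap between A and B equals the gap between C and D. An expected-utility maximizer who prefers A to B must therefore also prefer C to D. The sketch below checks this numerically; the candidate utility functions are arbitrary illustrations.

```python
import math

# Lotteries from the slide, as lists of (outcome in euros, probability).
A = [(1_000_000, 1.00)]
B = [(5_000_000, 0.10), (1_000_000, 0.89), (0, 0.01)]
C = [(1_000_000, 0.11), (0, 0.89)]
D = [(5_000_000, 0.10), (0, 0.90)]


def expected_utility(lottery, u):
    """Expected utility of a lottery under the utility function u."""
    return sum(p * u(outcome) for outcome, p in lottery)


def preference_gaps(u):
    """Return (EU(A) - EU(B), EU(C) - EU(D)) for the utility function u."""
    return (
        expected_utility(A, u) - expected_utility(B, u),
        expected_utility(C, u) - expected_utility(D, u),
    )


# Illustrative utility functions (assumptions, not from the slides).
candidates = {
    "linear": lambda x: x,
    "sqrt": math.sqrt,
    "log": lambda x: math.log(1 + x),
}
for name, u in candidates.items():
    print(name, preference_gaps(u))
```

Both gaps reduce algebraically to 0.11·u(1M) − 0.10·u(5M) − 0.01·u(0), so they always have the same sign.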

Expected Utility Theory

VNM Axioms ⇒ Expected Utility Maximization ⇒ Mixed Nash Equilibrium exists

Assume there’s a shared resource (network bandwidth) and N players.

Each player “uses” the common resource, by choosing Xi from [0,1].

Otherwise,

Tragedy of the commons

Given that the other players are fixed, what is the best response?

This is an equilibrium: no player can improve

The case for Privatization or central control of commons
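The utility formula did not survive in the slide text, so the sketch below assumes a common textbook form: player i gets x_i · (1 − Σ_j x_j) when the total usage is at most 1, and 0 otherwise. Under that assumption the best response to a fixed total S of the other players is (1 − S)/2, and the symmetric equilibrium is x_i = 1/(n+1).

```python
def utility(x_i, others_total):
    """Assumed utility: x_i * (1 - total usage), or 0 if the resource is overused."""
    total = x_i + others_total
    return x_i * (1 - total) if total <= 1 else 0.0


def best_response(others_total):
    """Maximizer of x * (1 - others_total - x) over x in [0, 1]."""
    return max(0.0, (1 - others_total) / 2)


def symmetric_equilibrium(n):
    """Usage level at which every player is best-responding to the others."""
    return 1 / (n + 1)


n = 4
x = symmetric_equilibrium(n)
equilibrium_welfare = n * utility(x, (n - 1) * x)
# If total usage were coordinated at 1/2 and split evenly, total utility would be 1/4.
coordinated_welfare = n * utility(0.5 / n, 0.5 * (n - 1) / n)
print(x, equilibrium_welfare, coordinated_welfare)
```

With n = 4 the equilibrium total utility is 0.16, below the 0.25 reachable with coordinated usage. That gap is what drives the argument for privatization or central control.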

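A Nash Equilibrium is an outcome $a$ of the game in which no player can improve its utility alone: for every player $i$ and every action $b_i$ of player $i$, $u_i(a_i, a_{-i}) \ge u_i(b_i, a_{-i})$

Alternative definition: every player’s action is a best response: for every player $i$, $a_i \in \arg\max_{b_i} u_i(b_i, a_{-i})$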

Nash Equilibrium
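For a two-player matrix game the definition can be checked by brute force over all outcomes. This is a minimal sketch; the function name and the dictionary representation are illustrative. The example uses the Prisoner's Dilemma numbers from the earlier slide, assuming they are costs and negating them to get utilities.

```python
from itertools import product


def pure_nash_equilibria(utilities, row_actions, col_actions):
    """Return all (row, col) outcomes where neither player gains by deviating alone.

    utilities[(r, c)] = (row player's utility, column player's utility).
    """
    equilibria = []
    for r, c in product(row_actions, col_actions):
        row_ok = all(utilities[(r, c)][0] >= utilities[(r2, c)][0] for r2 in row_actions)
        col_ok = all(utilities[(r, c)][1] >= utilities[(r, c2)][1] for c2 in col_actions)
        if row_ok and col_ok:
            equilibria.append((r, c))
    return equilibria


# Prisoner's Dilemma utilities = negated costs (assumption: slide entries are costs).
pd_actions = ["Thieves honor", "Defect"]
pd_utilities = {
    ("Thieves honor", "Thieves honor"): (-3, -3),
    ("Thieves honor", "Defect"): (-6, -2),
    ("Defect", "Thieves honor"): (-2, -6),
    ("Defect", "Defect"): (-5, -5),
}
print(pure_nash_equilibria(pd_utilities, pd_actions, pd_actions))
```

The only outcome that survives is (Defect, Defect), even though both players would be better off at (Thieves honor, Thieves honor).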

The payoff matrix:

Battle of the Sexes

The payoff matrix:

Battle of the Sexes

Row player has no incentive to move up

The payoff matrix:

Battle of the Sexes

Column player has no incentive to move left

The payoff matrix:

Battle of the Sexes

So this is an Equilibrium state

The payoff matrix:

Battle of the Sexes

Same thing here

2 players need to send a packet from point O to the network.

They can send it via A (costs 1) or B (costs 2)

Routing Game

The cost matrix:

Routing Game

The cost matrix:

Routing Game

Equilibrium states

2 players, each chooses Head or Tail
Row player wins if they match; the column player wins if they don’t
Utility matrix:

Matching Pennies

Row player is fine, but Column player wants to move left

Matching Pennies

Column player is fine, but Row player wants to move up

Matching Pennies

Row player is fine, but Column player wants to move right

Matching Pennies

Column player is fine, but Row player wants to move down

No equilibrium state!

Matching Pennies

Players do not choose a pure strategy (one specific strategy)

Players choose a distribution over their possible pure strategies

For example: with probability p choose Heads, and with probability 1-p choose Tails

Mixed Strategies

Row player chooses Heads with probability p and Tails with probability 1-p

Column player chooses Heads with probability q and Tails with probability 1-q

Row plays Heads: Row plays Tails:

Matching Pennies
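The expected utilities of the row player's two pure actions can be worked out as a function of q. The utility matrix is not legible in the slide text, so this sketch assumes the usual payoffs: the winner gets +1 and the loser gets −1. Exact fractions are used to avoid rounding.

```python
from fractions import Fraction

# Assumed row-player utilities: +1 on a match, -1 otherwise (zero-sum game).
ROW_UTILITY = {("H", "H"): 1, ("H", "T"): -1, ("T", "H"): -1, ("T", "T"): 1}


def row_expected_utility(row_action, q):
    """Row's expected utility when Column plays Heads with probability q."""
    return q * ROW_UTILITY[(row_action, "H")] + (1 - q) * ROW_UTILITY[(row_action, "T")]


def column_expected_utility(col_action, p):
    """Column's expected utility when Row plays Heads with probability p."""
    return -(p * ROW_UTILITY[("H", col_action)] + (1 - p) * ROW_UTILITY[("T", col_action)])


half = Fraction(1, 2)
print(row_expected_utility("H", half), row_expected_utility("T", half))
print(column_expected_utility("H", half), column_expected_utility("T", half))
```

Under these payoffs Row gets 2q − 1 from Heads and 1 − 2q from Tails, so Row is indifferent only at q = 1/2. By symmetry, Column is indifferent only at p = 1/2, which gives the mixed equilibrium.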

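Each player $i$ selects $p_i \in \Delta(A_i)$, where $\Delta(A_i)$ is the set of all possible distributions over $A_i$

An outcome of the game is the Joint Mixed Strategy $p = (p_1, \dots, p_n)$

An outcome $p$ of the game is a Mixed Nash Equilibrium if for every player $i$ and every $q_i \in \Delta(A_i)$: $u_i(p_i, p_{-i}) \ge u_i(q_i, p_{-i})$, where $u_i$ denotes expected utility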

Mixed Strategy

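2nd definition of Mixed Nash Equilibrium: a joint mixed strategy $p$ such that for every player $i$ and every pure action $a_i \in A_i$: $u_i(p_i, p_{-i}) \ge u_i(a_i, p_{-i})$

Definition: the support of $p_i$ is $\mathrm{supp}(p_i) = \{a_i \in A_i : p_i(a_i) > 0\}$

Property of Mixed Nash Equilibrium: every action in $\mathrm{supp}(p_i)$ is a best response to $p_{-i}$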

Mixed Strategy

No pure strategy Nash Equilibrium, only a Mixed Nash Equilibrium, in which each player plays the mixed strategy (1/3, 1/3, 1/3).

Rock Paper Scissors
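A quick check of both claims, assuming the usual payoffs (+1 for a win, −1 for a loss, 0 for a tie), which the slide does not state. Against a uniform opponent every pure action earns the same expected payoff, so no deviation helps. No pair of pure actions is a mutual best response.

```python
from fractions import Fraction

ACTIONS = ["Rock", "Paper", "Scissors"]
BEATS = {"Rock": "Scissors", "Paper": "Rock", "Scissors": "Paper"}


def payoff(mine, theirs):
    """Assumed payoff: +1 for a win, -1 for a loss, 0 for a tie."""
    if mine == theirs:
        return 0
    return 1 if BEATS[mine] == theirs else -1


def expected_payoff(action, opponent_mix):
    """Expected payoff of a pure action against a mixed strategy."""
    return sum(prob * payoff(action, other) for other, prob in opponent_mix.items())


def has_pure_nash():
    """True if some pair of pure actions is a mutual best response."""
    for a in ACTIONS:
        for b in ACTIONS:
            a_best = payoff(a, b) == max(payoff(x, b) for x in ACTIONS)
            b_best = payoff(b, a) == max(payoff(y, a) for y in ACTIONS)
            if a_best and b_best:
                return True
    return False


uniform = {a: Fraction(1, 3) for a in ACTIONS}
print([expected_payoff(a, uniform) for a in ACTIONS], has_pure_nash())
```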

N ice cream vendors are spread on the beach

Assume that the beach is the line [0,1]
Each vendor chooses a location $x_i$, which affects its utility (sales volume).
The utility for player $i$ (with locations sorted): $u_i = \frac{x_i - x_{i-1}}{2} + \frac{x_{i+1} - x_i}{2}$

$x_0 = 0$, $x_{n+1} = 1$

Location Game

For N=2 we have a pure Nash Equilibrium:

No player wants to move since it will lose space

For N=3 no pure Nash Equilibrium:

The player in the middle always wants to move to improve its utility

Location Game

[Figure: the line from 0 to 1 with the point 1/2 marked]

If instead of a line we assume a circle, there is always a pure Nash Equilibrium in which the players are evenly spaced:

Location Game

N companies are producing the same product

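Company $i$ needs to choose its production volume, $x_i \ge 0$

The price is determined based on the overall production volume, $p\left(\sum_j x_j\right)$

Each company has a production cost: $c_i(x_i)$
The utility of company $i$ is: $u_i = x_i \cdot p\left(\sum_j x_j\right) - c_i(x_i)$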

Cournot Competition

Case 1: Linear price, no production cost

◦ Utility:

◦ Pure Nash Equilibrium is reached at:

Cournot Competition
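The formulas for Case 1 are not legible in the slide text. The sketch below assumes a linear price p(X) = a − b·X with zero production cost, so company i's utility is x_i · (a − b·Σ_j x_j). The parameters `a` and `b` are illustrative. Setting the derivative to zero gives the best response (a − b·S)/(2b) to a total S produced by the others, and the symmetric fixed point is x_i = a / (b·(n+1)).

```python
def utility(x_i, others_total, a, b):
    """Assumed utility: quantity times the linear price a - b * (total production)."""
    return x_i * (a - b * (x_i + others_total))


def best_response(others_total, a, b):
    """Quantity maximizing x * (a - b * (x + others_total)), clipped at 0."""
    return max(0.0, (a - b * others_total) / (2 * b))


def symmetric_equilibrium(n, a, b):
    """Per-company quantity at which everyone is best-responding."""
    return a / (b * (n + 1))


n, a, b = 3, 12.0, 1.0
x = symmetric_equilibrium(n, a, b)
print(x, best_response((n - 1) * x, a, b), utility(x, (n - 1) * x, a, b))
```

With n = 3, a = 12, b = 1 each company produces 3, and 3 is exactly the best response to the other two producing 6 in total.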

Case 2: Harmonic price, no production cost

◦ Company i’s utility:

◦ Companies have incentive to produce as much as they can – no pure or mixed Nash Equilibrium

Cournot Competition

n players want to buy a single item which is on sale

Each player has a valuation for the product, $v_i$
Assume WLOG that $v_1 > v_2 > \dots > v_n$
Each player submits its bid, $b_i$; all players submit simultaneously.

Auction

Case 1: First price auction
◦ The player with the highest bid wins
◦ The price equals the bid
◦ 1st Equilibrium is:

The first player needs to know the valuation of the second player – not practical

◦ 2nd Equilibrium is:

Auction

Case 2: Second price auction: Vickrey Auction
◦ The player with the highest bid wins
◦ The price equals the second highest bid

No incentive to bid higher than one’s valuation - a player’s utility when it bids its valuation is at least as high as when it bids any other value

This mechanism encourages players to bid truthfully
Mechanism Design: reverse game theory – set up a game so that the equilibria have a desired property

Auction
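The truthfulness claim for the second price auction can be checked by brute force on a small grid of values and bids. This is a sketch under one added assumption: a bidder who ties with the highest competing bid loses. The function names are illustrative.

```python
def second_price_utility(my_bid, my_value, other_bids):
    """Utility of one bidder: value minus the second-highest bid if it wins, else 0.

    Assumption: a tie with the highest competing bid counts as a loss.
    """
    highest_other = max(other_bids)
    if my_bid > highest_other:
        return my_value - highest_other
    return 0


def truthful_is_weakly_dominant(values, bids, highest_other_bids):
    """Check on a grid that bidding one's value is never worse than any other bid."""
    for value in values:
        for highest_other in highest_other_bids:
            truthful = second_price_utility(value, value, [highest_other])
            for bid in bids:
                if second_price_utility(bid, value, [highest_other]) > truthful:
                    return False
    return True


print(second_price_utility(7, 7, [5, 3]))   # truthful bid wins and pays 5
print(second_price_utility(10, 7, [9, 3]))  # overbidding wins but pays 9
print(truthful_is_weakly_dominant(range(11), range(11), range(11)))
```

Overbidding only changes the outcome when it wins at a price above the valuation, and underbidding only changes it by losing an item that would have been profitable.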

Equilibrium Concepts

◦ pure Nash
◦ mixed Nash
◦ correlated eq
◦ no regret
◦ best-response dynamics

Traffic Flow: the Mathematical Model
◦ a directed graph G = (V,E)
◦ k source-destination pairs (s1,t1), …, (sk,tk)
◦ a rate (amount) ri of traffic from si to ti
◦ for each edge e, a cost function ce(•), assumed nonnegative, continuous, nondecreasing

Example (k, r = 1): one edge with c(x) = x carrying Flow = ½, one edge with c(x) = 1 carrying Flow = ½

Routings of Traffic

Traffic and Flows:
◦ fP = amount of traffic routed on si-ti path P
◦ flow vector f ⇔ routing of traffic

Selfish routing: what are the equilibria?

Nash Flows

Some assumptions:
◦ agents small relative to network (nonatomic game)
◦ want to minimize cost of their path

Def: A flow is at Nash equilibrium (or is a Nash flow) if all flow is routed on min-cost paths [given current edge congestion]

Example (figure): edges with costs x and 1; the flow moves from Flow = .5 on each edge to Flow = 1 on the x edge and Flow = 0 on the cost-1 edge

History + Generalizations

model, defn of Nash flows by [Wardrop 52]

Nash flows exist, are (essentially) unique
◦ due to [Beckmann et al. 56]
◦ general nonatomic games: [Schmeidler 73]

congestion game (payoffs fn of # of players)
◦ defined for atomic games by [Rosenthal 73]
◦ previous focus: Nash eq in pure strategies exist

potential game (equilibria as optima)
◦ defined by [Monderer/Shapley 96]

The Cost of a Flow

Def: the cost C(f) of flow f = sum of all costs incurred by traffic (avg cost × traffic rate)

Cost = ½•½ +½•1 = ¾


Formally: if cP(f) = sum of costs of edges of P (w.r.t. the flow f), then:

$C(f) = \sum_P f_P \cdot c_P(f)$

Cost = ½•½ +½•1 = ¾

Inefficiency of Nash Flows
Note: Nash flows do not minimize the cost
◦ observed informally by [Pigou 1920]

Cost of Nash flow = 1•1 + 0•1 = 1
Cost of optimal (min-cost) flow = ½•½ + ½•1 = ¾
Price of anarchy := Nash/OPT ratio = 4/3
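The numbers above can be reproduced for the two-edge example, with one unit of traffic split between an edge with cost c(x) = x and an edge with constant cost 1. The sketch searches a grid of splits for the minimum-cost flow and checks the Nash condition directly. The function names and the grid resolution are illustrative choices.

```python
from fractions import Fraction


def total_cost(x):
    """Cost C(f) when x units use the c(x) = x edge and 1 - x use the c(x) = 1 edge."""
    return x * x + (1 - x) * 1


def is_nash_flow(x):
    """True if every edge that carries traffic has minimum cost at this split."""
    cost_variable, cost_constant = x, 1
    min_cost = min(cost_variable, cost_constant)
    used_costs = []
    if x > 0:
        used_costs.append(cost_variable)
    if x < 1:
        used_costs.append(cost_constant)
    return all(c == min_cost for c in used_costs)


grid = [Fraction(i, 1000) for i in range(1001)]
optimal_split = min(grid, key=total_cost)
nash_splits = [x for x in grid if is_nash_flow(x)]
price_of_anarchy = total_cost(nash_splits[0]) / total_cost(optimal_split)
print(optimal_split, nash_splits, price_of_anarchy)
```

The only Nash split on the grid routes all traffic on the c(x) = x edge, while the cheapest split is half and half. That gives the ratio 4/3.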

Braess’s Paradox

Initial Network: [figure: s-t network with edge costs x and 1], cost = 1.5

Augmented Network: [figure: s-t network with edge costs x and 1], cost = 2

Now what?

All traffic incurs more cost! [Braess 68]

see also [Cohen/Horowitz 91], [Roughgarden 01]
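The figures did not survive extraction, so the sketch below assumes the standard Braess network: edges s→v and w→t with cost x, edges v→t and s→w with cost 1, and an added edge v→w with cost 0 in the augmented network. One unit of traffic travels from s to t. The node names `v` and `w` are illustrative.

```python
PATHS = ["s-v-t", "s-w-t", "s-v-w-t"]


def path_costs(flows):
    """Cost of each s-t path given the traffic on each path (missing paths carry 0)."""
    f1, f2, f3 = (flows.get(p, 0) for p in PATHS)
    load_sv = f1 + f3  # traffic on edge s->v, which has cost x
    load_wt = f2 + f3  # traffic on edge w->t, which has cost x
    return {
        "s-v-t": load_sv + 1,
        "s-w-t": 1 + load_wt,
        "s-v-w-t": load_sv + 0 + load_wt,
    }


def is_nash_flow(flows, available_paths):
    """True if all traffic is routed on minimum-cost available paths."""
    costs = path_costs(flows)
    min_cost = min(costs[p] for p in available_paths)
    return all(costs[p] == min_cost for p in available_paths if flows.get(p, 0) > 0)


def total_cost(flows):
    """Sum over paths of traffic times path cost."""
    costs = path_costs(flows)
    return sum(amount * costs[p] for p, amount in flows.items())


initial_paths = ["s-v-t", "s-w-t"]
split = {"s-v-t": 0.5, "s-w-t": 0.5}
shortcut = {"s-v-w-t": 1.0}
print(is_nash_flow(split, initial_paths), total_cost(split))
print(is_nash_flow(split, PATHS), is_nash_flow(shortcut, PATHS), total_cost(shortcut))
```

In the initial network the even split is a Nash flow with cost 1.5. Once the zero-cost edge is added the even split is no longer a Nash flow, and the flow that sends everything through the new edge is one, with cost 2.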
