algorithmic game theory and economics: a very short...

Post on 17-Jun-2020

3 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Algorithmic Game Theory and Economics: A very short introduction

Mysore Park Workshop

August, 2012

PRELIMINARIES

Game

Rock Paper Scissors

0,0 -1,1 1,-1

1,-1 0,0 -1,1

-1,1 1,-1 0,0

Bimatrix Notation

A B

Stra

tegi

es o

f P

laye

r A

Strategies of Player B

Stra

tegi

es o

f P

laye

r A

Strategies of Player B

Zero Sum Game: A + B = all zero matrix

The Notion of Equilibrium

i i

j j

A B ๐ด๐‘–๐‘— โ‰ฅ ๐ด๐‘–โ€ฒ๐‘— โˆ€๐‘–โ€ฒ ๐ต๐‘–๐‘— โ‰ฅ ๐ต๐‘–๐‘—โ€ฒ โˆ€๐‘—

โ€ฒ

i is a best response to j

j is a best response to i

It neednโ€™t exist

0,0 -1,1 1,-1

1,-1 0,0 -1,1

-1,1 1,-1 0,0

Mixed Strategies and Equilibrium ๐‘1,๐‘

2,โ‹ฏ

,๐‘๐‘›

๐‘1,๐‘

2,โ‹ฏ

,๐‘๐‘›

๐‘ž1, ๐‘ž2, โ‹ฏ , ๐‘ž๐‘› ๐‘ž1, ๐‘ž2, โ‹ฏ , ๐‘ž๐‘›

A B

Expected utility of A = ๐‘๐‘–๐ด๐‘–๐‘—๐‘ž๐‘—๐‘–,๐‘— = ๐‘โŠค๐ด๐‘ž

Expected utility of B = ๐‘๐‘–๐ต๐‘–๐‘—๐‘ž๐‘—๐‘–,๐‘— = ๐‘โŠค๐ต๐‘ž

๐‘โŠค๐ด๐‘ž โ‰ฅ ๐‘โ€ฒโŠค๐ด๐‘ž โˆ€๐‘โ€ฒ ๐‘โŠค๐ต๐‘ž โ‰ฅ ๐‘โŠค๐ต๐‘žโ€ฒ โˆ€๐‘žโ€ฒ

J. von Nuemannโ€™s Minimax Theorem

Theorem (1928) If A+B=0, then equilibrium mixed strategies (p,q) exist.

max๐‘

min๐‘ž

๐‘โŠค๐ด๐‘ž = min๐‘ž

max๐‘

๐‘โŠค๐ด๐‘ž

"As far as I can see, there could be no theory of games โ€ฆ without that theorem โ€ฆ I thought there was nothing worth publishing until the Minimax Theorem was proved"

John Nash and Nash Equilibrium

Theorem (1950) Mixed Equilibrium always exist for finite games.

โˆƒ ๐‘ , ๐‘ž :

๐‘โŠค๐ด๐‘ž โ‰ฅ ๐‘โ€ฒโŠค๐ด๐‘ž โˆ€๐‘โ€ฒ

๐‘โŠค๐ต๐‘ž โ‰ฅ ๐‘โŠค๐ต๐‘žโ€ฒ โˆ€๐‘žโ€ฒ

Markets

Walrasian Model: Exchange Economies

โ€ข N goods {1,2,โ€ฆ,N} โ€ข M agents {1,2,โ€ฆ,M}

โ€ข ๐‘’๐‘– โ‰” (๐‘’๐‘–1, ๐‘’๐‘–2, โ€ฆ , ๐‘’๐‘–๐‘) โ€ข ๐‘ˆ๐‘– ๐’™ = ๐‘ˆ๐‘–(๐‘ฅ๐‘–1, โ€ฆ , ๐‘ฅ๐‘–๐‘)

Walrasian Model: Exchange Economies

โ€ข N goods {1,2,โ€ฆ,N} โ€ข M agents {1,2,โ€ฆ,M}

โ€ข ๐‘’๐‘– โ‰” (๐‘’๐‘–1, ๐‘’๐‘–2, โ€ฆ , ๐‘’๐‘–๐‘) โ€ข ๐‘ˆ๐‘– ๐’™ = ๐‘ˆ๐‘–(๐‘ฅ๐‘–1, โ€ฆ , ๐‘ฅ๐‘–๐‘)

โ€ข Prices ๐’‘ = ๐‘1, โ€ฆ , ๐‘๐‘ โ‡’ Demand (๐ท1, โ€ฆ , ๐ท๐‘€)

๐ท๐‘– ๐’‘ = argmax { ๐‘ˆ๐‘–(๐’™) โˆถ ๐’‘ โ‹… ๐’™ โ‰ค ๐’‘ โ‹… ๐’†๐’Š}

โ€ข ๐’‘, ๐’™๐Ÿ, โ€ฆ , ๐’™๐‘ด are a Walrasian equilibrium if

o ๐’™๐’Š โˆˆ ๐ท๐‘– ๐’‘ for all ๐‘– โˆˆ [๐‘€]

o ๐’™๐‘—๐’Š = ๐‘’๐‘–๐‘—๐’Š๐‘– for all j โˆˆ [๐‘]

Utility Maximization

Market Clearing

Leon Walras and the tatonnement

โ€ข Given price p, calculate demand for each good i.

โ€ข If demand exceeds supply, raise price.

โ€ข If demand is less than supply, decrease price.

Does this process converge? Does equilibrium exist?

1874

Arrow and Debreu

Theorem (1954). If utilities of agents are continuous, and strictly quasiconcave, then Walrasian equilibrium exists.

How does one prove such theorems?

โˆƒ ๐‘ , ๐‘ž : ๐‘โŠค๐ด๐‘ž โ‰ฅ ๐‘โ€ฒโŠค๐ด๐‘ž โˆ€๐‘โ€ฒ

๐‘โŠค๐ต๐‘ž โ‰ฅ ๐‘โŠค๐ต๐‘žโ€ฒ โˆ€๐‘žโ€ฒ Nash.

Define: ฮฆ ๐‘, ๐‘ž = (๐‘โ€ฒ, ๐‘žโ€ฒ)

โ€ข ๐‘โ€ฒ is a best-response to ๐‘ž

โ€ข ๐‘žโ€ฒ is a best-response to ๐‘

โ€ข Mapping is continuous.

Brouwers Fixed Point Theorem. Every continuous mapping from a convex, compact set to itself has a fixed point.

๐œ™ ๐‘, ๐‘ž = ๐‘, ๐‘ž โ‡’ Equilibrium.

EQUILIBRIUM COMPUTATION

Support Enumeration Algorithms

โˆƒ ๐‘ , ๐‘ž : ๐‘โŠค๐ด๐‘ž โ‰ฅ ๐‘โ€ฒโŠค๐ด๐‘ž โˆ€๐‘โ€ฒ; ๐‘โŠค๐ต๐‘ž โ‰ฅ ๐‘โŠค๐ต๐‘žโ€ฒ โˆ€๐‘žโ€ฒ Nash.

Suppose knew the support S and T of (p,q).

๐‘ , ๐‘ž : ๐‘’๐‘–

โŠค๐ด๐‘ž โ‰ฅ ๐‘’๐‘–โ€ฒโŠค๐ด๐‘ž; โˆ€๐‘– โˆˆ ๐‘†, โˆ€๐‘–โ€ฒ

๐‘โŠค๐ต๐‘’๐‘— โ‰ฅ ๐‘โŠค๐ต๐‘’๐‘—โ€ฒ โˆ€๐‘— โˆˆ ๐‘‡, โˆ€๐‘—โ€ฒ

๐‘๐‘– = 0; ๐‘ž๐‘— = 0 โˆ€๐‘– โˆ‰ S, โˆ€๐‘— โˆ‰ ๐‘‡

Every point above is a Nash equilibrium.

Lipton-Markakis-Mehta

Lemma. There exists ๐œ–-approximate NE with

support size at most ๐พ = ๐‘‚log ๐‘›

๐œ–2.

๐‘ , ๐‘ž : ๐‘โŠค๐ด๐‘ž โ‰ฅ ๐‘โ€ฒโŠค๐ด๐‘ž โˆ’ ๐œ– โˆ€๐‘โ€ฒ;

๐‘โŠค๐ต๐‘ž โ‰ฅ ๐‘โŠค๐ต๐‘žโ€ฒ โˆ’ ๐œ– โˆ€๐‘žโ€ฒ

๐-Nash

Given true NE (p,q), sample P,Q i.i.d. K times. Uniform distribution over P,Q is ๐œ–-Nash.

Chernoff Bounds

Values in [0,1]

Approximate Nash Equilibrium

Theorem (2003) [LMM] For any bimatrix game with at most ๐‘› strategies, an ๐œ–-approximate Nash equilibrium can be

computed in time ๐‘›๐‘‚ log ๐‘› /๐œ–2

Theorem (2009) [Daskalakis, Papadimitriou] Any oblivious algorithm for Nash equilibrium runs

in expected time ฮฉ(๐‘›[0.8โˆ’0.34๐œ–]log ๐‘›).

Lemke-Howson 1964

๐ด๐‘ง โ‰ค ๐Ÿ ๐‘ง โ‰ฅ 0

โˆ€๐‘–: ๐‘ง๐‘– = 0 OR ๐ด๐‘ง ๐‘– = 1

โ€ข Nonzero solution โ‰ˆ Nash equilibrium.

โ€ข Every vertex has one โ€œescapeโ€ and โ€œentryโ€

โ€ข There must be a sink โ‰ก Non zero solution.

123

113

133

123

How many steps?

โ€ข n strategies imply at most ๐‘‚(2๐‘›) steps.

โ€ข Theorem (2004).[Savani, von Stengel] Lemke-Howson can take ฮฉ ๐‘๐‘› steps, for some ๐‘ > 1.

Hardness of NASH

Theorem (2005) [Daskalakis, Papadimitriou, Goldberg]

Computing Nash equilibrium in a 4-player game is PPAD hard.

Theorem (2006) [Chen, Deng]

Computing Nash equilibrium in a 2-player game is PPAD hard.

Iโ€™ve heard of NP. Whatโ€™s this PPAD?

โ€ข Subclass of โ€œsearchโ€ problems whose solutions are guaranteed to exist (more formally, TFNP)

โ€ข PPAD captures problems where existence is proved via a parity argument in a digraph.

โ€ข End of Line. Given G = ( 0,1 ๐‘›, ๐ด), in/out-deg โ‰ค 1 out-deg(0๐‘›)=1, oracle for in/out(v); find v with in-deg(v)=1 and out-deg(v) = 0.

โ€ข PPAD is problems reducible to EOL.

Summary of Nash equilibria

โ€ข Exact calculation is PPAD-hard. Even getting an FPTAS is PPAD-hard (Chen,Deng,Teng โ€˜07) Quasi-PTAS exists.

โ€ข Some special cases of games have had more success โ€“ Rank 1 games. (Rutaโ€™s talk!)

โ€“ PTAS for constant rank (Kannan, Theobald โ€™07) sparse games (D+P โ€˜09) small probability games (D+P โ€™09)

โ€ข OPEN: Is there a PTAS to compute NE?

General Equilibrium Story

โ€ข Computing General Equilibria is PPAD-hard. (Reduction to Bimatrix games)

โ€ข Linear case and generalizations can be solved via convex programming techniques. Combinatorial techniques.

โ€ข โ€œSubstitutabilityโ€ makes tatonnement work.

MECHANISM DESIGN

Traditional Algorithms

ALGORITHM INPUT OUTPUT

MAXIMIZER N numbers (๐‘ฃ1, โ€ฆ , ๐‘ฃ๐‘)

๐‘ฃmax

Auctions

๐‘ฃ1

๐‘ฃ2

๐‘ฃ๐‘

๐‘ฃmax

Vickrey Auction โ€ข Setting

โ€“ Solicit bids from agents.

โ€“ Allocate item to one agent.

โ€“ Charge an agent (no more than bid.)

โ€ข Second Price Auction

โ€“ Assign item to the highest bidder.

โ€“ Charge the bid of the second highest bidder.

Theorem (1961). No player has an incentive to misrepresent bids in the second price auction.

Mechanism Design Framework

โ€ข Feasible Solution Space. ๐น โŠ† ๐‘น๐‘

โ€ข Agents. Valuations ๐‘ฃ๐‘–: ๐น โ†ฆ ๐‘น Reports type/bid ๐‘๐‘–: ๐น โ†ฆ ๐‘น

โ€ข Mechanism.

โ€“ Allocation: ๐‘ฅ1, ๐‘ฅ2, โ€ฆ , ๐‘ฅ๐‘ โˆˆ ๐น

โ€“ Prices: (๐‘1, ๐‘2, โ€ฆ ๐‘๐‘)

โ€“ Individual Rationality: ๐‘๐‘– โ‰ค ๐‘๐‘–(๐‘ฅ๐‘–)

โ€“ Incentive Compatibility: ๐‘ฃ๐‘– ๐‘ฅ๐‘–(๐‘ฃ๐‘–) โˆ’ ๐‘ ๐‘ฃ๐‘– โ‰ฅ ๐‘ฃ๐‘– ๐‘ฅ๐‘–(๐‘๐‘–) โˆ’ ๐‘๐‘– ๐‘๐‘–

โ€ข Goal.

Welfare Maximization: VCG

Goal. Maximize ๐‘ฃ๐‘– ๐‘ฅ๐‘– : ๐‘ฅ1, โ€ฆ , ๐‘ฅ๐‘ โˆˆ ๐น๐‘–

VCG Mechanism

โ€ข Find ๐’™โˆ— which maximizes welfare.

โ€ข For agent ๐‘Ž calculate

o ๐’™ maximizing ๐‘ฃ๐‘– ๐‘ฅ๐‘– : ๐’™ โˆˆ ๐น ๐‘–โ‰ ๐‘Ž

o Charge ๐‘๐‘Ž = ๐‘ฃ๐‘– ๐‘ฅ ๐‘– โˆ’ ๐‘ฃ๐‘– ๐‘ฅ๐‘–โˆ—

๐‘–โ‰ ๐‘Ž๐‘–โ‰ ๐‘Ž

โ€ข For single item โ†ฆ second price auction.

o Utility = ๐‘ฃ๐‘– ๐‘ฅ๐‘–โˆ— โˆ’ ๐‘ฃ๐‘–(๐‘ฅ ๐‘–)๐‘–โ‰ ๐‘Ž ๐‘–

Combinatorial Auctions

โ€ข N agents, M indivisible items. Utilities ๐‘ˆ๐‘–.

โ€ข Hierarchies of utilities: subadditive, submodular, budgeted additive, โ€ฆ.

โ€ข Problem with VCG: maximization NP hard.

โ€ข Approximation and VCG donโ€™t mix: even a PTAS doesnโ€™t imply truthfulness.

โ€ข Algorithmic Mechanism Design Nisan, Ronen 1999.

Minimax Objective: Load balancing

Goal. minimizemaxi

๐‘ฃ๐‘– ๐‘ฅ๐‘– : ๐‘ฅ1, โ€ฆ , ๐‘ฅ๐‘ โˆˆ ๐น

Age

nts

โ€ข VCG only works for SUM

โ€ข ๐‘‚(1)-approximation known for only special cases.

โ€ข General case = ฮฉ(๐‘›)?

โ€ข Characterization (?)

Single Dimensional Settings

โ€ข Each agent controls one parameter privately.

โ€ข Monotone allocation algorithm โ‡’ Truthful. (๐‘ฅ๐‘– should increase with ๐‘ฃ๐‘–)

โ€ข Is there a setting where monotonicity constraint degrades performance?

Techniques in AMD

โ€ข Maximal in Range Algorithms. Restrict range a priori s.t. maximization in P. Argue restriction doesnโ€™t hurt much. Nisan-Ronen 99, Dobzinski-Nisan-Schapira โ€˜05, Dobzinski-Nisan โ€˜07, Buchfuhrer et al 2010โ€ฆ.

โ€ข Randomized mechanisms. Linear programming techniques Lavi-Swamy 2005.

Maximal in Distributional Range Dobzinski-Dughmi 2009.

Convex programming Dughmi-Roughgarden-Yan 2011.

โ€ข Lower Bounds. Via characterizations. Dobzinski-Nisan 2007

Communication complexity. Nisan Segal 2001

Summary of Mechanism Design

โ€ข AMD opens up a whole suite of algorithm questions.

โ€ข Lower bounds to performance. Are we asking the right question?

โ€ข Can characterizations developed by economists exploited algorithmically?

WHAT I COULDNโ€™T COVER

THANK YOU

top related