a game theoretic study of energy efficient cooperative

11
A Game Theoretic Study of Energy Efcient Cooperative Wireless Networks Donald Richard Brown III and Fatemeh Fazel Abstract: In wireless networks, it is well-known that intermediate nodes can be used as cooperative relays to reduce the transmis- sion energy required to reliably deliver a message to an intended destination. When the network is under a central authority, en- ergy allocations and cooperative pairings can be assigned to op- timize the overall energy efciency of the network. In networks with autonomous selsh nodes, however, nodes may not be will- ing to expend energy to relay messages for others. This problem has been previously addressed through the development of extrin- sic incentive mechanisms, e.g., virtual currency, or the insertion of altruistic nodes in the network to enforce cooperative behavior. This paper considers the problem of how selsh nodes can decide on an efcient energy allocation and endogenously form coopera- tive partnerships in wireless networks without extrinsic incentive mechanisms or altruistic nodes. Using tools from both cooperative and non-cooperative game theory, the three main contributions of this paper are (i) the development of Pareto-efcient cooperative energy allocations that can be agreed upon by selsh nodes, based on axiomatic bargaining techniques, (ii) the development of neces- sary and sufcient conditions under which “natural” cooperation is possible in systems with fading and non-fading channels with- out extrinsic incentive mechanisms or altruistic nodes, and (iii) the development of techniques to endogenously form cooperative part- nerships without central control. Numerical results with orthogo- nal amplify-and-forward cooperation are also provided to quantify the energy efciency of a wireless network with sources selshly allocating transmission/relaying energy and endogenously forming cooperative partnerships with respect to a network with centrally optimized energy allocations and pairing assignments. Index Terms: Cooperative communication, game theory, wireless networks. I. INTRODUCTION Multihop or cooperative transmission is often used in wire- less ad hoc networks to increase energy efciency by allowing packets to be delivered over several short links [1]. One or more intermediate nodes between the source and destination can assist in the transmission by forwarding or relaying the packet along the route to the destination. Autonomous nodes acting in their own self-interest, however, may refuse to use their limited re- sources to forward packets for other nodes. This can lead to in- efcient use of the network resources since messages may have to be retransmitted or re-routed through different paths to the destination node [2]. Manuscript received February 5, 2010; approved for publication by Sanghoon Lee, Division II Editor, March 31, 2010. This work was supported by NSF award CCF-0447743. The authors are with the Electrical and Computer Engineering Department, Worcester Polytechnic Institute, Worcester, MA 01609, email: {drb, ffazel} @ece.wpi.edu. Several techniques have been proposed to encourage coop- eration and improve the efciency of wireless ad hoc networks with selsh autonomous nodes. A comprehensive study of these techniques can be found in [3]. One well-studied technique to encourage cooperation among selsh nodes is the development of extrinsic incentive mechanisms, e.g., virtual currency [4], [5], where nodes are reimbursed for cooperation. The idea of vir- tual currency is intuitively appealing in many scenarios, but the use of virtual currency has the potential for fraud and/or col- lusion as discussed in [6]. Another technique that can induce cooperation is the introduction of altruistic nodes into the net- work [7] that punish misbehaving nodes. While both of these techniques have been shown to encourage cooperation among selsh nodes, they both require some level of central authority in the network to perform accounting or to strategically insert altruistic nodes. They also implicitly assume that the near-term costs and benets of cooperative behavior are one-sided, hence remuneration is necessary to enable cooperation. While this as- sumption is true in some cases, recent studies, e.g., [8], have shown that the benets of cooperation can be two-sided and have considered the question of when “natural” cooperation is possible in large networks without any central authority. In [9], a two-player relaying game based on the orthogonal amplify- and-forward (OAF) cooperative transmission protocol [10] was analyzed in a non-cooperative game-theoretic framework and it was shown that natural cooperation without extrinsic incentive mechanisms or altruistic nodes can emerge under certain con- ditions on the channels. Two limitations of this work, however, are that it used centrally-controlled energy allocations and did not consider networks with more than two source nodes. This paper considers the problem of how selsh nodes can lo- cally decide on an efcient energy allocation and endogenously form cooperative partnerships in wireless networks with two or more source nodes and without any sort of extrinsic incentive mechanisms, altruistic nodes, or central authority. We rst use bargaining tools from cooperative game theory to determine ef- cient energy allocations that can be locally computed and agreed to by a pair of selsh nodes. We then develop a repeated-game framework and employ tools from non-cooperative game theory to describe necessary and sufcient conditions under which nat- ural cooperation between a pair of nodes is possible. To extend our results to networks with more than two source nodes, we then consider the question of how to endogenously form cooper- ative partnerships in general networks and propose the use of the “stable roommates” algorithm [11] to form partnerships that are stable with respect to unilateral or bilateral deviations. Finally, numerical results with OAF cooperation are provided to quan- tify the energy efciency of a wireless network with sources selshly allocating transmission/relaying energy and endoge- nously forming cooperative partnerships with respect to a net- 1229-2370/11/$10.00 c 2011 KICS

Upload: others

Post on 27-Dec-2021

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Game Theoretic Study of Energy Efficient Cooperative

A Game Theoretic Study of Energy Efficient CooperativeWireless Networks

Donald Richard Brown III and Fatemeh Fazel

Abstract: In wireless networks, it is well-known that intermediatenodes can be used as cooperative relays to reduce the transmis-sion energy required to reliably deliver a message to an intendeddestination. When the network is under a central authority, en-ergy allocations and cooperative pairings can be assigned to op-timize the overall energy efficiency of the network. In networkswith autonomous selfish nodes, however, nodes may not be will-ing to expend energy to relay messages for others. This problemhas been previously addressed through the development of extrin-sic incentive mechanisms, e.g., virtual currency, or the insertionof altruistic nodes in the network to enforce cooperative behavior.This paper considers the problem of how selfish nodes can decideon an efficient energy allocation and endogenously form coopera-tive partnerships in wireless networks without extrinsic incentivemechanisms or altruistic nodes. Using tools from both cooperativeand non-cooperative game theory, the three main contributions ofthis paper are (i) the development of Pareto-efficient cooperativeenergy allocations that can be agreed upon by selfish nodes, basedon axiomatic bargaining techniques, (ii) the development of neces-sary and sufficient conditions under which “natural” cooperationis possible in systems with fading and non-fading channels with-out extrinsic incentive mechanisms or altruistic nodes, and (iii) thedevelopment of techniques to endogenously form cooperative part-nerships without central control. Numerical results with orthogo-nal amplify-and-forward cooperation are also provided to quantifythe energy efficiency of a wireless network with sources selfishlyallocating transmission/relaying energy and endogenously formingcooperative partnerships with respect to a network with centrallyoptimized energy allocations and pairing assignments.

Index Terms: Cooperative communication, game theory, wirelessnetworks.

I. INTRODUCTION

Multihop or cooperative transmission is often used in wire-less ad hoc networks to increase energy efficiency by allowingpackets to be delivered over several short links [1]. One or moreintermediate nodes between the source and destination can assistin the transmission by forwarding or relaying the packet alongthe route to the destination. Autonomous nodes acting in theirown self-interest, however, may refuse to use their limited re-sources to forward packets for other nodes. This can lead to in-efficient use of the network resources since messages may haveto be retransmitted or re-routed through different paths to thedestination node [2].

Manuscript received February 5, 2010; approved for publication by SanghoonLee, Division II Editor, March 31, 2010.This work was supported by NSF award CCF-0447743.The authors are with the Electrical and Computer Engineering Department,

Worcester Polytechnic Institute, Worcester, MA 01609, email: {drb, ffazel}@ece.wpi.edu.

Several techniques have been proposed to encourage coop-eration and improve the efficiency of wireless ad hoc networkswith selfish autonomous nodes. A comprehensive study of thesetechniques can be found in [3]. One well-studied technique toencourage cooperation among selfish nodes is the developmentof extrinsic incentive mechanisms, e.g., virtual currency [4], [5],where nodes are reimbursed for cooperation. The idea of vir-tual currency is intuitively appealing in many scenarios, but theuse of virtual currency has the potential for fraud and/or col-lusion as discussed in [6]. Another technique that can inducecooperation is the introduction of altruistic nodes into the net-work [7] that punish misbehaving nodes. While both of thesetechniques have been shown to encourage cooperation amongselfish nodes, they both require some level of central authorityin the network to perform accounting or to strategically insertaltruistic nodes. They also implicitly assume that the near-termcosts and benefits of cooperative behavior are one-sided, henceremuneration is necessary to enable cooperation. While this as-sumption is true in some cases, recent studies, e.g., [8], haveshown that the benefits of cooperation can be two-sided andhave considered the question of when “natural” cooperation ispossible in large networks without any central authority. In [9],a two-player relaying game based on the orthogonal amplify-and-forward (OAF) cooperative transmission protocol [10] wasanalyzed in a non-cooperative game-theoretic framework and itwas shown that natural cooperation without extrinsic incentivemechanisms or altruistic nodes can emerge under certain con-ditions on the channels. Two limitations of this work, however,are that it used centrally-controlled energy allocations and didnot consider networks with more than two source nodes.This paper considers the problem of how selfish nodes can lo-

cally decide on an efficient energy allocation and endogenouslyform cooperative partnerships in wireless networks with two ormore source nodes and without any sort of extrinsic incentivemechanisms, altruistic nodes, or central authority. We first usebargaining tools from cooperative game theory to determine effi-cient energy allocations that can be locally computed and agreedto by a pair of selfish nodes. We then develop a repeated-gameframework and employ tools from non-cooperative game theoryto describe necessary and sufficient conditions under which nat-ural cooperation between a pair of nodes is possible. To extendour results to networks with more than two source nodes, wethen consider the question of how to endogenously form cooper-ative partnerships in general networks and propose the use of the“stable roommates” algorithm [11] to form partnerships that arestable with respect to unilateral or bilateral deviations. Finally,numerical results with OAF cooperation are provided to quan-tify the energy efficiency of a wireless network with sourcesselfishly allocating transmission/relaying energy and endoge-nously forming cooperative partnerships with respect to a net-

1229-2370/11/$10.00 c© 2011 KICS

Page 2: A Game Theoretic Study of Energy Efficient Cooperative

work with centrally optimized energy allocations and pairingassignments.Unlike the previous studies on this subject, the novelty of

the approach in this paper is that the nodes in the network be-have selfishly without any form of central authority, commu-nity enforcement, or extrinsic incentive mechanisms. Selfish au-tonomous nodes endogenously form cooperative partnerships,locally determine efficient energy allocations, and cooperate byrelaying messages during transmission sessions with multipleframes. Throughout this paper, we assume that nodes exhibit ra-tional individual choice behavior, meaning that each individualsource node has a consistent preference relation over all pos-sible energy allocations and partners, and always chooses themost preferred feasible alternative. We also assume that nodescan always refuse to cooperate if it is in their best interest to doso.The rest of the paper is organized as follows. Section II

introduces the system model used throughout the paper. InSection III, we present a two-player relaying game in stagegame formulation, develop axiomatic bargaining solutions todetermine an optimum energy allocation that two selfish play-ers can agree upon, and then develop necessary and sufficientconditions under which selfish nodes will cooperate and notdefect under a repeated-game formulation for both fading andnon-fading channels. Section IV extends these two-player re-sults to networks with K>2 sources and describes a techniquein which the sources can endogenously form stable cooperativepairings. Section V provides numerical energy efficiency exam-ples based on OAF cooperative transmission and concluding re-marks are made in Section VI.

II. SYSTEM MODEL

We consider an ad hoc wireless network with L half-duplexnodes and a discrete model of time where nodes transmit in-formation to other nodes in the network in transmission ses-sions of variable duration. The sets of source nodes and des-tination nodes in a given transmission session are denoted asS and D, respectively, where |S| = |D| = K ≤ L/2 andS∩D = ∅. Fig. 1 shows such a network for the case whenL = 4and K = 2. The destination node for source node i ∈ S is de-noted as di ∈ D. It is assumed that the number of nodes and/orthe amount of offered network traffic is sufficiently large suchthat, in any given transmission session, K ≥ 2 source nodeswish to transmit independent information to distinct destinationnodes in the network. The channel h ij [n] between node i ∈ Sand node j ∈ D ∪ S\i in frame n is assumed to be frequencynon-selective. The squared channel magnitude between node iand j in frame n, normalized with respect to the power of the ad-ditive white Gaussian noise (AWGN) in the channel, is denotedas Hij [n].In each transmission session, the K source nodes involved

in the transmission session take turns transmitting using time-division multiple access (TDMA). A transmission session iscomposed of N ≥ 1 frames and each frame is composed of2K timeslots as shown in Fig. 2 for the case when K = 2. Inthe firstK timeslots of each frame, each source node transmits apacket to its destination. Due to the undirected nature of wireless

1

2 3

4

h12[n]

h13[n]

h21[n]h24[n]

h14[n]

h23[n]

Fig. 1. An ad hoc wireless network with L = 4 nodes. The two sourcenodes, shown in white and denoted as S = {1, 2}, wish to commu-nicate independent information to the two destination nodes, shownin gray and denoted as D = {d1, d2} = {3, 4}.

Time slot 11 transmits

Time slot 22 transmits

Time slot 31 relays

Time slot 42 relays

Frame 0 Frame N-1 Transmission session

Time slot 11 transmits

Time slot 22 transmits

Time slot 31 relays

Time slot 42 relays

Fig. 2. A transmission session composed of N frames with S = {1, 2}.

transmission, the TDMA transmissions in the first K timeslotsare also overheard by the other source nodes in the transmissionsession. The signal received by node j from node i in frame nis given as

yij [n] =√Hij [n]xi[n] + uij [n]

for all i ∈ S and all j ∈ D ∪ S\i in the current transmissionsession where xi[n] is the packet transmitted by source nodei in frame n and uij [n] is zero-mean unit-variance AWGN. Inthe remaining K timeslots of the frame, each source node canpotentially help one other source node by relaying a packet toits intended destination. When source node j ∈ S\i elects torelay a packet for source node i, it transmits a function of theobservation received from source node i in the first half of theframe. Destination node di ∈ D receives

zjdi [n] =√Hjdi [n]f(yij [n]) + vjdi [n]

where the relaying function f depends on the cooperativeprotocol [10] and vjdi [n] is zero-mean unit-variance AWGN.All noise terms are assumed to be spatially and temporallywhite. Note that each destination di ∈ D always receives atleast one observation in each frame, i.e., the direct transmissionyidi [n], and may receive two observations if another source nodeelects to relay the packet from source node i. It is assumed thatthe normalized channel magnitudesH[n] = {H ij [n]} are eitherfixed (non-fading), i.e.,H[n] ≡ H , or quasi-static and fading inthe sense thatH[n] is constant over the duration of each frame,but are independent and identically distributed (i.i.d.) in differ-ent frames of the transmission session. The current channel stateis assumed to be known by the K source nodes involved in thecurrent transmission session. Channel phases are only assumedto be known at the respective receivers.

III. TWO-PLAYER RELAYING GAMEThis section considers the scenario shown in Figs. 1 and 2

when there are two source nodes, denoted as S = {1, 2}, that

Page 3: A Game Theoretic Study of Energy Efficient Cooperative

wish to communicate independent information to two distinctdestination nodes, denoted as D = {d1, d2} = {3, 4}. We ex-tend the ideas developed in this section to the case with K>2source nodes in Section IV.

A. Stage Game Formulation

A stage game is defined in terms of the players, available ac-tions, and payoffs received by each player as a consequence ofthe actions for one frame of the current transmission session.The players in the game are the source nodes. In the first twotimeslots of frame n, each source node transmits to its desti-nation using transmit energy E1[n] and E2[n], respectively. Ifa source node does not request relaying from the other sourcenode, i.e., it uses direct transmission to its destination, it willtransmit with sufficient energy to satisfy a minimum quality-of-service (QoS) constraint, e.g., signal-to-noise ratio (SNR) orrate, at its destination based on the current channel state. Forexample, if the required at destination node d i is ρ, then sourcenode i will transmit with energy Ei[n] = ρ/Hidi [n] in frame nwhen it uses direct transmission. We denote the required directtransmission energy for source node i in frame n as E dt

i [n]. Ifa source node requests relaying from the other source node inframe n, it will transmit with energy 0 < Ei[n] < Edt

i [n].Referring to the timeslot schedule in Fig. 2, if node 2 has re-

quested relaying, then node 1 must decide in timeslot 3 whetherto fulfill this relaying request. Since both source nodes know thechannel state, node 1 can determine the minimum relaying en-ergy Ermin

1 [n] required to ensure the QoS constraint is satisfiedat destination node 4 [12]. Although node 1 can choose anynon-negative relaying energy E r

1 [n], a selfish node will neverrationally choose a relaying energy larger than the minimum re-quired relaying energy since there is no benefit to either sourcenode if a node expends excess relaying energy. If source node 1transmits with relaying energy less than the minimum requiredrelaying energy, then the packet will not be received at desti-nation node 4 with the required QoS and node 2 will need totransmit with additional energy at the end of the frame to ensurethe QoS constraint is satisfied. For these reasons, we assumethat node 1 chooses from the discrete set of actions “do not re-lay” (a1[n] = DNR ⇔ Er

1[n] = 0) and “relay with minimumrequired relaying energy” (a1[n] = R ⇔ Er

1[n] = Ermin1 [n])

in frame n. If source node 1 chooses the action DNR whenErmin1 [n] > 0, then node 2 will transmit at the end of the framewith the remaining energy required to ensure the QoS constraintis satisfied at node 4. Since the channel magnitudes are assumedto be constant over the duration of the frame, the total transmis-sion energy expended by node 2 in this case will be the same asif node 2 had used direct transmission in timeslot 2. If sourcenode 1 chooses the action R, the packet will be received bynode 4 at the required QoS level without any additional trans-mission energy from node 2.In timeslot 4, if node 1 has requested relaying, node 2 must

also decide between the actions DNR and R. The situation is thesame in this case as when node 1 relays for node 2 except thatnode 2 has the advantage of having just observed whether or notnode 1 fulfilled its relaying request and can choose its actionaccordingly.Since packets from both source nodes are always delivered

liT 1

tratS fo emarf

iT sem l to 1RR:1TD:1

tolsemiT 2RR:2TD:2RR:2TD:2

tolsemiT 3RND:1RND:1RND:1 R:1R:1 RND:1

tolsemiT 4RND:2RND:2RND:2 RND:2 RND:2RND:2R:2 R:2R:2

),( *21

nim εε r− )0,0()0,0()0,0()0,0( ),( nim2

*1

rεε − ),( nim2

*1

rεε − ),( *21

nim εε r− ),( nimnim2

*21

*1

rr εεεε −−

Fig. 3. Two-player relaying game in extensive form with source nodesS = {1, 2}. The actions DT, RR, R, and DNR correspond to “directtransmission (no relay request),” “request relay,” “relay,” and “do notrelay,” respectively. The pairs at the bottom of the tree correspond tothe payoffs of nodes 1 and 2, respectively, at the end of the frame.

to each destination irrespective of whether relaying requests arefulfilled or not, we define the stage game payoff as the trans-mission energy saved in the current frame with respect to directtransmission. The payoff received by source node i in frame nis denoted as πi(a[n], n) where

a[n] = (a1[n], a2[n])

∈ {(DNR,DNR), (DNR,R), (R,DNR), (R,R)}

is the action profile of both players in frame n. Note that when-ever a[n] = (DNR,DNR), both source nodes receive a payoffof zero (both nodes transmitted with the same energy as directtransmission). If source node i chooses R and node j choosesDNR, then node i receives a payoff of πi(a[n], n) = −Ermin

iand node j receives a payoff of πj(a[n], n) = E∗

j where

E∗j [n] := Edt

j [n]− Ej [n] > 0

is defined as the energy saved by node j with respect to directtransmission if node j requests relaying and node i *= j ful-fills the relaying request by relaying with sufficient energy toensure the QoS constraint is satisfied at destination node dj .Finally, if a[n] = (R,R), the source nodes receive payoffs(π1(a[n], n),π2(a[n], n)) = (E∗

1 − Ermin1 , E∗

2 − Ermin2 ). Fig. 3

summarizes the two-player relaying game in extensive form [13]and shows the payoffs received by each source node as a func-tion of the actions chosen by the players in the current frame.

B. Feasible Payoff Set and Pareto-Efficient Payoff Pairs

Based on the two-player stage game formulation in the pre-vious section, we note that both nodes transmit with energy0 < Ei[n] ≤ Edt

i [n] in the first two timeslots of the frame. Thisenergy, along with the relaying actions of both nodes in the thirdand fourth timeslots of the frame, specifies the payoff pair thatboth source nodes receive in the frame. We denote the set ofall non-negative feasible stage game payoffs as U ⊂ R2 and aPareto-efficient payoff pair as follows.Definition 1: A payoff pair (π1,π2) ∈ U is Pareto-efficient if

there exists no other payoff pair (π ′1,π

′2) ∈ U such that π′

1 > π1and π′

2 ≥ π2 or π′1 ≥ π1 and π′

2 > π2.In other words, a payoff pair is Pareto-efficient if it is impos-

sible for one source to improve its stage game payoff without

Page 4: A Game Theoretic Study of Energy Efficient Cooperative

reducing the payoff of the other source. We denote the set of allPareto-efficient payoff pairs as U ⊆ U .The notion of natural cooperation, i.e., cooperation without

extrinsic incentive mechanisms like virtual currency, is centeredaround the possibility that the set U\(0, 0) is not empty. Whenthis set is not empty, there exists at least one point where onesource node does strictly better than direct transmission andthe other node does no worse. When U\(0, 0) = ∅, it is clearthat both nodes will use direct transmission, since one or bothnodes will do worse than direct transmission otherwise. WhenU\(0, 0) *= ∅, there remains the question of what payoff pairin the set U the nodes should use. In a system with centralcontrol, one approach would be to minimize the total transmis-sion/relaying energy by maximizing the total payoff π 1 + π2.Selfish nodes, however, may not agree to this division of the“surplus”. The following section discusses how selfish sourcenodes can agree to a Pareto-efficient payoff pair using axiomaticbargaining tools from cooperative game theory.

C. Axiomatic Bargaining for Cooperative Energy Allocation

When the set U\(0, 0) is not empty, selfish nodes will at-tempt to arrive at a unique mutually agreeable payoff pair (and,consequently, a unique energy allocation) through “bargaining”.The bargaining problem is one of the paradigms of cooperativegame theory in which a group of two or more participants arefaced with a set of feasible outcomes, any of which can be thebargaining solution if agreed to unanimously. Our use of theterm bargaining here is somewhat misleading in the sense thatthe nodes do not actually bargain by communicating offers andcounteroffers to each other. Rather, since the channel state isknown to both source nodes and each node knows how the otherwill bargain, each node can determine the bargaining solutionlocally without any additional communication. The technique ofuniquely dividing a surplus among selfish players is commonlycalled “bargaining” in the cooperative game-theory literature,however, and we will use this term here for consistency.Let us define the pair (U ,∆) as the bargaining problem,

where U is the set of all non-negative feasible stage game pay-off pairs and ∆ = (0, 0) is the disagreement payoff. If bothsources fail to reach an agreement, they use direct transmissionto deliver their packets to their intended destinations and receivethe disagreement payoff in the current stage game. Note that∆ is always in U because direct transmission is always feasi-ble. It is assumed that U is a convex and closed set, boundedfrom above1. Given the definition of the bargaining problemand the set of Pareto-efficient payoff pairs U , an axiomatic bar-gaining solution is a function B, based on a set of “reasonable”axioms, that maps every (U ,∆) to a unique member of U . Spec-ifying these axioms serves to characterize the solution uniquelyfrom among the set of Pareto-efficient points.The most commonly used axiomatic bargaining solution is

the Nash bargaining solution (NBS) which is based on foursimple and well-accepted axioms and has been shown to haveclose connections to subgame-perfect equilibria in infinite hori-zon games [13]. These axioms can be briefly described as

1Sufficient conditions under which this assumption holds are provided in theAppendix.

1. (Pareto-efficiency) the bargaining solutionB(U ,∆) must bePareto-efficient.

2. (Independence of linear transformations) if T = ax + bwith a > 0, then the bargaining solution B(T (U), T (∆)) =T (B(U ,∆)), i.e., the bargaining solution must be indepen-dent of the utility scales of the players.

3. (Symmetry) the bargaining solutionB(U ,∆) will give equalpayoffs to both players if the set U is symmetric in the sensethat (u1, u2) ∈ U implies (u2, u1) ∈ U .

4. (Independence of irrelevant alternatives) if V ⊆ U andB(U ,∆) ∈ V , then B(V ,∆) = B(U ,∆), i.e., the addi-tion of irrelevant alternatives does not affect the bargainingsolution.

Other axiomatic bargaining solutions based on different sets ofaxioms include the Raiffa Kalai-Smorodinsky (RBS) [14] andthe modified Thomson bargaining solutions (MTBS). A unifiedview of all these axiomatic models is presented in [15].The NBS, RBS, and MTBS bargaining solutions can all be

expressed as

Bβ(U ,∆) = arg max(w1,w2)∈U

w1w2 (1)

where β is a scalar parameter specified by the bargaining solu-tion (β = 0 for NBS, β = 1 for RBS, and β = −1 for MTBS)and

wi :=πimi

+ β

(1− πj

mj

)

is the preference function of source node i, with j ∈ {1, 2},j *= i, and mi is the maximum stage game payoff of sourcei over U . The value of β in the preference function impliesa tradeoff between a player’s own gain and the other player’slosses, normalized by each player’s maximum gain. When play-ers use the NBS (β = 0), the bargaining solution is such thatplayers only maximize their own payoff without considerationof the losses incurred by the other player. When players use theRBS (β = 1), the bargaining solution is such that each player’spayoff is proportional to its maximum. Finally, when playersuse the MTBS (β = −1), each player has the same preferencefunction and the bargaining solution is such that the sum of thenormalized payoffs π1/m1 + π2/m2 is maximized.To illustrate the feasible payoff set, the Pareto-efficient sub-

set, as well as the different bargaining solutions, Fig. 4 showsthe positive quadrant of the feasible stage game payoffs as wellas the NBS, RBS, and MTBS for a two-player relaying gameusing OAF cooperation with channel state H12 = H21 = 5,H13 = 0.5, H14 = 5, H23 = 3, and H24 = 0.9 and anSNR = 10 dB QoS constraint. The maximization in (1) is per-formed numerically for β ∈ {−1, 0, 1} to obtain the three differ-ent bargaining solutions. The centrally controlled maximum to-tal payoff (or, equivalently, minimum total energy), is also plot-ted for comparison.

D. Stage Game Nash Equilibrium Analysis

Given a bargaining solution in the two-player stage game suchthat Ermin

i > 0 for both source nodes, this section considersthe question of whether selfish source nodes really will follow

Page 5: A Game Theoretic Study of Energy Efficient Cooperative

0 2 4 6 8 10 12 14 160

1

2

3

4

5

6

7

8

9

10

Payoff player 1

Payo

ff p

laye

r 2

Feasible payoff allocationRKS bargaining solution ( β = 1)Nash bargaining solution ( β = 0)Modified Thomson bargaining solution ( β = −1)Maximum sum payoff solution

Fig. 4. Illustration of NBS, RBS, and MTBS two-player bargaining solu-tions over the set of feasible stage game payoffs, U .

through on the agreement by looking at the consequences of de-fection in the context of non-cooperative game theory. Inspec-tion of the payoff pairs in Fig. 3 shows that, when node 2 isrequested to relay such that E rmin

2 > 0, node 2 will choose theaction DNR because its payoff of choosing DNR is always bet-ter than choosing R, irrespective of node 1’s actions. Knowingthis, node 1 will also choose the action DNR for the same rea-sons. Since both source nodes know that any relay requests willalways be rejected, both source nodes will choose to communi-cate with their respective destinations by direct transmission andreceive the payoff pair (π1(a[n], n),π2(a[n], n)) = (0, 0).To formalize this result, we briefly review the concept of a

Nash Equilibrium (NE) [13]. In a k-player game, the action pro-file (a∗1, · · ·, a∗k) is an NE if, for each player i, a∗

i is player i’sbest response to a∗

−i, where a−i denotes the actions of all theplayers except player i. In frame n, this can be expressed as

πi({a∗i [n],a∗−i[n]}, n) ≥ πi({ai[n],a∗

−i[n]}, n)

for all ai in the set of available actions for player i and where π i

is the payoff function player i. Intuitively, if all of the playersare choosing NE actions, no player can increase their payoffby unilaterally deviating from the NE action profile. It is notdifficult to show that the only NE of the two-player relayingstage game is a[n] = (DNR,DNR).The dilemma in this result is that the bargaining solution de-

veloped in the previous section specifies an energy allocationsuch that both nodes would receive a payoff better than (0, 0) byaccepting relay requests. In other words, if the channel state issuch that both nodes could save energy through mutual cooper-ation, both nodes would do better by choosing a[n] = (R,R)than a[n] = (DNR,DNR). But selfish nodes must act ratio-nally, and a simple non-cooperative game theoretical analysis ofthe extensive form of the stage game shows that there is only oneNE action profile for selfish players: Mutual non-cooperation. Inthe following section, we extend this stage game analysis to arepeated game formulation and show that, unlike a single-stagegame, a repeated game with uncertain ending, i.e., a transmis-

sion session with an uncertain number of frames, can include amutually cooperative NE for selfish players.

E. Repeated-Game Nash Equilibrium Analysis

Since each transmission session is composed of N ≥ 1frames, the stage game formulation developed in the prior sec-tion can be extended to a repeated-gamemodel where the play-ers interact over multiple stage games. If N is known to bothsource nodes in the current transmission session, backward in-duction arguments can be used to show that both players willchoose a[n] = (DNR,DNR) in each stage game. To see this,first consider the last stage game. Since there is no possibilityof gain from future cooperation, node 2 will rationally chooseDNR to maximize its payoff. Knowing this, node 1 will alsochoose DNR for the same reasons. Since each node knows thatthe other node will choose DNR in the last stage game, they willalso choose a[n] = (DNR,DNR) in the second to last stagegame, and so on, ensuring that the only rational strategy for bothsource nodes is to reject relay requests in all of the stage games[16, p.10].Now consider the scenario when the number of frames in the

current transmission session is not known by the source nodes.This scenario can occur, for example, in a cognitive radio net-work where the source nodes are secondary users and the endof the game occurs when the primary user becomes active. Weconsider the case when the transmission session continues af-ter the current frame with fixed probability δ, where δ is knownto both source nodes. In this case, the number of frames N is ageometrically distributed random variable with probability massfunction pN (n) = (1−δ)δn−1 for n = 1, 2, · · ·. Since the num-ber of frames in the current transmission session is not known tothe source nodes (and, as will be discussed in subsection III-E,the payoffs in future frames may also be unknown), both sourcenodes seek to maximize their expected total payoff in the trans-mission session. We define the expected total payoff of node ias

Πi := E

{N−1∑

n=0

πi(a[n], n)

}

where N is random and πi(a[n], n) is random when the chan-nels are fading. Under the assumption that the stage game pay-offs are independent of N , the expected total payoff can beshown to be equivalent to a repeated game having an infinitenumber of stages with future payoffs discounted according tothe expected duration of the game [16]. For player i, this can beexpressed as

Πi =∞∑

n=0

δnE{πi(a[n], n)}

where δ is called the discount factor and πi(a[n], n) is the ithplayer’s payoff in frame n given action profile a[n].In repeated games, players use a strategy to specify their ac-

tions in each stage game as a function of the channel state, co-operative protocol, QoS constraint, and previous actions of theother players. We define a trigger strategy in the repeated two-player relaying game as follows: If the bargaining solution spec-ifies Ermin

i [n] > 0, player i chooses the action ai[n] = R unlessthe other player has previously chosen DNR when relaying was

Page 6: A Game Theoretic Study of Energy Efficient Cooperative

requested. If player i chooses DNR when E rmini [n] > 0, then

player i is said to defect. If either player defects, the otherplayer “triggers” punishment by choosing DNR in all futurestage games (note that, since node 2 chooses its action after itobserves the action of node 1 in the current stage game, node2 will trigger punishment by playing DNR in the current stagegame).In our analysis of the repeated game scenario, the channel

states in the current and previous frames are assumed to beknown to both sources. When the channels are fading, the chan-nel states in future frames are not known; only their distributionis known. The energy allocation

E [n] = {E1[n], E2[n], Ermin1 [n], Ermin

2 [n]}

is assumed to be dynamically determined via a bargaining solu-tion in each frame n = 0, 1, · · · according to the known chan-nel state, the cooperative protocol, and the QoS constraint. Thefollowing proposition establishes necessary and sufficient con-ditions under which the (TRIGGER,TRIGGER) strategy profile isan NE of the repeated two-player relaying game with uncertainending in systems with quasi-static i.i.d. fading channels.Proposition 1: In a system with quasi-static i.i.d. fading

channels, the strategy profile (TRIGGER,TRIGGER) is an NE ofthe repeated two-player relaying game with uncertain ending ifand only if E rmin

1 [n] ≤ E∗1 [n] +

δ1−δ E1 and E

rmin2 [n] ≤ δ

1−δ E2for all n = 0, 1, · · ·, where Ei := E {E∗

i [n]− Ermini [n]}.

Proof: In frame n′, if both nodes have faithfully playedand continue to play the strategy profile (TRIGGER,TRIGGER),they will receive an expected total payoff of

Πi =n′−1∑

n=0

(E∗i [n]− Ermin

i [n]) + (E∗i [n

′]− Ermini [n′])

+∞∑

n=n′+1

δn−n′Ei.

The first and second terms in this expression correspond to theknown total payoff of the previous frames and the known pay-off of the current frame, respectively. The final term in this ex-pression corresponds to the expected total payoff from mutualcooperation in future stage games where the i.i.d. channel stateassumption has been used to remove the dependence of themeanon n.If node 1 deviates from the TRIGGER strategy by defecting in

stage game n′, it will receive a total payoff of

Π1 =n′−1∑

n=0

(E∗1 [n]− Ermin

1 [n])

because node 2 will punish node 1 immediately for its defec-tion in the current stage game. Note that this total expectedpayoff does not exceed the total expected payoff from faith-fully playing the TRIGGER strategy when (E ∗

1 [n′]−Ermin

1 [n′])+∑∞n=n′+1 δ

n−n′ E1 ≥ 0. Hence, node 1 has no incentive todeviate from the strategy profile (TRIGGER,TRIGGER) whenErmin1 [n′] ≤ E∗

1 [n′] + δ

1−δ E1.

If node 2 deviates from the TRIGGER strategy by defecting instage game n′, it is punished by node 1 in the next stage gameand receives a total expected payoff of

Π2 =n′−1∑

n=0

(E∗2 [n]− Ermin

2 [n]) + E∗2 [n

′].

The second term here corresponds to the payoff received bynode 2 in stage game n′ when its packet is forwarded by node 1but it does not reciprocate. This total expected payoff does notexceed the total expected payoff from faithfully playing theTRIGGER strategy when E rmin

2 [n′] ≤ δ1−δ E2. !

Proposition 1 implies that, as long as both source nodes canfind a bargaining solution that specifies an energy allocationsuch that the nodes are not requested to expend “too much”relaying energy in the current frame, then mutual cooperation(with the threat of punishment for defection) is an NE of the re-peated two-player relaying game with uncertain ending. In eachframe, each source node must check the NE conditions to de-termine if the bargaining solution implies an energy allocationthat satisfies the NE criteria. If not, both sources use direct trans-mission in the current frame. Note that this action is not inter-preted as defection since both sources have agreed to use directtransmission in this frame. We also note that the strategy pro-file (ALWAYS DEFECT, ALWAYS DEFECT) is also an NE of therepeated two-player relaying game with uncertain ending sinceneither player stands to gain from cooperation with an opponentthat always defects.It is worth mentioning here that the quantity Ei :=

E {E∗i [n]− Ermin

i [n]} in Proposition 1 is implicitly defined inthe sense that the expectation on the right hand side of thisequality depends on E1 and E2. In the numerical results in Sec-tion V, we calculate E1 and E2 numerically by first comput-ing N bargaining solutions based on N quasi-static i.i.d. fad-ing channel realizations and then computing the sample means1N

∑Nn=1 {E∗

i [n]− Ermini [n]} for i = 1, 2. These values are used

as the first estimates of E1 and E2 to check whether the energyallocation specified by the bargaining solution in each frame sat-isfies the NE criteria. If any of the energy allocations fail to sat-isfy the NE criteria, the energy allocations in these frames areset to direct transmission. The sample means are then recom-puted and this process is performed iteratively until the energyallocations in all of the frames satisfy the NE criteria based onthe sample means.As a special case of Proposition 1, we can also consider

a system with non-fading channels where the channel state isthe same over all of the frames in the transmission session,i.e., H[n] ≡ H for all n = 0, 1, · · ·, and the sources usefixed energy allocations, i.e., E [n] ≡ {E1, E2, Ermin

1 , Ermin2 } for

all n = 0, 1, · · ·. The following corollary establishes necessaryand sufficient conditions under which the (TRIGGER,TRIGGER)strategy profile is an NE of the repeated two-player relayinggame with uncertain ending in systems with non-fading chan-nels.Corollary 1: In a system with non-fading channels, the strat-

egy profile (TRIGGER,TRIGGER) is an NE of the repeated two-player relaying game with uncertain ending if and only ifErmin1 ≤ E∗

1 and Ermin2 ≤ δE∗

2 .

Page 7: A Game Theoretic Study of Energy Efficient Cooperative

The proof of Corollary 1 follows from Proposition 1 by sub-stituting Ei = E∗

i − Ermini for i ∈ {1, 2}. Unlike the case with

fading channels, all of the future payoffs are known when thechannels are non-fading; only the duration of the transmissionsession is unknown. Both source nodes have no incentive todefect when they expect the transmission session to be longenough such that they receive more long-term benefit from co-operation than short-term benefit from defection. The NE con-ditions are also simple to compute in this case.In each frame n = 0, 1, · · ·, Proposition 1 (or Corollary 1

in systems with non-fading channels) identifies a set of fea-sible energy allocations under which selfish nodes will ratio-nally choosemutual cooperation. If the bargaining solution fromsubsection III-C implies an energy allocation that is in this set,then selfish nodes will follow through on their agreement, relay-ing messages from each other and receiving stage game payoffsaccording to the bargaining solution. If the NE criteria are notsatisfied for energy allocation implied by the bargaining solu-tion, then both nodes will use direct transmission in the currentstage game.

IV. K >2 PLAYER RELAYING GAME

The previous section establishes how a pair of selfish sourcenodes can arrive at a mutually agreeable energy allocationthrough bargaining and also establishes the conditions underwhich this pair of nodes will not defect from their bargainingagreement. This section extends these results to transmissionsessions withK>2 source nodes. We restrict our attention hereto the particular scenario in which the source nodes form fixedtwo-player partnerships for the duration of a transmission ses-sion. While this restriction excludes more general network con-figurations, e.g., relaying coalitions with more than two nodesor dynamic partnerships, the restriction to fixed partnerships isbased on the principle that nodes are less likely to defect if theprobability of additional interactions with their current partneris high; dynamic partnerships among K selfish source nodeswhenK is large can result in non-cooperative behavior becauseof the low likelihood of additional interactions after the currentframe. The central problem is then the assignment of partnersto each source node (note that one source will have no partnerwhen K is odd). Specifically, we consider the problem of howto form stable partnerships endogenously by a network of selfishsource nodes.We define a pairing instance P as a set of two-player part-

nerships in which all but at most one source nodes are disjointlypaired. It is not difficult to show that each pairing instance in aK>2 player relaying game is an equilibriumwith respect to uni-lateral deviations when the energy allocations are determined bya bargaining solution and the NE conditions are satisfied. Pairinginstances may not be an equilibriumwith respect to multi-playerdeviation, however, where two or more players leave their cur-rent partners and form different partnerships. As an example,consider a network with K = 4 source nodes denoted as S ={1, 2, 3, 4} and a pairing instance P = {{1, 2}, {3, 4}}. Sup-pose that all nodes receive an identical expected payoff of πP >0 under this pairing instance. Suppose further that, under pair-ing instanceQ = {{1, 3}, {2, 4}}, nodes 1 and 3 each receive a

payoff of πQ > πP while nodes 2 and 4 receive a payoff of zero.It is clear that nodes 1 and 2 both improve their payoff by deviat-ing from pairing instanceP toQ, and they can do so without anyconsent (or repercussions) from nodes 3 and 4. Hence, althoughpairing instance P is an equilibrium with respect to unilateraldeviation, it is not an equilibrium with respect to multi-playerdeviations.While there are many notions of stability in K > 2 player

games, we restrict our attention here to the notion of a pairwise-stable network [17]. A pairwise-stable network is a pairing in-stance that is immune to any improving two-player deviations,where an improving two-player deviation in our context is a de-viation in which two players sever their current partnerships andform a new partnership such that at least one player in the newpartnership receives a strictly greater expected payoff while theother player in the new partnership receives an expected payoffno worse than before. Pairing instance P in the previous para-graph is clearly not pairwise stable.The problem of how to endogenously form a pairwise-stable

network among selfish nodes has been studied extensively un-der the title of stable matching problems [11]. Stable match-ing problems are generally divided into two categories: Two-sided matching and one-sided matching. In a two-sided match-ing problem, also referred to as the marriage problem, there aretwo disjoint sets of participants and the matching is a one-to-one mapping between the two sets. This is the classic matchingproblem discussed in [11], where it is shown that every instanceof the two-sided matching problem always admits at least onepairwise-stable solution. In the one-sided matching problem, amatching results in a partition of the single set of participantsinto disjoint pairs. This is a generalization of the marriage prob-lem and is known as the roommate problem. A major differencebetween a two-sided (marriage) problem and a one-sided (room-mate) problem is that the roommate problem may not necessar-ily have a stable matching.In our context with K>2 source nodes that need to endoge-

nously form a pairing instance to receive positive payoffs, thematching problem is one-sided since the source nodes are ho-mogeneous. In the absence of central control, the source nodescan attempt to form a stable matching at the start of the trans-mission session by first computing the expected bargaining pay-offs (also taking into consideration that the bargaining payoff iszero if the energy allocation does not satisfy the NE conditions)for each of the K − 1 possible partners in the network. Thesepayoffs then imply a preference table, known to each node, thatis used to determine a pairwise-stable matching, if one exists.If a pairwise-stable matching exists, the nodes then cooperatein the transmission session with these pairings using transmis-sion/relaying energies specified by the appropriate bargainingsolution (or direct transmission if the bargaining solution spec-ifies disagreement or does not satisfy the NE conditions). If apairwise-stable matching does not exist, one approach is to re-sort to direct transmission in the current transmission session.Another approach is to locally compute the centrally controlledpairings, i.e., the pairing instance that would result in the maxi-mum sum (or product) payoff over all of the source nodes, andform partnerships based on this pairing. We note that both ofthese solutions are stable with respect to unilateral deviations

Page 8: A Game Theoretic Study of Energy Efficient Cooperative

Table 1. Preference table for a 4-source matching problem.

Source First Second Thirdpreference preference preference

1 4 2 32 1 3 43 2 4 14 3 1 2

but may not be pairwise stable.As an example of finding stable matchings, consider a sce-

nario in which K = 4 sources compute their bargaining solu-tions with each other and generate the preference table givenin Table 1. In this example, it is impossible for each node tobe paired with their first preference since, for example, node 1prefers node 4, but node 4 prefers node 3. Nevertheless, it is notdifficult to verify that {(1, 2), (3, 4)} is a stable matching forthis example. Source 1 would prefer source 4 to source 2, butsource 4 is paired with his first choice and is unwilling to de-viate from this solution. Similarly, source 3 prefers source 2 tosource 4, but source 2 is paired with his first choice. Of the otherpossible matchings, it is not difficult to show that {(1, 4), (2, 3)}is stable and {(1, 3), (2, 4)} is unstable. For small networks, it isstraightforward to simply test all possible matchings for stabil-ity. For larger networks, a more efficient algorithm for findingstable matchings with complexity ofO(n2) for systems with 2nplayers is provided in [18].

V. NUMERICAL RESULTSTo compare the performance of wireless networks with self-

ish energy allocation and endogenous partner selection to thatof networks under central control, this section provides numer-ical examples demonstrating the relative energy efficiency ofseveral schemes under two performance metrics: (i) The totalenergy savings of the network (sum payoff) and (ii) the prod-uct of the energy savings of the network (product payoff), bothwith respect to direct transmission. In all of the numerical re-sults presented here, we assume a wireless network using OAFcooperative transmission, a discount factor of δ = 0.98, andan SNR = 10 dB QoS requirement. In each transmission ses-sion, K source and K destination nodes are randomly placed(with uniform distribution) on a disk of radius R = 10 meters.The relative energy efficiency of several schemes, averaged overthe random node positions (and over the channel realizations insystems with fading channels), are compared to a system withcentrally controlled (CC) energy allocations, i.e., the energy al-location resulting in the maximum sum/product payoff for eachpair, and CC pairing assignments, i.e., the pairing assignmentresulting in the maximum sum/product payoff over the network.A relative energy efficiency of one corresponds to the maximumnetwork payoff according to the sum/product payoff metric (ob-tained through CC energy allocations and CC pairing assign-ments) and a relative energy efficiency of zero corresponds todirect transmission.Fig. 5 shows the relative energy efficiency of a wireless net-

work using cooperative transmission through non-fading path-loss channels under the sum payoff metric. The squared chan-

2 3 4 5 6 7 8 9 10 11 120

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Number of nodes in transmission session (K)

Rela

tive

ener

gy e

ffic

ienc

y

CC (max sum payoff) energy allocation with random pairingsNBS/NE energy allocation with CC (max sum payoff) pairingsNBS/NE energy allocation with optimistic SR pairingsNBS/NE energy allocation with pessimistic SR pairingsNBS/NE energy allocation with random pairings

Fig. 5. Relative energy efficiency of a wireless network under the sumpayoff metric with non-fading pathloss channels using OAF cooperativetransmission with CC or selfish (NBS/NE) energy allocation and withrandom, CC, or optimistic/pessimistic SR pairing assignments.

nel magnitude between each node pair i and j is calculated asHij =

((xi − xj)2 + (yi − yj)2

)−γ/2 where γ = 4 is the path-loss parameter and (xi, yi) is the cartesian coordinate pair ofnode i. The CC energy allocations in this case correspond tothe maximum sum payoff for each node pair and the CC pairingassignments correspond to the maximum sum payoff over thenetwork. These results show that a system with purely selfishenergy allocations (NBS/NE) and endogenously formed nodepairings (SR) can achieve a relative energy efficiency of approx-imately half of that of a system with CC energy allocations andCC pairings for values of K ≥ 10. Since a stable roommatenode pairing solution does not always exist, Fig. 5 shows “op-timistic” and “pessimistic” bounds on the relative energy effi-ciency of NBS/NE energy allocations with SR pairings by usingthe CC pairing assignment and direct transmission, respectively,when a SR pairing solution is not found. Note that the resultscorresponding to CC energy allocations are not stable with re-spect to unilateral deviations because some nodes may receivenegative payoffs under a CC energy allocation and these nodeswould rationally choose defection. Also note that all of the re-sults corresponding to NBS/NE energy allocations are stablewith respect to unilateral deviations in the sense that no singlenode can improve its payoff by defecting. The NBS/NE resultswith SR pairings are pairwise stable when the SR pairing solu-tion exists, whereas the CC and random pairings are not.Fig. 6 shows the relative energy efficiency of a wireless net-

work using cooperative transmission through non-fading path-loss channels under the product payoff metric. As discussed in[19] and [20], the product payoff metric, unlike sum payoff, ac-counts for some degree of fairness in the payoffs since no nodeshould be given a small (or zero) payoff when the goal is tomaximize the product payoff. The CC energy allocations in thiscase correspond to the NBS energy allocations for each nodepair (since, according to its definition in (1), the NBSmaximizesthe product payoff for a given node pair) and the CC pairing as-signments correspond to the maximum product payoff over the

Page 9: A Game Theoretic Study of Energy Efficient Cooperative

2 3 4 5 6 7 8 9 10 11 120

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Number of nodes in transmission session (K)

Rela

tive

ener

gy e

ffic

ienc

y

CC (max product payoff) energy allocation with random pairingsNBS/NE energy allocation with CC (max product payoff) pairingsNBS/NE energy allocation with optimistic SR pairingsNBS/NE energy allocation with pessimistic SR pairingsNBS/NE energy allocation with random pairings

Fig. 6. Relative energy efficiency of a wireless network under the productpayoff metric with non-fading pathloss channels using OAF cooperativetransmission with CC or selfish (NBS/NE) energy allocation and withrandom, CC, or optimistic/pessimistic SR pairing assignments.

network. Since the CC energy allocations are the NBS energyallocations in this example, this metric also allows us to iso-late the effect of the pairing mechanism on the efficiency of thenetwork. The results in Fig. 6 show that the SR pairing schemecan provide more than 75% of the product payoff of the CCpairing scheme in terms of energy efficiency for the network,and clearly outperforms random pairing by margin of 50% atN = 10 nodes. Note that when we implement NBS locally ona per node basis, the nodes cooperate only if the NE criterionis satisfied. However, when we use the NBS energy allocationsin central manner, the nodes are forced to cooperate regardlessof the NE criterion. The results in Fig. 6 demonstrate that satis-fying NE criteria has an almost negligible effect on the energyefficiency of the network in non-fading channels.Fig. 7 shows the relative energy efficiency of a wireless net-

work using cooperative transmission through quasi-static fad-ing channels under the sum payoff metric. The channel meansare specified according to the path-loss model used in the previ-ous simulations and the channel realizationshij [n] are generatedindependently (both spatially and temporally) according to theNakagami-m distribution with m = 2. The results in this caseare similar to those in Fig. 5, with all of the selfish techniquesperforming slightly better in fading channels with respect to asystem under central control. A system with purely selfish en-ergy allocations (NBS/NE) and endogenously formed node pair-ings (pessimistic SR) can achieve an energy efficiency of betterthan 50% of that of a system with CC energy allocation and CCpairings forN ≥ 6.Overall, in the absence of central control in a wireless ad

hoc network, these numerical examples confirm that there isa price to pay for selfish behavior in terms of the overall en-ergy efficiency of the network. Whether this price is too highdepends to a large extent on the application. Nevertheless, theseresults also demonstrate that a network with sources that en-dogenously form cooperative pairs and selfishly allocate trans-mission/relaying energies without any external incentivemecha-

2 3 4 5 6 7 8 9 10 11 120

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Number of nodes in transmission session (K)

Rela

tive

ener

gy e

ffic

ienc

y

CC (max sum payoff) energy allocation with random pairingsNBS/NE energy allocation with CC (max sum payoff) pairingsNBS/NE energy allocation with optimistic SR pairingsNBS/NE energy allocation with pessimistic SR pairingsNBS/NE energy allocation with random pairings

Fig. 7. Relative energy efficiency of a wireless network under the sumpayoff metric with Nakagami-m (m = 2) independent fading channelsusing OAF cooperative transmission with CC or selfish (NBS/NE)energy allocation and with random, CC, or optimistic/pessimistic SRpairing assignments.

nisms or altruistic nodes to enforce cooperation can significantlyimprove the overall energy efficiency of the network with re-spect to that of direct transmission.

VI. CONCLUSIONThis paper employs both cooperative and non-cooperative

game theoretic tools to analyze the energy efficiency of wirelessad hoc networks with selfish energy allocation and endogenouspartner selection. The novelty of this study is that cooperationis established without the added complexity of extrinsic incen-tive mechanisms, altruistic nodes, and/or community enforce-ment. We first used axiomatic bargaining tools from cooperativegame theory to describe how two selfish source nodes can de-cide on a mutually-agreeable Pareto-efficient energy allocationthat can be locally computed without central authority in scenar-ios where mutual benefit is possible through cooperative trans-mission. We then analyzed the conditions under which selfishnodes would follow through on this agreement and not defect ina repeated relaying game scenario and developed the necessaryand sufficient conditions under which “natural” cooperation ispossible for both fading and non-fading channels. We then ex-tended the two-player model to networks withK>2 players andproposed a technique to endogenously form cooperative partner-ships without central control through the stable roommate algo-rithm. Finally, we provided numerical results based on OAF co-operative transmission to quantify the energy efficiency of an adhoc wireless network with selfish sources with respect to a net-work with centrally optimized energy allocations and pairings.

APPENDIXThis appendix describes a set of sufficient conditions under

which the non-negative feasible payoff set U is convex, closed,and bounded from above. Recall that the payoff received bynode j is defined as the energy saved by node j with respect

Page 10: A Game Theoretic Study of Energy Efficient Cooperative

to direct transmission. When node i *= j is the relay for node j,this payoff can be expressed as

πj = E∗j − Ermin

j = ψj(Ermini )− Ermin

j

where ψj : [0,∞) .→ [0, Edtj ) is the relay energy function that

maps the relaying energy offered by node i, i.e., E rmini , to the

reduction in transmit energy with respect to direct transmissionat node j, i.e., E∗

j . Note that ψj is bounded from above since itis impossible for node j to reduce its transmit energy by morethan Edt

j . It is also reasonable to assume that any useful relay en-ergy function ψj is monotonically increasing since any increasein the relaying energy offered by node i should not diminish thereduction in transmit energy experienced by node j. In additionto these properties, we assume that ψj is concave and continu-ous. The concavity condition, in particular, is intuitively justi-fied by the fact that each increase of the relaying energy offeredby node i should result in smaller reductions in transmit energyat node j. We note that the amplify-and-forward relaying func-tion satisfies all of these properties [12].Before proving that a pair of nodes using monotonically in-

creasing, concave, continuous, and bounded from above relayenergy functions have a non-negative feasible payoff set that isconvex, closed, and bounded from above, we will first show thatif (πi,πj) is a feasible payoff pair, then all payoff pairs (π ′

i,π′j)

such that π′i ≤ πi and π′

j ≤ πj are also feasible.Lemma 1: Assume ψi and ψj are monotonically increasing,

concave, continuous, and bounded from above. If (π i,πj) isa feasible payoff pair, then any payoff pair (π ′

i,π′j) such that

π′i ≤ πi and π′

j ≤ πj is also feasible.Proof: The proof follows from the fact that either node

can reduce its payoff arbitrarily by increasing its relay energy.Each node’s relaying energy is continuous and unbounded. Sup-pose node i increases its relaying energy to reduce its payoff toπ′i. Since ψj is monotonically increasing, this increase in relay-ing energy by node i may lead to an increased payoff at node j.Hence, node j must increase its relaying energy energy to re-duce its payoff to π ′

j . This, in turn, may lead to an increasedpayoff at node i, hence node i must again increase its relayingenergy to reduce its payoff to π ′

i. This iteration is repeated be-tween nodes i and j and converges because ψ i and ψj are con-cave, continuous, and bounded from above. Hence, it is possiblefor both nodes achieve any payoff pair (π ′

i,π′j)with π′

i ≤ πi andπ′j ≤ πj . !

This result will be used in the following lemma.Lemma 2: If ψi and ψj are monotonically increasing, con-

cave, continuous, and bounded from above then the non-negative feasible payoff set U is convex, closed, and boundedfrom above.

Proof: We will first prove that U is convex. For nota-tional convenience, let (xi, xj) := (Ermin

i , Erminj ) denote a valid

relaying energy allocation with xi ≥ 0 and xj ≥ 0. Supposethere exists two feasible payoff pairs in U with valid relaying en-ergy allocations denoted as (xi, xj) → (πi,πj) and (x′

i, x′j) →

(π′i,π

′j). We will show that (απi+(1−α)π′

i,απj +(1−α)π′j)

is also in U for all 0 ≤ α ≤ 1, hence the set U is convex.Form the relaying energy allocation (x ′′

i , x′′j ) = α(xi, xj) +

(1−α)(x′i, x

′j). Note that this relaying energy allocation is valid

for all 0 ≤ α ≤ 1. The payoff for node i is

π′′i = ψi(αxj + (1− α)x′

j)− αxi − (1 − α)x′i

≥ αψi(xj) + (1 − α)ψi(x′j)− αxi − (1− α)x′

i

= απi + (1− α)π′i

where the inequality is a consequence of the concavity of π i.Similarly, π′′

j ≥ απj+(1−α)π′j . Hence, if (πi,πj) and (π′

i,π′j)

are feasible payoff pairs and 0 ≤ α ≤ 1, the nodes can achieveanother feasible payoff pair (x ′′

i , x′′j ) → (π′′

i ,π′′j ) with a valid

relaying energy allocation such that π ′′i ≥ απi + (1− α)π′

i andπ′′j ≥ απj + (1 − α)π′

j . Lemma 1 implies that the payoff pair(απi+(1−α)π′

i,απj+(1−α)π′j)must also be feasible, hence

U is convex.To see that U is closed and bounded from above, let X =

{(xi, xj) : 0 ≤ xi ≤ Edti and 0 ≤ xj ≤ Edt

j }. Note that X isa compact set and includes all of the relaying energy allocationsunder which both nodes receive a non-negative payoff. Let Y ={(πi,πj) : (xi, xj) → (πi,πj) ∀(xi, xj) ∈ X} be the imageof X in the payoff plane. Note that U ⊆ Y and that Y mustalso be compact since ψi and ψj are continuous functions. Theset U can be formed by taking the intersection of Y with the setZ = {(πi,πj) : 0 ≤ πi ≤ Edt

i and 0 ≤ πj ≤ Edtj }. Since the

intersection of two compact sets is also compact, U must also becompact. Hence, U is closed and bounded from above. !

REFERENCES[1] Q. Zhao and L. Tong, “Energy efficiency of large-scale wireless networks:

Pro-active vs. reactive networking,” IEEE J. Sel. Areas Commun., vol. 23,no. 5, pp. 1100–1112, May 2005.

[2] S. Marti, T. J. Giuli, K. Lai, and M. Baker, “Mitigating routing misbehav-ior in mobile ad hoc networks,” in Proc. ACM/IEEE Mobicom, 2000, pp.255–265.

[3] J. Yang, A. G. Klein, and D. Brown III, “Cooperation without extrinsicincentive mechanisms in wireless networks with selfish nodes,” to be ap-peared in IEEE Signal Process. Mag., 2009.

[4] J.-P. Hubaux, T. Gross, J.-Y. Le Boudec, and M. Vetterli, “Towards self-organized mobile ad hoc networks: The terminodes project,” IEEE Com-mun. Mag., vol. 39, no. 1, pp. 118–124, Jan. 2001.

[5] L. Blazevic, L. Buttyan, S. Capkun, S. Giordano, J.-P. Hubaux, and J.-Y. L.Boudec, “Self organization in mobile ad hoc networks: The approach ofterminodes,” IEEECommun. Mag., vol. 39, no. 6, pp. 166–174, June 2001.

[6] M. Jakobsson, J.-P. Hubaux, and L. Buttyan, “A micro-payment schemeencouraging collaboration in multi-hop cellular networks,” FinancialCrypto, 2003.

[7] L. Lai and H. El Gamal, “On cooperation in energy efficient wirelessnetworks: The role of altruistic nodes,” IEEE Trans. Wireless Commun.,vol. 7, no. 5, pp. 1868–1878, May 2008.

[8] M. Felegyhazi, J.-P. Hubaux, and L. Buttyan, “Nash equilibria of packetforwarding strategies in wireless ad hoc networks,” IEEE Trans. MobileComput., vol. 5, pp. 1044–1049, May. 2006.

[9] J. Yang and D. R. Brown III, “Energy efficient relaying games in coop-erative wireless transmission systems,” in Proc. the Forty-First AsilomarConference on Signals, Systems and Computers, Nov. 2007, pp. 835–839.

[10] J. N. Laneman, D. N. C. Tse, and G. W.Wornell, “Cooperative diversity inwireless networks: Efficient protocols and outage behavior,” IEEE Trans.Inf. Theory, vol. 50, no. 12, pp. 3062–3080, 2004.

[11] D. Gale and L. Shapley, “College admissions and the stability of mar-riage,” The American Mathematical Monthly, vol. 69, no. 1, pp. 9–15,Jan. 1962.

[12] J. Yang, D. Gunduz, D.R. Brown III, and E. Erkip, “Resource allocationfor cooperative relaying,” in Proc. CISS, Princeton, NJ, Mar 2008.

[13] K. Binmore, Fun and Games: A Text on Game Theory. Lexington, MA:D.C. Heath, 1992.

[14] E. Kalai and M. Smorodinsky, “Other solutions to Nash’s bargaining prob-lem,” Econometrica, vol. 43, no. 3, pp. 513–518, 1975.

[15] X. Cao, “Preference functions and bargaining solutions,” in Proc. IEEEConf. Decision and Control, 1982, pp. 164–171.

Page 11: A Game Theoretic Study of Energy Efficient Cooperative

[16] R. Axelrod, The Evolution of Cooperation. Basic Books, 1984.[17] M. Jackson and A. Wolinsky, “A strategic model of social and economic

networks,” Journal of Economic Theory, vol. 71, no. 1, pp. 44–74, June1996.

[18] D. Gusfield and R. W. Irving, The Stable Marriage Problem: Structureand Algorithms. Cambridge, MA: MIT Press, 1989.

[19] Y. Chen, P. Cheng, P. Qiu, and Z. Zhang, “Optimal partner selection strate-gies in wireless cooperative networks with fixed and variable transmitpowers,” in Proc IEEE WCNC, Mar. 2007, pp. 4080–4084.

[20] R. Mazumdar, L. G. Mason, and C. Douligeris, “Fairness in network op-timal flow control: Optimality of product forms,” IEEE Trans. Commun.,vol. 39, no. 5, pp. 775–782, May 1991.

D. Richard Brown III is an Associate Professor inthe Department of Electrical and Computer Engineer-ing at Worcester Polytechnic Institute. He received aPh.D. in Electrical Engineering from Cornell Univer-sity in 2000 and M.S. and B.S. degrees in ElectricalEngineering from The University of Connecticut in1996 and 1992, respectively. He is a Senior Memberof the IEEE and also held an appointment as a Visit-ing Associate Professor at Princeton University fromAugust 2007 to June 2008.

Fatemeh Fazel is a Postdoctoral Associate at North-eastern University. She received a Ph.D. in ElectricalEngineering and Computer Science from the Univer-sity of California in Irvine in 2008, an M.Sc. degreefrom the University of Southern California in 2002,and the B.Sc. degree from Sharif University of Tech-nology in 2000.