Critical Discount Factor Values in Discounted Supergames

Berg, Kimmo; Kärki, Markus

doi:10.3390/g9030047

Open AccessArticle

Critical Discount Factor Values in Discounted Supergames

by

Kimmo Berg

^*

and

Markus Kärki

Department of Mathematics and Systems Analysis, Aalto University School of Science, P.O. Box 11100, FI-00076 Aalto, Finland

^*

Author to whom correspondence should be addressed.

Games 2018, 9(3), 47; https://doi.org/10.3390/g9030047

Submission received: 5 June 2018 / Revised: 27 June 2018 / Accepted: 6 July 2018 / Published: 10 July 2018

Download

Browse Figures

Versions Notes

Abstract

:

This paper examines the subgame-perfect equilibria in symmetric

2 \times 2

supergames. We solve the smallest discount factor value for which the players obtain all the feasible and individually rational payoffs as equilibrium payoffs. We show that the critical discount factor values are not that high in many games and they generally depend on how large the payoff set is compared to the set of feasible payoffs. We analyze how the stage-game payoffs affect the required level of patience and organize the games into groups based on similar behavior. We study how the different strategies affect the set of equilibria by comparing pure, mixed and correlated strategies. This helps us understand better how discounting affects the set of equilibria and we can identify the games where extreme patience is required and the type of payoffs that are difficult to obtain. We also observe discontinuities in the critical values, which means that small changes in the stage-game payoffs may affect dramatically the equilibrium payoffs.

Keywords:

repeated game; folk theorem; discount factor; 2 × 2 games; payoff set; correlated equilibrium

MSC:

91A20

JEL Classification:

C73

1. Introduction

The folk theorem tells us that any feasible and individually rational (FIR) payoff is an equilibrium, when the players are patient enough [1,2,3,4]. However, the players may not be extremely patient but they rather have some intermediate value for the discount factor. This paper finds the smallest discount factor value for which the FIR payoffs are the equilibrium payoffs in the symmetric

2 \times 2

supergames. This extends the folk theorem by solving exactly how patient the players need to be as a function of the stage-game payoffs. This reveals why and in what type of games extreme patience is required in the folk theorem and what payoffs are difficult to obtain.

This paper compares three types of strategies: pure strategies with and without public correlation and mixed strategies without public correlation. We want to examine how these assumptions affect the results. In some applications, it may be reasonable to assume that the players may only use pure strategies and they may not be able to coordinate their actions using a correlation device. The pure-strategy model has been examined in [5,6,7,8]. These papers characterize the subgame-perfect equilibria with a set-valued fixed-point equation, which forms the basis of our analysis. The mixed-strategy model has been studied in the more general model of imperfect public monitoring [9,10,11]; see also [12,13] for the model examined in this paper.

The computation of the payoff set has been examined in [14,15,16,17,18]. These papers assume public correlation, which makes the payoff set convex and simplifies the computation dramatically. References [19,20] have developed a method for computing pure-strategy equilibria without public correlation.

The main result of this paper is to solve the critical values, i.e., the smallest discount factor value for which the players obtain all the FIR payoffs as equilibrium payoffs. The results are based on solving analytically and geometrically the fixed-point equation of [7,8]. This idea has been presented in [21] where the critical values are solved in a class of prisoner’s dilemma games under public randomization; see also Sections 2.5.3 and 2.5.6 in [11]. The pure-strategy model without public correlation is a quite straightforward extension of [21], but the mixed-strategy model requires analyzing totally new types of strategies with complicated payoffs; see [12,13]. We restrict our analysis to the symmetric

2 \times 2

games since the set of pure-strategy equilibria may be empty in asymmetric games, it may be difficult to find the smallest equilibrium payoffs [22], and it is tedious to examine all the asymmetric games.

The results of this paper can be used (i) to identify the games where the players have trouble in obtaining all the payoffs and to find the payoffs that are difficult to obtain, (ii) to understand better how the equilibrium payoffs depend on the discount factor, and (iii) to find a range of discount factor values for which the computation of equilibria is easy. For example, we know that the payoff set coincides with the FIR payoffs for all the discount factor values above the critical value, and there is no need to compute the payoff set for these values. We also observe discontinuities in the critical values, which means that the set of equilibria may not behave well, i.e., small changes in the stage-game payoffs may affect dramatically the payoff set.

We organize the

2 \times 2

games into groups based on the equations that determine the critical values and provide a useful visualization of the critical values in different games. From the figures, it is easy to see when the high level of patience is needed and to make the comparison between the different strategies.

2. The Repeated Game

2.1. Stage Games

In a repeated game, a stage game is played again and again by the same players. A stage game is defined by a finite set of players

N = {1, \dots, n}

, a finite set of pure actions for each player

A_{i}

,

i \in N

, and the players’ utilities for each action profile

u : A \mapsto R^{n}

, where

A = \times_{i \in N} A_{i}

is the set of pure-action profiles. Moreover, a pure action of player i is denoted

a_{i} \in A_{i}

and a pure-action profile is

a \in A

.

Each player

i \in N

may randomize over his pure actions

a_{i} \in A_{i}

. This defines a mixed action

q_{i}

such that

q_{i} (a_{i}) \geq 0

for each

a_{i} \in A_{i}

and

\sum_{a_{i} \in A_{i}} q_{i} (a_{i}) = 1

. The set of probability distributions over

A_{i}

is denoted

Q_{i}

and

Q = \times_{i \in N} Q_{i}

. A mixed-action profile is denoted by

q = (q_{1}, \dots, q_{n}) \in Q

. A support of a mixed action is the set of pure actions that is played with a strictly positive probability:

S u p p (q_{i}) = {a_{i} \in A_{i} | q_{i} (a_{i}) > 0}

. We also define

S u p p (q) = \times_{i \in N} S u p p (q_{i})

, and for each

a \in S u p p (q)

, we let

π_{q} (a)

be the probability that the action profile a is realized if the mixed-action profile q is played:

π_{q} (a) = \prod_{j \in N} q_{j} (a_{j})

. In pure strategies, we make the restriction that

q_{i} (a_{i}) = 1

for one action

a_{i} \in A_{i}

.

In a model with public correlation, the players observe a realization

ω \in [0, 1]

of a public lottery and they can condition their action based on the signal

ω

. For example, two players may agree to take action

a_{1}

if

ω \leq 1 / 2

and

a_{2}

otherwise. This way the players can coordinate their actions such that they randomize between the outcomes

(a_{1}, a_{1})

and

(a_{2}, a_{2})

, and avoid the outcomes

(a_{1}, a_{2})

and

(a_{2}, a_{1})

. The latter outcomes would be realized if standard mixed strategies were used and no public correlation device was available.

The stage-game payoffs are given by function

u : Q \mapsto R^{n}

. For example, if the players choose a mixed-action profile

q \in Q

, then player i receives an expected payoff of

u_{i} (q) = \sum_{a \in A} u_{i} (a) π_{q} (a) .

(1)

Let

q_{- i} \in Q_{- i} = \times_{j \in N, j \neq i} Q_{j}

denote player i’s opponents’ actions. Now, an action profile q is a Nash equilibrium in the stage game if no player has a profitable deviation, i.e.,

u_{i} (q) \geq u_{i} (q_{i}^{'}, q_{- i}) for all q_{i}^{'} \in Q_{i} and i \in N .

(2)

The twelve symmetric strict ordinal

2 \times 2

games are presented in Figure 1, see ref. [23] for the taxonomy. Strict ordinality means that all of the payoffs must be unequal and there can be no indifferences. The two actions are C (cooperate) and D (defect), and they give the players the payoffs

a = 1

,

b

,

c

and

d = 0

; the corresponding action profiles are also denoted by letters

a, b, c, d

. For example, if the players choose the action profile

b = (C, D)

, the players receive payoffs

b

and

c

.

2.2. Repeated Games

We examine a model where the stage game is repeated infinitely many times and these games are sometimes called as supergames. We assume that the players observe all the past realized pure actions but not the probabilities that the other players are using in their mixed strategies. The public past play is denoted by the set of histories

H^{k} = A^{k} = \prod_{k} A

, where

H^{0} = A^{0} = {⌀}

is the empty set and corresponds to the beginning of the game. Thus, the history contains all the pure actions that were played in the previous rounds. The set of all possible histories is

H = ⋃_{k = 0}^{\infty} H^{k}

. A behavior strategy

σ_{i}

of player

i \in N

is a mapping that assigns a probability distribution over player i’s pure actions for each possible history

σ_{i} : H \mapsto Q_{i}

. The set of player i’s strategies is

Σ_{i}

. The players’ strategies form a strategy profile

σ = (σ_{1}, \dots, σ_{n})

, a strategy profile of all players except player i is denoted by

σ_{- i}

and the set of strategy profiles is given by

Σ = \times_{i \in N} Σ_{i}

. A pure strategy assigns a pure action for each possible history

σ_{i} : H \mapsto A_{i}

. With public correlation, the players observe a public signal

ω^{k} \in [0, 1]

on each round k before making their decisions, and thus the history contains all the past pure actions, signals and the current signal.

We assume that the players discount the future payoffs with a common discount factor

δ \in [0, 1)

. They have the same discount factor since we examine symmetric games. The expected discounted payoff of a strategy profile

σ

to player i is

U_{i} (σ) = E [(1 - δ) \sum_{k = 0}^{\infty} δ^{k} u_{i}^{k} (σ)],

(3)

where

u_{i}^{k} (σ)

is the payoff of player i at round k induced by the strategy profile

σ

. A strategy profile

σ

is a Nash equilibrium if no player has a profitable deviation, i.e.,

U_{i} (σ) \geq U_{i} (σ_{i}^{'}, σ_{- i}) for all σ_{i}^{'} \in Σ_{i} and i \in N,

(4)

and it is a subgame-perfect equilibrium (SPE) if it induces a Nash equilibrium in every subgame, i.e.,

U_{i} (σ | h) \geq U_{i} (σ_{i}^{'}, σ_{- i} | h) for all σ_{i}^{'} \in Σ_{i}, h \in H, and i \in N,

(5)

where

σ | h

is the restriction of the strategy profile after history

h \in H

. From now on, by equilibrium we mean subgame-perfect equilibrium.

2.3. Critical Discount Factor Values

Let V be the compact set of SPE payoffs and we also use

V (δ)

when we want to emphasize the players’ discount factor

δ

. By

V^{C}

,

V^{P}

and

V^{M}

we refer to the equilibria in pure strategies with and without public correlation, and mixed strategies, respectively.

The player i’s minimum equilibrium payoff, which is also called the punishment payoff, is denoted by

v_{i}^{-} (δ) = min {v_{i} : v \in V (δ)}

, when

V (δ)

is non-empty; and this is the case in the symmetric

2 \times 2

supergames. Similarly, the maximum equilibrium payoff is

v_{i}^{+} (δ) = max {v_{i} : v \in V (δ)}

. The minimax payoff is

{\underset{̲}{v}}_{i} = min_{q_{- i} \in Q_{- i}} max_{q_{i} \in Q_{i}} u_{i} (q_{i}, q_{- i}) .

(6)

Please note that the minimax payoffs can be different in pure and mixed strategies. However, it holds under perfect monitoring that

v_{i}^{-} (δ) \geq {\underset{̲}{v}}_{i}

,

\forall i \in N

[4,11]. The player’s minimum and maximum payoffs in a compact set W are denoted by

v_{i}^{-} (W)

and

v_{i}^{+} (W)

, respectively. Moreover,

v^{s} (W) = max {v_{i} \in W, v_{i} = v_{j}, \forall j \in N}

is the maximum symmetric payoff in the set W.

Let

V^{†} = c o (v \in R^{n} : \exists q \in A s . t . v = u (q))

be the set of feasible payoffs, where

c o

denotes the convex hull of the set. The set of feasible and individually rational (FIR) payoffs are

V^{*} = {v \in V^{†} : v_{i} \geq {\underset{̲}{v}}_{i}, i \in N} .

Let us denote the critical discount factor by

δ^{F} = min {δ : V (δ) = V^{*}},

(7)

which gives the smallest discount factor value when the payoff set coincides with the FIR payoffs. Please note that

V^{*}

is convex and thus

V (δ) = V^{*}

for all

δ \geq δ^{F}

by Theorem 3.

Please note that the minimum equilibrium payoff

v_{i}^{-} (δ)

may be strictly higher than the minimax payoff

{\underset{̲}{v}}_{i}

, and then it is impossible to obtain all the FIR payoffs for given

δ < 1

. The minimum pure-strategy payoffs have been studied in [24], and [22] present an algorithm for finding the punishment paths and payoffs.

For most of the symmetric

2 \times 2

games, the minimum equilibrium payoffs in pure strategies are equal to the minimax payoffs for all discount factors. No conflict, its anti-game and anti-stag hunt games are the only exceptions. In these games, the minimum payoffs are equal to the minimax values when the players are patient enough. This issue does not affect the results, since it can be shown the required levels of patience are smaller than the critical values.

The minimax payoffs in mixed strategies are the same as in pure strategies, except in leader, battle of the sexes, coordination and anti-coordination games. In these games, the minimax payoff is given by a mixed-strategy Nash equilibrium. Thus,

v^{-} (δ) = \underset{̲}{v}

for all

δ

in mixed strategies. It can be shown that the FIR payoffs are not obtained for any

δ < 1

in these games.

2.4. The Characterization of Equilibria

A pair

(a, w)

of an action profile

a \in A

and a continuation payoff

w \in W

is admissible with respect to W if it satisfies

(1 - δ) u_{i} (a) + δ w_{i} \geq max_{a_{i}^{'} \in A_{i}} [(1 - δ) u_{i} (a_{i}^{'}, a_{- i}) + δ v_{i}^{-} (W)], \forall i \in N .

(8)

This incentive-compatibility condition means that it is better for player i to take action

a_{i}

and get the continuation payoff

w_{i}

than to deviate and then obtain

v_{i}^{-} (W)

.

For a set of continuation payoffs W, the set of supportable action profiles is denoted by

F^{δ} (W) = {a \in A such that (a, w) is admissible for some w \in W} .

(9)

For

a \in F^{δ} (W)

, we denote the set of admissible continuation payoffs as

C_{a}^{δ} (W) = {w \in W such that (a, w) is admissible} .

(10)

Let

D_{a}^{δ} : R^{n} \mapsto R^{n}

be an affine mapping that corresponds to an action profile

a \in A

and a discount factor

δ

D_{a}^{δ} (w) = (I - T) u (a) + T w,

(11)

where I is an

n \times n

identity matrix and T is an

n \times n

diagonal matrix with the discount factor

δ

on the diagonal. The mapping

D_{a}^{δ}

is also defined for sets; then the addition is the Minkowski sum and

D_{a}^{δ} (⌀) = ⌀

. Finally, we denote the admissible payoffs that start with an action profile

a \in A

by

B_{a}^{δ} (W) = D_{a}^{δ} (C_{a}^{δ} (W))

.

Theorem 1.

The set of pure-strategy subgame-perfect equilibrium payoffs

V^{P}

is the unique largest compact set satisfying the fixed-point [8,11]

W = B^{δ} (W) ≐ ⋃_{a \in F^{δ} (W)} B_{a}^{δ} (W) .

(12)

The payoff set under public correlation

V^{C}

is given by the largest compact set satisfying [11]

W = B^{δ} (W) ≐ c o (⋃_{a \in F^{δ} (V)} B_{a}^{δ} (W)) .

(13)

Let us now characterize the set of equilibria in mixed strategies [12,13]. In a repeated game, the play at each round is strategically equivalent to playing an augmented stage game, where the continuation payoffs are included in the payoffs. For each action profile

a \in A

, the payoff in the augmented game is given by

{\tilde{u}}_{δ} (a) ≐ (1 - δ) u (a) + δ x (a),

where

x (a)

is the continuation payoff after a. Please note that in pure strategies there are only two continuation payoffs for each player:

w_{i}

if the player follows the equilibrium path or

v_{i}^{-}

if the player deviates. In mixed strategies, there can be a different continuation payoff

x (a)

after each action profile

a \in A

. Let

M (u (a))

denote the set of Nash equilibrium payoffs in a stage game with payoffs

u (a)

,

a \in A

.

Now, we are ready to state the characterization for the subgame-perfect equilibrium payoffs [12,13]. This result has been derived earlier in a more general model of imperfect monitoring [11].

Theorem 2.

The payoff set

V^{M}

is the largest compact fixed point of

B

:

W = B (W) ≐ ⋃_{x (a) \in W} M ((1 - δ) u (a) + δ x (a)) .

(14)

This means that the payoff set

V^{M}

corresponds to the set of equilibria in augmented stage games where the payoffs are

(1 - δ) u (a) + δ x (a)

and each

x (a)

can be chosen from the set

V^{M}

.

2.5. Monotonicity and Helpful Results

In this subsection, by

B (V)

we refer to the sets

B (V)

,

B (V)

and

B (V)

, depending on which strategies are in question. A set W is called self-generating if

W \subseteq B^{δ} (W)

. The following result follows directly from Theorems 1 and 2 and Equation (13).

Proposition 1.

If a bounded set W is self-generating then

B^{δ} (W) \subseteq V (δ)

.

The following shows that the payoff set is monotone in the discount factor when it is convex [8,11,24,25].

Theorem 3.

Suppose

V (δ_{1})

is convex then

V (δ_{1}) \subseteq V (δ_{2})

for

δ_{2} \geq δ_{1}

.

Let

v^{A} (W)

,

v^{B} (W)

,

v^{C} (W)

and

v^{D} (W)

be the corners of a quadrilateral set W corresponding to payoffs

u (a)

,

u (b)

,

u (c)

and

u (d)

. For example, if

u (a)

and

u (b)

are the payoffs in the northeast and northwest corners of

V^{†}

, then

v^{A} (B_{b} (δ))

and

v^{B} (B_{a} (δ))

are the northeast and the northwest corners of the sets

B_{b}

and

B_{a}

; see Figure 2a. Moreover, let

v^{M} (δ) = (1 - δ) max_{a \in A} u_{i} (a) + δ v_{i}^{-} (δ) .

(15)

This is the right-hand side of Equation (8) for the column that contains the maximum payoff in the game.

Remark 1.

If

B_{a} (δ) \neq ⌀

,

a \in A

, then

v^{M} (δ) = v_{i}^{-} (B_{a} (δ))

for all a and i where player i can deviate to the maximum payoff of the stage game.

The following result describes how the sets

B_{a}

,

a \in A

, may cover the boundaries of

V^{*}

. The result implies that we need as many sets

B_{a}

,

a \in A

, to cover the FIR payoffs as there are corner points in

V^{*}

.

Proposition 2.

The set

B_{a}^{δ} (V^{*})

,

a \in A

, may only cover the corner point of

V^{*}

closest to

u (a)

. It cannot cover the other corner points or the boundary of

V^{*}

between these other corner points.

Proof.

By the definition of

D_{q}^{δ}

, the set

V^{*}

is contracted by

δ

and is thus strictly smaller than

V^{*}

. The translation part

(I - T) u (a)

moves the set towards

u (a)

. ☐

3. Results for Different Strategies

We present now the results for the three different strategies, and the proofs are given in the appendix. Section 4 gives an example of the proofs in a prisoner’s dilemma game. The proofs for the mixed strategies are novel in Appendix C. However, the principle behind the proofs is the same but finding all the mixed-strategy payoffs is more complicated.

The results are based on Theorems 1 and 2 and Equation (13), which tell that the payoff set coincides with the FIR payoffs when the admissible payoffs cover all the payoffs in

V^{*}

. To find the critical value

δ^{F}

, we need to find the smallest discount factor for this to happen. The main idea is to find the last payoff point

v^{F} \in V^{*}

that is covered when the discount factor is increased to

δ^{F}

. The value

δ^{F}

is typically solved from a condition that two sets, say

B_{b}

and

B_{c}

intersect. In the proofs, we show both the necessary and sufficient conditions for

δ^{F}

: the point

v^{F}

is not covered for a smaller discount factor value and all the other FIR payoffs are covered for the given value

δ^{F}

.

The results are given in Table 1, Table 2 and Table 3. They show the values of

δ^{F}

for different groups of games. Please note that the group boundaries cross the game boundaries. For example, there are two groups of prisoner’s dilemma (PD) games in pure strategies without public correlation: the quadrilateral PDs belong to Group I (see example in Figure 2a) and the triangle PDs in Group IIIb, and the boundary between these groups is given by the equation

c = 2 - b

. Also, different games may belong to the same group: the triangle PDs and triangle chicken games all belong to Group IIIb. Figure 3 shows the groups for the different strategies. The thick grey lines show that there are discontinuities of

δ^{F}

between the groups. This means that the value of

δ^{F}

is not continuous and there may be a jump, when b and c are changed. Figure 4 shows the values of

δ^{F}

in different games.

The groups are based on the equations that determine the value of

δ^{F}

and the location of the last payoff

v^{F}

. We note that our classification is heuristic, and the games could be organized differently into groups. In pure strategies without public correlation: in Group I,

v^{F}

is found on the upper edge of

V^{*}

between

u (a)

and

u (b)

, on the bottom edge between

u (b)

and

u (d)

in Group II, and in the middle in Groups IIIa-d. In Group IV, we have

δ^{F} = 1

and in Group V,

δ^{F} = 0

.

With public correlation, the groups are based on the last corner point to be covered: in Group I, the last corner is in the northwest, corresponding to

u (b)

or

u (c)

, the

u (a)

corner in Group II, and the

u (d)

corner in Group III.

In mixed strategies, the groups are the following. In Group Ia, the last point to be covered is on the boundary between

u (b)

and

u (a)

, i.e., the intersection of

B_{a}

and

B_{b}

. Group Ib corresponds to the triangle games, where

B_{b}

and

B_{c}

intersect. In Group II, a corner point determines the value of

δ^{F}

:

u (b)

corner is last to be covered in Group IIa and

u (d)

corner in Group IIb. The intersection of

B_{d}

and

B_{b}

determines the value for Group III.

The overview of the values of

δ^{F}

is similar for all the strategies. Groups IV and V are the same, and the locations of the high and low values are about the same. However, the values of

δ^{F}

are much lower with public correlation. The scale with public correlation is between 0 and 1, and between

1 / 2

or

2 / 3

to 1 with the other strategies. The smallest and the highest values within the game classes are shown with

z_{1}

to

z_{4}

. The difference between the pure and mixed strategies is surprisingly small; the values are typically smaller than

0.05

in quadrilateral games and smaller than

0.15

in triangle games.

We note that for all groups, there are some payoff parameters for which

δ^{F} \to 1

, except in Group V where

δ^{F} = 0

for all payoffs. For example,

δ^{F} \to 1

when

b \to - \infty

for prisoner’s dilemma, stag hunt and coordination games. This means that we cannot extend the folk theorem unless these extreme payoff parameters can be ruled out.

We can observe that the value of

δ^{F}

depends on how large

V^{*}

is compared to

V^{†}

. When

V^{*}

is small, it is difficult to play certain actions in the game and the value of

δ^{F}

is high. For example, it is difficult to play the actions b and c in a prisoner’s dilemma when

b \to - \infty

. Geometrically, this means that

V^{*}

stays almost the same but

V^{†}

keeps increasing, making the proportion of

V^{*}

to

V^{†}

smaller. On the other hand, if

V^{*}

is large then

δ^{F}

is smaller. Please note that Groups IV and V are exceptions, where

δ^{F}

is a constant and thus independent of

V^{*}

and

V^{†}

.

We also note that there are discontinuities in

δ^{F}

on some of the group boundaries; see the thick grey lines in Figure 3. The discontinuity means that small changes in the payoffs can affect dramatically the payoff set and how large

δ^{F}

is. The discontinuity between the prisoner’s dilemma games is surprising and it shows that a small geometric change from the triangle-shape to the quadrangle-shape may affect whether the Pareto efficient payoffs are obtained or not.

For mixed strategies, we have a few remarks. The necessary and sufficient conditions for

δ^{F}

are more difficult to prove, since the strategies and their payoffs are more complicated. For Groups IIb and III, the values are only upper bounds since we only use the sufficient conditions. In all leader, battle of the sexes, coordination and anti-coordination games,

δ^{F} = 1

but in the figure we use the values for which the pure-strategy FIR payoffs are obtained as a comparison.

4. Group I in Pure Strategies without Public Correlation

This group is defined by parameters

1 < c < 2 - b

and

b < 0

. These are the prisoner’s dilemmas where

V^{*}

is a quadrilateral with an obtuse angle in

u (a)

corner; see Figure 2a. It is enough to examine the upper half of

V^{*}

where

v_{2} > v_{1}

since

V^{*}

is symmetric with respect to the center line. Thus, the last point

v^{F}

that is covered is only defined in the upper half of

V^{*}

.

The point

v_{I}^{F}

in Group I is located on the upper edge between

u (a)

and

u (b)

. This point and the corresponding discount factor

δ_{I}^{F}

are solved from the intersection of

B_{a}

and

B_{b}

. It is enough to consider only player 1’s payoffs:

\begin{matrix} v_{I}^{F} & = & v^{B} (B_{a} (δ^{F})) = v^{A} (B_{b} (δ^{F})) \\ \Rightarrow & (1 - δ^{F}) c + δ^{F} \cdot 0 = (1 - δ^{F}) b + δ^{F} \cdot 1 \end{matrix}

\begin{matrix} \Rightarrow δ_{I}^{F} & = & \frac{c - b}{1 + c - b}, \end{matrix}

(16)

\begin{matrix} v_{I}^{F} & = & (\frac{c}{1 - b + c}, \frac{b - 2 c}{b - c - 1}) . \end{matrix}

(17)

On the second line, the first payoff

v_{1}^{B} (B_{a} (δ^{F}))

can be solved from the right-hand side of the admissibility condition.

We first show that

v^{F} \notin B_{a}^{δ} (V^{*})

,

a \in A

, if

δ < δ^{F}

. Since the sets

B_{a}

and

B_{b}

intersect at

δ^{F}

, it means that

v^{F}

does not belong to either

B_{a}

or

B_{b}

for

δ < δ^{F}

. Moreover, the sets

B_{c}

and

B_{d}

cannot cover

v^{F}

on the boundary of

V^{*}

by Proposition 2.

Now, let us show that every

v = \{v_{2} \geq v_{1}, v \in V^{*}\} \in B_{a}^{δ} (V^{*})

for some

a \in A

and

δ = δ^{F}

. We divide

V^{*}

into three regions. If

v_{1} \geq v^{M} (δ^{F})

then

v \in B_{a} (δ^{F})

since

v^{M} (δ)

coincides with

v_{1}^{-} (B_{a} (V^{*}))

for all

δ \geq δ^{F}

by Remark 1. If

v_{1} \leq v^{M} (δ^{F})

and

v_{2} \leq v^{M} (δ^{F})

then we show that

v \in B_{d} (δ^{F})

. We show that all corner points

(v^{M} (δ^{F}), v^{M} (δ^{F}))

,

(0, 0)

and

(0, v^{M} (δ^{F}))

belong to

B_{d}

.

(0, 0)

is the Nash equilibrium and belongs to

B_{d}

for any

δ

. Also,

(0, v^{M} (δ^{F}))

belongs to

B_{d}

if

(v^{M} (δ^{F}), v^{M} (δ^{F}))

belongs to

B_{d}

, because

v_{2}^{B} (B_{d})

is higher than

v_{2}^{A} (B_{d})

due to

c > 1

. So for the last corner, we need to show that

v^{s} (V_{d} (δ^{F})) \geq v^{M} (δ^{F})

. It holds that

v^{s} (V_{d} (δ^{F})) = δ^{F}

and

v^{M} (δ^{F}) = (1 - δ^{F}) c

and the above condition holds if

δ^{F} \geq (1 - δ^{F})

. Using Equation (16), this is equal to

b \leq 0

and this is true in this group.

Finally, if

v_{1} \leq v^{M} (δ^{F})

and

v_{2} \geq v^{M} (δ^{F})

then we show that

v \in B_{b} (δ^{F})

. The corner points are

v^{B} (V^{*})

,

(0, v^{M} (δ^{F}))

,

(v^{M} (δ^{F}), v^{M} (δ^{F}))

and

v^{B} (B_{a} (δ^{F}))

.

v^{B} (B_{a})

belongs to

B_{b}

trivially by the definition of

v^{F}

.

v^{B} (V^{*}) \in B_{b}

since

v^{F}

is at the upper edge and

B_{b}

covers the edge all the way to

v^{B} (V^{*})

.

(0, v^{M} (δ^{F})) \in B_{b}

since

v^{M} (δ)

coincides with

v_{2}^{-} (B_{b} (V^{*}))

for all

δ \geq δ^{F}

when

c > b

by Remark 1. Again,

c > 1

ensures that

v_{1}^{C} (B_{b}) > v_{1}^{A} (B_{b})

and thus

(v^{M} (δ^{F}), v^{M} (δ^{F})) \in B_{b}

since

v_{1}^{A} (B_{b} (δ^{F})) = v_{1}^{F} = v^{M} (δ^{F})

.

5. Conclusions

This paper examines the discount factor values for which the subgame-perfect equilibrium payoffs coincide with the FIR payoffs in the symmetric

2 \times 2

supergames. The main motivation is to study if the folk theorem could be extended in a class of games and find out the reasons why a high level of patience such as

δ \to 1

is required in some games. We find that the main reason is that it is impossible to obtain payoffs close to the minimax values: (1) this happens in Group IV in all strategies, (2) it is a result of the fact that the mixed-strategy punishment payoff is strictly smaller than the pure-strategy punishment in leader, battle of the sexes, coordination and its anti-game in mixed strategies, and (3) it is due to geometrical reasons in some games, i.e., how the stage-game payoffs are located and how large the FIR payoffs

V^{*}

are in proportion to feasible payoffs

V^{†}

. If

V^{*}

is small, which also means that it is difficult to play certain actions in the game, then a high level of patience is required.

We also examine how the public correlation and the mixed strategies affect the results. The games are organized into a few groups based on the equation that determines the smallest discount factor value as a function of the stage-game payoffs. The groups and the equations are a bit different under different strategies, but the overview looks similar. Even though a lower level of patience is required with public randomization, the highest and the lowest values are obtained in the same regions. The limit

δ^{F} \to 1

is obtained when

b \to - \infty

, when

c \approx b

, when

b \geq 1

and

b + c \approx 2

, or in certain anti-games. Thus, it is not possible to extend the folk theorem in any of the typical game classes, such as prisoner’s dilemma, chicken and stag hunt games, unless certain extreme payoffs can be ruled out. Furthermore, the public correlation or mixed strategies do not provide a remedy in these games and it holds that

δ^{F} \to 1

under all strategies.

The results of this paper help in determining the payoff set for high discount factor values. If the discount factor is higher than

δ^{F}

, then all the FIR payoffs are subgame-perfect equilibrium payoffs. Moreover, it is a bit surprise how small

δ^{F}

can be with public correlation. It was also observed that there are certain boundaries where

δ^{F}

is discontinuous, which means that small changes in the payoffs may affect dramatically the equilibrium payoffs.

It should be noted that this kind of analysis could be done in asymmetric games with more than two actions and more than two players. Furthermore, it is left for future research how to compute efficiently all the equilibrium payoffs when the discount factors are smaller than

δ^{F}

.

Author Contributions

Kimmo Berg is the main author. Markus Kärki calculated the values for

δ^{F}

and prepared the figures of admissible sets.

Funding

Berg acknowledges funding from Emil Aaltosen Säätiö (Post doc-pooli).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Pure Strategies without Public Correlation

Appendix A.1. Groups II and IIIa

Groups II and IIIa include no conflict, stag hunt and coordination games. The payoffs satisfy

b < c < 1

and

V^{*}

is a quadrangle with an acute angle in

u (a)

corner; see Figure A1a. In these games,

δ^{F}

is a maximum of Equations (A2) and (A4). In Group II, the maximum is Equation (A2) and the point

v_{I I}^{F}

is located at the bottom edge between

u (b)

and

u (d)

. In Group IIIa, the maximum is Equation (A4) and

v_{I I I a}^{F}

is in the middle of

V^{*}

. The boundary between the groups is given when the two equations are equal, i.e.,

c = 1 + (b - 1) \sqrt{\frac{b}{b - 2}} .

(A1)

This means that

δ^{F}

is continuous on the boundary.

The point

v_{I I}^{F}

and

δ_{I I}^{F}

are solved from the intersection of

B_{b}

and

B_{d}

:

\begin{matrix} v_{I I}^{F} & = & v^{D} (B_{b} (δ^{F})) = v^{B} (B_{d} (δ^{F})) \\ \Rightarrow & (1 - δ^{F}) \cdot 1 + δ^{F} v_{2}^{-} (δ^{F}) = \frac{c - b}{1 - b} δ^{F} + \frac{1 - c}{1 - b} v_{2}^{-} (δ^{F}) \end{matrix}

\begin{matrix} \Rightarrow δ_{I I}^{F} & = & \frac{1 - b - v^{-} (δ^{F}) + c v^{-} (δ^{F})}{1 - 2 b + c - v^{-} (δ^{F}) + b v^{-} (δ^{F})}, \end{matrix}

(A2)

\begin{matrix} v_{I I}^{F} & = & (0, \frac{c - b + (1 - c) v_{2}^{-} (δ^{F}) + (c - 1) v_{2}^{-} {(δ^{F})}^{2}}{1 + c + b (v_{2}^{-} (δ^{F}) - 2) - v_{2}^{-} (δ^{F})}) . \end{matrix}

(A3)

Figure A1. Admissible payoffs in Groups II, IIIa and IV. (a) Group II and IIIa; (b) Group IV.

On the second line, the payoff

v_{2}^{D} (B_{b} (δ^{F}))

is again solved from the admissibility condition. The term

v_{2}^{B} (B_{d} (δ^{F}))

can be solved from the equations:

v_{1}^{B} (B_{d} (δ^{F})) = (1 - δ^{F}) \cdot 0 + δ^{F} y_{1} = v_{1}^{-} (δ^{F})

,

v_{2}^{B} (B_{d} (δ^{F})) = (1 - δ^{F}) \cdot 0 + δ^{F} y_{2}

, where

(y_{1}, y_{2})

is a payoff on the line between payoffs

(b, c)

and

(a, a)

, defined by

z_{1}

and

z_{2}

:

b z_{1} + z_{2} = c

,

a z_{1} + z_{2} = a

and

y_{1} z_{1} + z_{2} = y_{2}

.

First, let us show that

v^{F} \notin B_{a}^{δ} (V^{*})

,

a \in A

, if

δ < δ^{F}

. This is exactly the same as before:

v^{F} \notin B_{d} (δ)

or

B_{b} (δ)

when

δ < δ^{F}

and

v^{F} \notin B_{a} (δ)

or

B_{c} (δ)

by Proposition 2.

It is enough to check that

B_{d} (δ^{F})

covers

[0, v^{M}] \times [0, v^{M}]

, i.e., all the corner points

(0, 0)

,

(0, v^{M} (δ^{F}))

and

(v^{M} (δ^{F}), v^{M} (δ^{F}))

. It will be shown with Group IIIa that the other parts of

V^{*}

are covered if

δ \geq δ_{I I I a}^{F}

and in Group II it holds that

δ_{I I}^{F} > δ_{I I I a}^{F}

. Now,

(0, v^{M} (δ^{F})) \in B_{d} (δ^{F})

by definition of

δ_{I I}^{F}

and since

v^{M}

coincides with

v_{2}^{-} (B_{b} (δ^{F}))

by Remark 1. By the shape of

V^{*}

,

v^{s} (B_{d}) > v_{2}^{B} (B_{d})

since

c < 1

. Thus,

(0, 0)

and

(v^{M} (δ^{F}), v^{M} (δ^{F}))

belong to

B_{d} (δ_{I I}^{F})

.

In Group IIIa, it holds that

δ_{I I I a}^{F} > δ_{I I}^{F}

. The point

δ_{I I I a}^{F}

is located in the middle of

V^{*}

at the intersection of

B_{a}

,

B_{b}

and

B_{d}

:

\begin{matrix} v_{2, I I I a}^{F} = v_{2}^{+} (B_{d} \cap B_{b}) = v_{2}^{-} (B_{a} \cap B_{b}) \end{matrix}

\begin{matrix} \Rightarrow δ_{I I I a}^{F} & = \frac{3 - 3 b - 2 c + b (b + c)}{5 - 4 b + b^{2} - 3 c + bc}, \end{matrix}

(A4)

\begin{matrix} v_{I I I a}^{F} & = (\frac{2 - b - c}{5 - 3 c + b (b + c - 4)}, \frac{2 + b (b - 2) - c^{2}}{5 - 3 c + b (b + c - 4)}), \end{matrix}

(A5)

see Figure A1a. First, we show again that

v^{F} \notin B_{a}^{δ} (V^{*})

,

a \in A

, if

δ < δ^{F}

.

v^{F} \notin B_{a}

by definition of

v^{F}

.

v^{F}

is not only located at the boundary of the intersection of

B_{b}

and

B_{d}

but also on the boundary of the union of these sets. Thus,

v^{F} \notin B_{b}

or

B_{d}

. Finally,

v^{F} \notin B_{c} (δ)

,

δ < δ^{F}

, since

v_{1}^{-} (B_{c} (δ)) > v_{1}^{-} (B_{c} (δ^{F})) = v_{1}^{-} (B_{a} (δ^{F}))

.

If

v_{1} \geq v_{1}^{F}

then

v \in B_{a} (δ^{F})

. If

v_{1} \leq v_{1}^{F}

and

v_{2} \geq v^{M} (δ^{F})

then v belongs to either

B_{b}

or

B_{d}

because the slope between

v^{A} (B_{d})

and

v^{B} (B_{d})

is greater than the slope between

v^{A} (B_{b})

and

v^{B} (B_{b})

. Finally, the region where

0 \leq v_{2} \leq v^{M} (δ^{F})

is examined with Group II and it is covered since

δ_{I I I A}^{F} > δ_{I I}^{F}

in this group.

Appendix A.2. Group IIIb

In this group, the payoffs satisfy

b + c > 2

and

c > b

. These are triangular versions of prisoner’s dilemma, chicken and leader games. The set

V^{*}

is a triangle since

b + c > 2

and

u (a)

is inside the set

V^{*}

. The point

v^{F}

is located in the middle of

V^{*}

and it is solved from the intersection of

B_{d}

,

B_{b}

and

B_{c}

:

\begin{matrix} v_{I I I b}^{F} & = & v^{s} (B_{d} (δ^{F})) = (v^{M} (δ^{F}), v^{M} (δ^{F})) \\ \Rightarrow & (1 - δ^{F}) \cdot 0 + δ^{F} \frac{b + c}{2} = (1 - δ^{F}) c + δ^{F} v^{-} (δ^{F}) \end{matrix}

\begin{matrix} \Rightarrow δ_{I I I b}^{F} & = & \frac{2 c}{b + 3 c - 2 v^{-} (V^{*})}, \end{matrix}

(A6)

\begin{matrix} v_{I I I b}^{F} & = & (\frac{c (b + c)}{b + 3 c - 2 v^{-} (δ^{F})}, \frac{c (b + c)}{b + 3 c - 2 v^{-} (δ^{F})}) . \end{matrix}

(A7)

First, we examine

δ < δ^{F}

.

v^{F} \notin B_{a}

since

v_{i}^{-} (B_{a} (δ)) \geq v^{M} (δ) > v^{F} (δ^{F})

,

i = 1, 2

.

v^{F} \notin B_{b}

since

v_{2}^{-} (B_{b} (δ)) \geq v^{M} (δ) > v^{M} (δ^{F})

.

v^{F} \notin B_{c}

since

v_{1}^{-} (B_{c} (δ)) \geq v^{M} (δ) > v^{M} (δ^{F})

. Finally,

v^{F} \notin B_{d}

by the definition of

v^{F}

.

Let

v = \{v_{2} \geq v_{1}, v \in V^{*}\}

. If

v_{2} \geq v^{M} (δ^{F})

then

v \in B_{b} (δ^{F})

by the geometry. Similarly, if

v_{2} \leq v^{M} (δ^{F})

then

v \in B_{d} (δ^{F})

.

Appendix A.3. Group IIIc

This group is the reversed version of Group IIIb so that

b > c

. These are the battle of the sexes, the triangular versions of anti-prisoner’s dilemma and anti-chicken games. The set

V^{*}

is triangular as in Group IIIb but the sets

B_{b}

and

B_{c}

are in reverse order. Again, the point

v^{F}

is located in the middle of

V^{*}

and it is in the intersection of sets

B_{a}

,

B_{b}

and

B_{c}

:

\begin{matrix} v_{I I I c}^{F} & = & v^{s} (B_{a} (δ^{F})) = (v_{1}^{-} (B_{b} (δ^{F})), v_{2}^{-} (B_{c} (δ^{F}))) \\ \Rightarrow & (1 - δ^{F}) \cdot 1 + δ^{F} \frac{b + c}{2} = (1 - δ^{F}) b + δ^{F} v^{-} (δ^{F}) \end{matrix}

\begin{matrix} \Rightarrow δ_{I I I c}^{F} & = & \frac{2 b - 2}{- 2 + 3 b - c} \end{matrix}

(A8)

\begin{matrix} v_{I I I c}^{F} & = & (\frac{b^{2} + b c - 2 v^{-} (δ^{F})}{3 b + c - 2 (1 + v^{-} (δ^{F}))}, \frac{b^{2} + b c - 2 v^{-} (δ^{F})}{3 b + c - 2 (1 + v^{-} (δ^{F}))}), \end{matrix}

(A9)

see Figure 2b. Let

δ < δ^{F}

.

v^{F} \notin B_{a} (δ)

by the definition.

v^{F} \notin B_{b}

since

v_{1}^{-} (B_{b} (δ)) > v_{1}^{-} (B_{b} (δ^{F}))

.

v^{F} \notin B_{c}

because

v_{2}^{-} (B_{c} (δ)) > v_{2}^{-} (B_{b} (δ^{F}))

. Finally,

v^{F} \notin B_{d}

since

v^{s} (B_{d} (δ^{F})) < v^{s} (B_{a} (δ^{F}))

due to

d < a

.

Now,

v_{2} \geq v_{2}^{-} (B_{c} (δ^{F}))

implies that

v \in B_{c} (δ^{F})

. Also,

v_{2} \leq v_{2}^{-} (B_{c} (δ^{F}))

implies

v \in B_{a} (δ^{F})

.

Appendix A.4. Group IIId

This group contains the quadrilateral versions of chicken such that

b + c > 2

and the

u (a)

corner is obtuse due to

c > 1

. These games are only slightly different from Group I. Since

b > d = 0

,

B_{d}

needs higher discount factor to reach

B_{a}

than

B_{b}

does. The point

v^{F}

is in the middle and solved as an intersection of

B_{a}

and

B_{d}

:

\begin{matrix} v_{I I I d}^{F} & = & v^{A} (B_{d} (δ^{F})) = v^{D} (B_{a} (δ^{F})) \\ \Rightarrow & (1 - δ^{F}) \cdot 0 + δ^{F} \cdot 1 = (1 - δ^{F}) c + δ^{F} b \end{matrix}

(A10)

\begin{matrix} \Rightarrow δ_{I I I d}^{F} & = & \frac{c}{1 + c - b}, \\ v_{I I I d}^{F} & = & (\frac{c}{1 + c - b}, \frac{c}{1 + c - b}) . \end{matrix}

(A11)

Let

δ < δ^{F}

.

v^{F} \notin B_{a}

or

B_{d}

by the definition of

v^{F}

.

v^{F} \notin B_{b}

since

v_{2}^{-} (B_{b} (δ)) = v_{2}^{-} (B_{a} (δ)) > v_{2}^{-} (B_{a} (δ^{F}))

. Similarly,

v^{F} \notin B_{c}

since

v_{1}^{-} (B_{b} (δ)) = v_{1}^{-} (B_{a} (δ)) > v_{1}^{-} (B_{a} (δ^{F}))

.

Now,

v_{1} \geq v^{M} (δ^{F})

implies that

v \in B_{a} (δ^{F})

.

v_{2} \leq v^{M} (δ^{F})

implies that

v \in B_{d} (δ^{F})

due to the shape and

c > 1

as in Group I. Next, we show that

v_{1} \leq v^{M} (δ^{F}), v_{2} \geq v^{M} (δ^{F})

belong to

B_{b} (δ^{F})

. We examine all the corner points:

p^{1} = (b, v^{M} (δ^{F})), p^{2} = (v^{M} (δ^{F}), v^{M} (δ^{F})), p^{3} = v^{B} (B_{a} (δ^{F}))

and

p^{4} = u (b)

.

u (b)

is a Nash equilibrium of the stage game and thus

u (b) \in B_{b} (δ)

,

\forall δ < 1

. By Remark 1,

p_{2}^{k} \geq v_{2}^{-} (B_{b} (δ^{F})) = v^{M} (δ^{F})

. Also, it holds that

v_{1}^{C} (B_{b} (δ^{F})) > v_{1}^{A} (B_{b} (δ^{F})) > v^{A} (B_{d} (δ^{F})) \geq p_{1}^{k}

. Thus,

p^{k} \in B_{b} (δ^{F})

for all k.

Appendix A.5. Groups IV and V

Group IV contains the anti-coordination, anti-no conflict and anti-stag hunt games. The payoff set is covered only in the limit when

δ \to 1

. By Proposition 2, the corner point

v^{C} (V^{*})

can only be covered with

B_{c}

. However, this is not covered with any

δ < 1

because

v_{1}^{-} (B_{c} (δ)) = v^{M} (δ) = (1 - δ) a + δ v^{-} (V^{*}) > v^{-} (V^{*})

.

Group V contains the trivial anti-games, where

b > 1

and

b + c < 2

. In these games

a

is both the minimax and Pareto-efficient payoff. Thus, the set

V^{*} = (1, 1)

is a single point and the payoff set is always covered, i.e., for all

0 \leq δ < 1

.

Appendix B. Pure Strategies with Public Correlation

Appendix B.1. Corner I

Let us examine the northwest corner of

V^{*}

. The set that covers this corner may only be

B_{b}

when

c > b

and

B_{c}

otherwise, by Proposition 2. There are three conditions that are needed: the set

B_{b}

(or

B_{c}

) should reach south, east and west enough to cover the corner. The first condition is that

v^{M} (δ) \leq v_{2}^{B} (V^{*})

and from this we can solve

δ_{I a}^{F} = \frac{v_{2}^{+} (V^{†}) - v_{2}^{+} (V^{*})}{v_{2}^{+} (V^{†}) - v_{2}^{-} (δ)} .

(A12)

This is the maximum of the three values and thus a sufficient condition in the triangular games which form Group Ia.

In quadrilateral games, where

c + b < 2

, the second condition is

v_{1}^{+} (B_{b}) \geq v_{1}^{-} (δ)

, which gives

δ_{I b}^{F} = \frac{b - v_{2}^{-} (δ)}{b - 1} .

(A13)

This condition is the maximum in Group Ib games.

The third condition is that

v_{1}^{-} (B_{b}) = v_{1}^{-} (δ)

. This is satisfied for all

δ

in Group Ia and Ib games. In Group IV games, this condition does not hold for any

δ < 1

, regardless of public correlation. These are the only games where the column maximum that limits the set is different from the punishment of the game.

Appendix B.2. Corner II

The corner

v^{A} (V^{*})

exist only if the set

V^{*}

is quadrilateral, i.e.,

c + b < 2

. This point is covered when

v^{M} (δ) \leq a

, from which we get

δ_{I I}^{F} = \frac{v_{2}^{+} (V^{†}) - 1}{v_{2}^{+} (V^{†}) - v_{2}^{-} (δ)} .

(A14)

Please note that this condition means that

δ_{I I}^{F} = 0

, i.e., a is a Nash equilibrium, if

c < 1

.

Appendix B.3. Corner III

The corner

v^{D} (V^{*})

is covered by the set

B_{d}

if

b < c

, and by

B_{a}

if

b > c

. In the case where

B_{a}

covers the corner, the discount factor is solved from the equation

(1 - δ) a + δ v^{S} (V^{*}) = v_{2}^{-}

, which gives

δ_{I I I a}^{F} = \frac{v_{2}^{-} (δ) - a}{v^{s} (V^{*}) - a} .

(A15)

In the

B_{d}

case, the value is solved from

(1 - δ) d + δ v^{S} (V^{*}) = v_{2}^{-}

and we get

δ_{I I I b}^{F} = \frac{v_{2}^{-} (δ) - d}{v^{s} (V^{*}) - d} .

(A16)

These equations simplify a bit in different games, since

v^{s} (V^{*}) = a

in quadrilateral games and

v^{s} (V^{*}) = (b + c) / 2

in triangular games. Please note that

δ_{I I I}^{F} = 0

if the punishment is a or d.

Appendix C. Mixed Strategies

Appendix C.1. Group I

In Group Ia, the necessary and sufficient condition for

δ^{F}

is that

B_{a}

and

B_{b}

intersect. In prisoner’s dilemma, this is obvious since the last point to be covered in pure strategies is on the boundary and this payoff cannot be obtained in non-pure mixed strategies; thus, the condition in PD is the same as in pure strategies. The same argument for the necessity of the condition also holds for the other games in Group Ia. However, we need to prove the sufficiency for the other games.

In quadrilateral chicken, a sufficient condition is that (1)

B_{a}

and

B_{b}

intersect and (2)

B_{d}

and

B_{b}

intersect. It is possible to cover the payoffs in the middle that do not belong to the pure-strategy sets

B_{a}

,

a \in A

, with mixed strategies. For example, we can construct a stage game with payoffs

(u_{1}^{+}, z_{1}) \in B_{a}

,

(u_{1}^{-}, z_{2}) \in B_{b}

,

(u_{1}^{+}, u_{2}) \in B_{c}

and

(u_{1}^{-}, u_{2}) \in B_{d}

, where

u_{1}^{+} > u_{1}^{-}

and

z_{1}

and

z_{2}

do not matter as long as the payoffs are in

B_{a}

and

B_{b}

. Now, there is a Nash equilibrium where the first player plays bottom and the second player can use any probability. Thus, they can obtain any payoff on the line between

(u_{1}^{-}, u_{2})

and

(u_{1}^{+}, u_{2})

. By going through the sets

B_{c}

and

B_{d}

, these lines cover all the payoffs in the middle that are left between the sets

B_{a}

,

a \in A

. Please note that this is only a sufficient condition, and it is possible that all these payoffs can be obtained with lower discount factors with some other mixed strategies.

$u_{1}^{+}, z_{1}$	$u_{1}^{-}, z_{2}$
$u_{1}^{+}, u_{2}$	$u_{1}^{-}, u_{2}$

In stag hunt and coordination games, the necessary and sufficient conditions are that (1)

B_{a}

and

B_{d}

intersect and (2)

v^{B} (V^{*})

is covered with

B_{b}

. The reason is the fact that the line between

u (a)

and

u (b)

can only be obtained by playing pure strategies a and b. In Group Ia, condition (1) implies (2) which makes condition (1) a necessary and sufficient condition. In Group IIa, condition (2) implies (1), which makes condition (2) a necessary and sufficient condition. In other words, the condition is the maximum over (1) and (2) in all these games. The gaps between the pure-strategy sets

B_{a}

,

a \in A

, can be covered with similar strategies as explained above.

In no conflict games, there is an additional condition (3)

B_{d}

should be non-empty. The value of

δ^{F}

is a maximum over the three conditions, and these give the three regions Ia (

B_{a}

and

B_{b}

), IIa (b corner covered) and IIb (d corner covered). Again, the gaps between the sets

B_{a}

,

a \in A

, are covered with suitable mixed strategies.

In triangle PD, anti-PD and anti-chicken, the necessary and sufficient condition is that

B_{b}

and

B_{c}

intersect. This condition implies that

B_{d}

and

B_{b}

has intersected. Thus, it is possible to cover the gaps between the sets

B_{a}

,

a \in A

with suitable mixed strategies as explained before. The condition is necessary since the payoffs between

u (b)

and

u (c)

can only be obtained by playing the pure strategies b and c.

Appendix C.2. Group II

In Group IIb, a necessary and sufficient condition is that the payoffs near

v^{D} (V^{*})

are covered. A sufficient condition for this is that

B_{d}

is non-empty. This is satisfied when it is possible to play d:

(1 - δ) \cdot 0 + δ \cdot 1 = (1 - δ) b + δ b

and thus

δ^{F} = b

for Group IIb. We are not sure if this condition is necessary as all the payoffs near

v^{D} (V^{*})

may be obtained with some mixed strategies with lower discount factor value.

In Group IIa, a necessary and sufficient condition is that

v^{B} (V^{*})

is covered with

B_{b}

. This implies that the sets

B_{a}

and

B_{d}

have intersected, as explained before. Please note that the conditions are the same as with public correlation of Group Ia.

Appendix C.3. Group III

A sufficient condition for this group is that

B_{d}

and

B_{b}

intersect. This guarantees that the gaps between the pure-strategy sets

B_{a}

,

a \in A

, are covered with suitable mixed strategies. We can solve the value

δ^{F}

for triangle chicken and leader games in the following way. We first solve the value of

δ

when it is possible to play d:

(1 - δ) \cdot 0 + δ (b + c) / 2 = (1 - δ) b + δ b

, from which we get

δ_{1} = 2 b / (b + c)

. Now,

v^{B} (B_{d} (V^{*}))

moves linearly from

v^{D} (V^{*})

to

v^{B} (V^{*})

when

δ

goes from the above value to 1:

z_{1} δ_{1} + z_{2} = b

and

z_{1} \cdot 1 + z_{2} = c

, from which we get

z_{1} = b + c

and

z_{1} = - b

. Finally, we can solve the value when it intersects

v^{D} (B_{b})

:

z_{1} δ^{F} + z_{2} = (1 - δ^{F}) c + δ^{F} b

and we have

δ^{F} = (b + c) / 2 c

. Again, we are not sure if this condition is necessary but it provides an upper bound for

δ^{F}

.

References

Abreu, D.; Dutta, P.K.; Smith, L. The folk theorem for repeated games: A Neu condition. Econometrica 1994, 62, 393–948. [Google Scholar] [CrossRef]
Friedman, J.W. A non-cooperative equilibrium for supergames. Rev. Econ. Stud. 1971, 38, 1–12. [Google Scholar] [CrossRef]
Fudenberg, D.; Maskin, E. The folk theorem in repeated games with discounting and incomplete information. Econometrica 1986, 54, 533–554. [Google Scholar] [CrossRef]
Fudenberg, D.; Tirole, J. Game Theory; MIT Press: Cambridge, MA, USA, 1991. [Google Scholar]
Abreu, D. Extremal equilibria of oligopolistic supergames. J. Econ. Theory 1986, 39, 191–225. [Google Scholar] [CrossRef]
Abreu, D. On the theory of infinitely repeated games with discounting. Econometrica 1988, 56, 383–396. [Google Scholar] [CrossRef]
Abreu, D.; Pearce, D.; Stacchetti, E. Optimal cartel equilibria with imperfect monitoring. J. Econ. Theory 1986, 39, 251–269. [Google Scholar] [CrossRef]
Abreu, D.; Pearce, D.; Stacchetti, E. Toward a theory of discounted repeated games with imperfect monitoring. Econometrica 1990, 58, 1041–1063. [Google Scholar] [CrossRef]
Fudenberg, D.; Levine, D.; Maskin, E. The Folk theorem with imperfect public information. Econometrica 1994, 62, 997–1039. [Google Scholar] [CrossRef]
Fudenberg, D.; Yamamoto, Y. Repeated games where the payoffs and monitoring structure are unknown. Econometrica 2010, 78, 1673–1710. [Google Scholar]
Mailath, G.J.; Samuelson, L. Repeated Games and Reputations: Long-Run Relationships; Oxford University Press: Oxford, UK, 2006. [Google Scholar]
Berg, K. Set-Valued Games and Mixed-Strategy Equilibria in Discounted Supergames; Working Paper; Aalto University: Espoo, Finland, 2016. [Google Scholar]
Berg, K.; Schoenmakers, G. Construction of mixed subgame-perfect equilibria in repeated games. Games 2017, 8, 47. [Google Scholar] [CrossRef]
Abreu, D.; Sannikov, Y. An algorithm for two player repeated games with perfect monitoring. Theor. Econ. 2014, 9, 313–338. [Google Scholar] [CrossRef]
Burkov, A.; Chaib-draa, B. An Approximate Subgame-Perfect Equilibrium Computation Technique for Repeated Games. In Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, Atlanta, GA, USA, 11–15 July 2010; pp. 729–736. [Google Scholar]
Cronshaw, M.B. Algorithms for finding repeated game equilibria. Comput. Econ. 1997, 10, 139–168. [Google Scholar] [CrossRef]
Cronshaw, M.B.; Luenberger, D.G. Strongly symmetric subgame perfect equilibria in infinitely repeated games with perfect monitoring. Games Econ. Behav. 1994, 6, 220–237. [Google Scholar] [CrossRef]
Judd, K.; Yeltekin, S.; Conklin, J. Computing supergame equilibria. Econometrica 2003, 71, 1239–1254. [Google Scholar] [CrossRef]
Berg, K.; Kitti, M. Equilibrium Paths in Discounted Supergames. Working Paper. 2012. Available online: http://sal.aalto.fi/publications/pdf-files/mber09b.pdf (accessed on 9 July 2018).
Berg, K.; Kitti, M. Computing equilibria in discounted 2 × 2 supergames. Comput. Econ. 2013, 41, 71–78. [Google Scholar] [CrossRef]
Stahl, D.O. The graph of prisoners’ dilemma supergame payoffs as a function of the discount factor. Games Econ. Behav. 1991, 3, 368–384. [Google Scholar] [CrossRef]
Berg, K.; Kärki, M. An Algorithm for Finding the Minimal Pure-Strategy Subgame-Perfect Equilibrium Payoffs in Repeated Games; Working Paper; Aalto University: Espoo, Finland, 2014. [Google Scholar]
Robinson, D.; Goforth, D. The Topology of the 2 × 2 Games: A New Periodic Table; Routledge: Abingdon, UK, 2005. [Google Scholar]
Berg, K. Extremal Pure Strategies and Monotonicity in Repeated Games. Comput. Econ. 2016, 49, 387–404. [Google Scholar] [CrossRef]
Sorin, S. On repeated games with complete information. Math. Oper. Res. 1986, 11, 147–160. [Google Scholar] [CrossRef]

Figure 1. Symmetric ordinal

2 \times 2

games with parameters b and c.

Figure 1. Symmetric ordinal

2 \times 2

games with parameters b and c.

Figure 2. Admissible payoffs in Groups I and IIIc. The shaded areas show the

B_{a}

sets for

a \in A

. (a) Group I; (b) Group IIIc.

Figure 2. Admissible payoffs in Groups I and IIIc. The shaded areas show the

B_{a}

sets for

a \in A

. (a) Group I; (b) Group IIIc.

Figure 3. Illustration of the groups in pure, correlated and mixed strategies. The thick grey lines show the discontinuities between the classes.

Figure 4. Values of

δ^{F}

with pure, correlated and mixed strategies. The darker shade means that the value of

δ^{F}

is higher.

Figure 4. Values of

δ^{F}

with pure, correlated and mixed strategies. The darker shade means that the value of

δ^{F}

is higher.

Table 1. The values of the discount factor

δ^{F}

without public correlation. Some games have multiple groups, and the equation that gives the boundary is shown on the right.

Table 1. The values of the discount factor

δ^{F}

without public correlation. Some games have multiple groups, and the equation that gives the boundary is shown on the right.

Game	First Group	Second Group	Group Boundary
Prisoner’s dilemma	I: $δ^{F} = \frac{c - b}{1 + c - b}$	IIIb: $δ^{F} = \frac{2 c}{b + 3 c}$	$c = 2 - b$
Chicken	IIId: $δ^{F} = \frac{c}{1 + c - b}$	IIIb: $δ^{F} = \frac{2 c}{3 c - b}$	$c = 2 - b$
Leader	IIIb: $δ^{F} = \frac{2 c}{3 c - b}$
Battle of the sexes	IIIc: $δ^{F} = \frac{2 b - 2}{3 b - c - 2}$
Anti-PD	V: $δ^{F} = 0$	IIIc: $δ^{F} = \frac{2 b - 2}{3 b - c - 2}$	$c = 2 - b$
Anti-chicken	V: $δ^{F} = 0$	IIIc: $δ^{F} = \frac{2 b - 2}{3 b - c - 2}$	$c = 2 - b$
Stag hunt	II: $δ^{F} = \frac{1 - b}{1 - 2 b + c}$	IIIa: $δ^{F} = \frac{3 - 3 b - 2 c + b (b + c)}{5 - 4 b + b^{2} - 3 c + bc}$	$c = 1 + (b - 1) \sqrt{\frac{b}{b - 2}}$
Coordination	II: $δ^{F} = \frac{1 - b}{1 - 2 b + c}$		$c = 1 + (b - 1) \sqrt{\frac{b}{b - 2}}$
No conflict	II: $δ^{F} = \frac{1 + b (c - 2)}{1 + b (b - 3) + c}$
Anti-coordination	IV: $δ^{F} = 1$
Anti-no conflict
Anti-stag hunt

Table 2. The values of discount factor

δ^{F}

with public correlation.

Table 2. The values of discount factor

δ^{F}

with public correlation.

Game	First Group	Second Group	Third Group
Prisoner’s dilemma	II: $δ^{F} = (c - 1) / c$	Ib: $δ^{F} = b / (b - 1)$	Ia: $δ^{F} = - b / c$
Stag hunt	Ia: $δ^{F} = (1 - c) / (1 - b)$
Coordination
No conflict		IIIa: $δ^{F} = b$
Chicken	IIIa: $δ^{F} = 2 b / (b + c)$	IIIa: $δ^{F} = b$	II: $δ^{F} = \frac{c - 1}{c - b}$
Leader	IIIa: $δ^{F} = 2 b / (b + c)$
Anti-PD	V: $δ^{F} = 0$	Ia: $δ^{F} = \frac{c - 1}{1 - b}$
Anti-chicken	V: $δ^{F} = 0$	Ia: $δ^{F} = \frac{c - 1}{1 - b}$
Battle of the sexes	IIIb: $δ^{F} = (2 - 2 c) / (2 - b - c)$
Anti-coordination	IV: $δ^{F} = 1$
Anti-no conflict
Anti-stag hunt

Table 3. The values of discount factor

δ^{F}

in mixed strategies.

Table 3. The values of discount factor

δ^{F}

in mixed strategies.

Game	First Group	Second Group	Third Group
Prisoner’s dilemma	Ib: $δ^{F} = (c - b) / 2 c$	Ia: $δ^{F} = (c - b) / (1 + c - b)$
Chicken	III $^{*}$ : $δ^{F} = (b + c) / 2 c$	Ia: $δ^{F} = (c - b) / (1 + c - 2 b)$	III $^{*}$ : $δ^{F} = \frac{1}{2 - b}$
Leader $^{* *}$	III $^{*}$ : $δ^{F} = (b + c) / 2 c$
Battle of the sexes $^{* *}$	III $^{*}$ : $δ^{F} = \frac{b + c - 2}{2 (b - 1)}$
Stag hunt	IIa: $δ^{F} = (1 - c) / (1 - b)$	Ia: $δ^{F} = (b - 1) / (b - 2)$
Coordination $^{* *}$		Ia: $δ^{F} = (b - 1) / (b - 2)$
No conflict		Ia: $δ^{F} = 1 / 2$	IIb $^{*}$ : $δ^{F} = b$
Anti-no conflict	IV: $δ^{F} = 1$
Anti-coordination $^{* *}$
Anti-stag hunt
Anti-chicken	V: $δ^{F} = 0$	Ib: $δ^{F} = (b - c) / 2 (b - 1)$
Anti-PD	V: $δ^{F} = 0$	Ib: $δ^{F} = (b - c) / 2 (b - 1)$

^{*}

Possibly only an upper bound. **

δ^{F} = 1

, the values correspond to obtaining the pure-strategy FIR payoffs.

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Berg, K.; Kärki, M. Critical Discount Factor Values in Discounted Supergames. Games 2018, 9, 47. https://doi.org/10.3390/g9030047

AMA Style

Berg K, Kärki M. Critical Discount Factor Values in Discounted Supergames. Games. 2018; 9(3):47. https://doi.org/10.3390/g9030047

Chicago/Turabian Style

Berg, Kimmo, and Markus Kärki. 2018. "Critical Discount Factor Values in Discounted Supergames" Games 9, no. 3: 47. https://doi.org/10.3390/g9030047

APA Style

Berg, K., & Kärki, M. (2018). Critical Discount Factor Values in Discounted Supergames. Games, 9(3), 47. https://doi.org/10.3390/g9030047

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Critical Discount Factor Values in Discounted Supergames

Abstract

1. Introduction

2. The Repeated Game

2.1. Stage Games

2.2. Repeated Games

2.3. Critical Discount Factor Values

2.4. The Characterization of Equilibria

2.5. Monotonicity and Helpful Results

3. Results for Different Strategies

4. Group I in Pure Strategies without Public Correlation

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

Appendix A. Pure Strategies without Public Correlation

Appendix A.1. Groups II and IIIa

Appendix A.2. Group IIIb

Appendix A.3. Group IIIc

Appendix A.4. Group IIId

Appendix A.5. Groups IV and V

Appendix B. Pure Strategies with Public Correlation

Appendix B.1. Corner I

Appendix B.2. Corner II

Appendix B.3. Corner III

Appendix C. Mixed Strategies

Appendix C.1. Group I

Appendix C.2. Group II

Appendix C.3. Group III

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI