μ–σ Games

Dulleck, Uwe; Löffler, Andreas

doi:10.3390/g12010005

by Uwe Dulleck ¹,²,† and Andreas Löffler ³,*,†

¹ Centre for Behavioural Economics, Society and Technology (BEST), Queensland University of Technology, Brisbane, QLD 4000, Australia

² Crawford School of Public Policy, Australian National University, Canberra, ACT 4000, Australia

³ Freie Universität Berlin, 14195 Berlin, Germany

* Author to whom correspondence should be addressed.

† These authors contributed equally to this work. The authors would like to thank Adam Clements for comments and feedback.

Games 2021, 12(1), 5; https://doi.org/10.3390/g12010005

Submission received: 4 August 2020 / Revised: 16 November 2020 / Accepted: 8 January 2021 / Published: 12 January 2021

Abstract: Risk aversion in game theory is usually modeled using expected utility, which was criticized early on, leading to an extensive literature on generalized expected utility. In this paper we are the first to apply μ–σ theory to the analysis of (static) games. μ–σ theory is widely accepted in the finance literature; using it allows us to study the effect of uncertainty endogenous to the game, i.e., mixed equilibria. In particular, we look at the case of linear μ–σ utility functions and determine the best response strategy. In the case of 2 × 2 and N × M games, we are able to characterize all mixed equilibria.

Keywords: μ–σ utility; game theory; mixed strategies; equilibrium

1. Introduction

It is a well-established fact in economic research that people differ in their attitudes towards risk and uncertainty. In game theoretic models of strategic interaction, the main tool to capture risk and uncertainty is expected utility theory [1]. This model was criticized by [2,3], which led to an extensive literature on generalized expected utility models that are able to accommodate the identified paradoxes. To name just a few, [4,5] proposed models of rank-dependent utility relaxing the independence axiom; [6] relaxed the reduction of compound lotteries axiom to explain the Ellsberg paradox. Contributions to this literature usually start out by relaxing axioms that are the basis for von Neumann and Morgenstern’s model, and show that by relaxing these axioms one can find a generalized model of expected utility that keeps most of the attractiveness of the standard model and, at the same time, enriches the framework to allow for behavior that seems irregular with respect to the standard model.

One idea that comes to mind is μ–σ theory. Its equilibrium equivalent, the capital asset pricing model (CAPM), describes the relationship between systematic risk and expected returns for assets traded on markets and has wide applications in finance, as every textbook (see, among others, [7] (Section 9.1)) and many papers (a still contemporary and, in addition, critical overview is [8]) show. The CAPM, with its statements on optimal portfolio choice, is one of the most widely used models in finance today. Therefore, it seems worthwhile to understand its effects on game theory. We follow the finance approach and capture risk not using expected utility, but using μ–σ utility (see [9]).

Sometimes, μ–σ is seen as a special case of quadratic utility functions of the form $u(x) = a x - b x^{2}$, with x being the payoffs of a lottery or an asset. We do not restrict ourselves to this special case but deal with the general form $V(μ, σ^{2})$. These utility functions are considered today not as a special case of expected utility, but as an entirely different treatment of risk.¹ This approach requires using monetary (material) instead of utility-based payoffs in the game.²

Most of the above-mentioned generalizations in game theory can be formalized using Choquet integrals.³ Notice that our approach cannot be described by Choquet integrals; modern finance theory is instead based on its own axiomatization.⁴

In game theory, the linear formulation of expected utility (with respect to probabilities) does not allow one to capture preferences over uncertainty endogenous to the game. A mixed strategy of a player, whether interpreted as the belief of another player or a real randomization, causes uncertainty for players. In the standard model, due to the linear formulation, this uncertainty is treated as it would be under risk neutrality, i.e., it is in fact ignored. With μ–σ utility, the circumstances are different, since probabilities enter the variance $σ^{2}$ nonlinearly (because $σ^{2} = E[x^{2}] - E[x]^{2}$, with x being payoffs). Hence, any mixed strategy of a player now causes real uncertainty that is not disregarded. Thus, in many cases where mixed-strategy equilibria exist in games assuming expected utility, they do not exist when one assumes μ–σ utility. That players tend to avoid mixed strategies in economic experiments can be seen, for example, in the Stag Hunt game (see [19,20]).⁵

In this article we discuss how equilibrium predictions change. As we concentrate on two-player games, we interpret a mixed strategy as a real randomization by a player and not as a belief about the composition of the population from which the other player is chosen randomly, with this player then choosing a pure strategy. One important additional aspect of interpreting a mixed strategy as real randomization is that, in the case of mixed equilibria, a player’s own strategy now affects his or her utility, even though it does not change the expected value of the payoff. In the context of repeated play, one may interpret this as a cost of changing one’s actions in different rounds of the game. In this context, the literature on ambiguity aversion [22,23] comes to mind. Our approach is one of ex-ante utility, i.e., randomizing over one’s actions is costly because it increases the variance. Taking a population interpretation of mixed strategy equilibria and applying payoff dominance as a selection criterion [24] shares with our approach that, for coordination games, mixed strategies are not selected. There, mixing reduces expected payoffs, whereas with the μ–σ approach there is a direct cost of randomization in the form of increased variance.

In finance, both expected utility theory and μ–σ theory argue that a portfolio does better than a single investment; a similar result does not hold when μ–σ theory is applied to game theory. In static games based on standard expected utility, reducing the variance of (monetary) payoffs is not important because the expected value of the utility payoffs stays the same; μ–σ theory, however, indicates that a pure strategy, i.e., choosing one action instead of randomizing, is preferable. By explicitly capturing the variance caused by an agent’s own strategy choice, we may provide some reasoning as to why experimental players frequently refrain from using mixed strategies.

To illustrate our argument, consider the simple μ–σ utility function $V(μ, σ^{2}) = μ - \frac{r}{2} σ^{2}$. This utility function is linear in expectation and variance and is therefore sometimes called “linear utility,” although linearity here does not refer to the material payoffs or the probabilities of the players. In a typical μ–$σ^{2}$ diagram, indifference curves are upward sloping, because a higher variance needs to be compensated by a higher expected value; in our case, the indifference curves are straight lines with slope $- \frac{\partial V / \partial σ^{2}}{\partial V / \partial μ} = \frac{r}{2}$; for details, see [12] (p. 426). We focus on this particular utility function because it plays a major role in finance: every investor with this utility function exhibits the behavior known as “constant absolute risk aversion” in the expected utility framework (see [12] (property 5)). This utility function serves as a starting point for understanding our idea.

We now consider a game where a player chooses a possibly mixed strategy given the possibly mixed strategy profile of the other player(s); the player then faces a lottery where material payoffs depend on the specified game and probabilities depend on the strategy profile. For a given strategy of the other player(s), each strategy of a player represents a point in the μ–σ diagram given the expected value and the implied variance of the strategy; i.e., for any strategy α this point is determined by $μ = E[α]$ and $σ^{2} = Var[α]$. We will show that mixing between two strategies of player i with the same utility (see Figure 1, where these strategies are denoted $α_{i}$ and $α_{i}^{'}$) actually leads to a loss of utility as the variance increases. This is in sharp contrast to the usual “egg shaped” efficient frontier seen in almost every textbook in finance, where mixing decreases the variance and therefore contributes to an increase in utility.

One main difference relative to other applications of non-generalized or generalized expected utility functions to game theory (see, for example, [25]) is that terminal node utilities of players now depend on how the terminal node is reached. In the case of random events, whether due to moves by nature or mixed strategies of one of the players, payoffs that are identical with respect to their material (or monetary) value give rise to differences in utility under the μ–σ paradigm. That terminal node utility may depend on endogenous aspects of other players’ behavior has recently received some detailed attention under the heading of psychological game theory (see, for example, [15,16,26], where beliefs of players matter for the utility payoffs of players). μ–σ utility in games implies that strategies and the history of play, whenever random events are involved, affect the utility.

We proceed by first defining games based on μ–σ utility functions and then study static 2 × 2 and N × M games based on linear utility functions. We then discuss nonlinear μ–σ utility functions.

2. Definition of Static $μ$ – $σ$ Games

To understand the applicability of μ–σ utility functions to games, we start by analyzing static games with complete information. We consider games with a finite set N of players and nonempty and finite sets $A_{i}$ ($i \in N$) of actions. Any profile of pure strategies $a \in \times_{i = 1}^{N} A_{i}$ provides player i with a material (not utility) payoff $u_{i}(a)$; we use the notation $a = (a_{1}, \dots, a_{N})$.

We consider mixed strategies, that is, elements of $\times_{i} Δ(A_{i})$. Let α be a profile of mixed strategies, that is, a vector $α = (α_{1}, \dots, α_{N})$ with $α_{i} \in Δ(A_{i})$. Furthermore, $α_{i}(a_{i})$ with $a_{i} \in A_{i}$ is the probability that player i plays the pure strategy $a_{i} \in A_{i}$, and $α(a) = \prod_{i} α_{i}(a_{i})$ for $a \in \times_{i} A_{i}$.

Player i can expect the following material payoff from strategy combination α:

$$E[α] := \sum_{a \in \times_{i} A_{i}} α(a) u_{i}(a) \qquad (1)$$

with a variance of

$$Var[α] := \sum_{a \in \times_{i} A_{i}} α(a) u_{i}^{2}(a) - E^{2}[α]. \qquad (2)$$

For ease of notation we use $μ = E[α]$ and $σ^{2} = Var[α]$.
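As a minimal sketch of (1) and (2), the following Python function enumerates all pure-strategy profiles and accumulates the mean and variance of one player's material payoff. The function name, the dictionary-based representation, and the payoff numbers are illustrative assumptions, not part of the paper.

```python
from itertools import product


def mean_and_variance(payoff, strategies):
    """Return (mu, sigma2) of one player's material payoff.

    payoff:     dict mapping a pure-strategy profile (tuple of actions)
                to this player's material payoff u_i(a)
    strategies: list with one dict per player, mapping action -> probability
    """
    mu = 0.0
    second_moment = 0.0
    for profile in product(*(s.keys() for s in strategies)):
        # alpha(a) is the product of the individual probabilities alpha_i(a_i)
        prob = 1.0
        for s, action in zip(strategies, profile):
            prob *= s[action]
        mu += prob * payoff[profile]
        second_moment += prob * payoff[profile] ** 2
    # Equation (2): second moment minus squared mean
    return mu, second_moment - mu ** 2


# Hypothetical 2 x 2 example: row player's payoffs, both players mix 50/50.
u_row = {("U", "L"): 4, ("U", "R"): 0, ("D", "L"): 0, ("D", "R"): 2}
alpha = [{"U": 0.5, "D": 0.5}, {"L": 0.5, "R": 0.5}]
mu, sigma2 = mean_and_variance(u_row, alpha)
print(mu, sigma2)
```

For these assumed numbers the mean is 1.5 and the variance is 2.75.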

Definition 1. A μ–σ game is a game where the utility of player i from strategy combination α is given by a μ–σ utility function

$$U_{i}(α) = V_{i}(μ, σ^{2}).$$

$V_{i}$ is strictly increasing in the first variable and strictly decreasing in the second variable, and is strictly quasiconcave in μ and $σ^{2}$.

We assume strict quasiconcavity to ensure uniqueness of the solution to classical maximization problems in finance. Relaxing this assumption most likely does not alter our results but makes the arguments very tedious.⁶

We first analyze the case of two players. Furthermore, we assume that the utility function is of the following simple linear form:

$$V(μ, σ^{2}) = μ - \frac{r}{2} σ^{2},$$

with r being the parameter for the strength of the variance aversion.⁷ We refer to a utility function of this form as linear utility.⁸

In the next paragraphs we analyze the effect of this utility model in well-known examples from the game theory literature. This allows us to show that in μ–σ games a Nash equilibrium does not always exist.

3. First Results for $μ$ – $σ$ Games

3.1. Best Response with Linear Utility

Compared to standard game theory, μ–σ games may have different sets of equilibria. In standard game theory, mixed strategies that randomize over actions with the same expected (material) utility yield the same utility payoff for a player, due to the linearity in probabilities assumed by the expected utility framework. In μ–σ games, the randomization of a mixed strategy comes at a price.

Given the behavior of the other player(s) in the game, the following maximization problem determines the best response of a player with μ–σ utility.

$$\begin{aligned} & \max_{α_{i}} \; \sum_{a \in \times_{i} A_{i}} α(a) u_{i}(a) - \frac{r}{2} \Big( \sum_{a \in \times_{i} A_{i}} α(a) u_{i}^{2}(a) - E^{2}[α] \Big) \\ ={} & \max_{α_{i}} \; \sum_{a \in \times_{i} A_{i}} α(a) \Big( u_{i}(a) - \frac{r}{2} u_{i}^{2}(a) \Big) + \frac{r}{2} \Big( \sum_{a \in \times_{i} A_{i}} α(a) u_{i}(a) \Big)^{2} \end{aligned}$$

The objective is quadratic in $α_{i}$, and for $r > 0$ the coefficient on the quadratic term $α_{i}^{2}(a_{i})$, which stems from $\frac{r}{2} E^{2}[α]$, is never negative.
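To make the maximization problem concrete, the sketch below evaluates $μ - \frac{r}{2} σ^{2}$ for the row player of a hypothetical 2 × 2 game on a grid of own mixing probabilities p. The payoffs (3, 0, 0, 1), the value r = 1, and the opponent's strategy q = 0.5 are assumptions chosen for illustration; the function name is hypothetical as well.

```python
def linear_utility(p, q, payoffs, r):
    """mu - (r/2) * sigma^2 for the row player of a 2 x 2 game.

    p: probability that the row player plays the first row
    q: probability that the column player plays the first column
    payoffs: row player's payoffs at (row1, col1), (row1, col2),
             (row2, col1), (row2, col2)
    """
    probs = (p * q, p * (1 - q), (1 - p) * q, (1 - p) * (1 - q))
    mu = sum(w * x for w, x in zip(probs, payoffs))
    sigma2 = sum(w * x * x for w, x in zip(probs, payoffs)) - mu ** 2
    return mu - 0.5 * r * sigma2


payoffs = (3, 0, 0, 1)   # assumed material payoffs, for illustration only
q, r = 0.5, 1.0          # assumed opponent strategy and variance aversion

grid = [k / 100 for k in range(101)]
values = [linear_utility(p, q, payoffs, r) for p in grid]
best_p = grid[values.index(max(values))]

print(linear_utility(0.0, q, payoffs, r))  # pure strategy "row 2"
print(linear_utility(1.0, q, payoffs, r))  # pure strategy "row 1"
print(linear_utility(0.5, q, payoffs, r))  # 50/50 mixture
print(best_p)
```

With these numbers both pure strategies yield 0.375, while the 50/50 mixture yields only 0.25, so the grid maximum sits at an endpoint: the player is indifferent between the two pure strategies yet strictly dislikes mixing them.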

Lemma 1 (best response with mixed strategy). In equilibrium a player with μ–σ utility does not choose a mixed strategy, unless all actions chosen with positive probability are characterized by the same expected value and the same variance, i.e., they are characterized by the same point in the μ–$σ^{2}$ diagram.

Proof. We show first that all convex combinations of two (mixed) strategies lie on a convex curve in a μ–σ diagram. Let $α_{i}$ and $α_{i}^{'}$ be two strategies of player i. Expected value and variance given the strategies of the other players can be denoted as

$$(E[α_{i}], Var[α_{i}]) \quad \text{and} \quad (E[α_{i}^{'}], Var[α_{i}^{'}]).$$

Let a convex mixture play $α_{i}$ with probability $λ < 1$ and $α_{i}^{'}$ with probability $1 - λ$; we refer to this strategy as λ. The expected value can then be calculated following (1):

$$\begin{aligned} E[λ] &= λ E[α_{i}] + (1 - λ) E[α_{i}^{'}] \\ &= E[α_{i}^{'}] + λ (E[α_{i}] - E[α_{i}^{'}]), \end{aligned} \qquad (3)$$

and the variance is given as

$$\begin{aligned} Var[λ] &= \sum_{a \in \times_{i} A_{i}} (λ α_{i} + (1 - λ) α_{i}^{'}) α_{-i}(a) u_{i}^{2}(a) - E^{2}[λ] \\ &= λ \sum_{a \in \times_{i} A_{i}} α_{i} α_{-i}(a) u_{i}^{2}(a) + (1 - λ) \sum_{a \in \times_{i} A_{i}} α_{i}^{'} α_{-i}(a) u_{i}^{2}(a) - (λ E[α_{i}] + (1 - λ) E[α_{i}^{'}])^{2} \\ &= λ (Var[α_{i}] + E^{2}[α_{i}]) + (1 - λ)(Var[α_{i}^{'}] + E^{2}[α_{i}^{'}]) - (λ E[α_{i}] + (1 - λ) E[α_{i}^{'}])^{2} \\ &= λ Var[α_{i}] + (1 - λ) Var[α_{i}^{'}] + λ (1 - λ)(E[α_{i}] - E[α_{i}^{'}])^{2}. \end{aligned} \qquad (4)$$

In the case that $E[α_{i}] \neq E[α_{i}^{'}]$, solving (3) for λ and substituting into (4) gives

$$\begin{aligned} Var[λ] ={} & \frac{E[λ] - E[α_{i}^{'}]}{E[α_{i}] - E[α_{i}^{'}]} Var[α_{i}] + \left(1 - \frac{E[λ] - E[α_{i}^{'}]}{E[α_{i}] - E[α_{i}^{'}]}\right) Var[α_{i}^{'}] \\ & + \frac{E[λ] - E[α_{i}^{'}]}{E[α_{i}] - E[α_{i}^{'}]} \left(1 - \frac{E[λ] - E[α_{i}^{'}]}{E[α_{i}] - E[α_{i}^{'}]}\right) (E[α_{i}] - E[α_{i}^{'}])^{2} \end{aligned}$$

and the second derivative is given as

$$\frac{d^{2} Var[λ]}{d E[λ]^{2}} = -2.$$

Therefore, all convex combinations of two strategies lie on a convex curve. Figure 2 illustrates such curves for three different combinations of strategies.

Our result follows from the observation that, given the linear utility function $μ - \frac{r}{2} σ^{2}$, the indifference curves of a player in the μ–σ diagram are straight lines. For this reason no convex combination of two strategies can do better than the better of the two strategies that are combined. Even if a player is indifferent between two strategies $α_{i}$ and $α_{i}^{'}$, he will see any mixture between these strategies as inferior. □
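Equation (4) can be checked exactly with rational arithmetic. The sketch below does so for a hypothetical 2 × 2 game in which $α_{i}$ and $α_{i}^{'}$ are the two pure strategies of the row player; the payoffs, q, and λ are arbitrary illustrative choices, and the helper name is an assumption.

```python
from fractions import Fraction as F


def moments(p, q, payoffs):
    """Mean and variance of the row player's payoff in a 2 x 2 game.

    p, q: probabilities of the first row and the first column
    payoffs: row player's payoffs at (row1, col1), (row1, col2),
             (row2, col1), (row2, col2)
    """
    probs = (p * q, p * (1 - q), (1 - p) * q, (1 - p) * (1 - q))
    mean = sum(w * x for w, x in zip(probs, payoffs))
    var = sum(w * x * x for w, x in zip(probs, payoffs)) - mean ** 2
    return mean, var


payoffs = (F(3), F(0), F(0), F(1))   # assumed material payoffs
q = F(1, 3)                          # assumed strategy of the other player
p1, p2, lam = F(1), F(0), F(2, 5)    # alpha_i, alpha_i' and the weight lambda

e1, v1 = moments(p1, q, payoffs)
e2, v2 = moments(p2, q, payoffs)
e_mix, v_mix = moments(lam * p1 + (1 - lam) * p2, q, payoffs)

# Right-hand sides of (3) and (4)
rhs_mean = lam * e1 + (1 - lam) * e2
rhs_var = lam * v1 + (1 - lam) * v2 + lam * (1 - lam) * (e1 - e2) ** 2

print(e_mix, rhs_mean)
print(v_mix, rhs_var)
```

The extra term $λ(1 - λ)(E[α_{i}] - E[α_{i}^{'}])^{2}$ is what pushes the variance of the mixture above the weighted average of the two variances whenever the expected values differ.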

This implies that an equilibrium in mixed strategies only exists if both strategies are represented by the same point in the μ–σ diagram, i.e., they not only have the same expected material payoff but also lead to the same variance.

Although this lemma seems to be related to Lemma 1 in [14], our model differs from theirs in one substantial point. The set of all pure actions in [14] is convex, which is not the case in static games with a finite number of pure strategies. Similarly, [13] looked at games where the players violate von Neumann and Morgenstern’s independence axiom and assumed that the preferences (in terms of payments) were quasiconcave. Again, our paper differs from that work because (in terms of payments) μ–σ utility functions need not be quasiconcave.⁹

We next study the implications of this for the best response to a pure strategy and for when an equilibrium in mixed strategies exists.

Lemma 2 (best response given pure strategies of another player). The best response to a pure strategy is a pure strategy, unless the material payoff of the player under consideration is the same for a set of at least two actions. Any (best response) mixed strategy can only randomize over this set of actions.

Proof. We refer again to Figure 2. Given that the other player chooses a pure strategy, every pure strategy of the player under consideration is a point on the μ-axis (its variance is zero). This implies that the point highest on the axis will be chosen unless two strategies lead to exactly the same material payoff. □

3.2. 2 × 2 Games with Linear Utility

To answer the question of when mixed strategy equilibria exist in μ–σ games, we start with the case of 2 × 2 games. The game we consider is given in Figure 3. We state our result for player 1 (the row player) and denote by q the probability that player 2 (the column player) chooses left. The following lemma characterizes the condition for a mixed-strategy best response in comparison to the condition under standard expected utility theory.

Lemma 3 (best response in 2 × 2 games). The best response in any 2 × 2 game is a mixed strategy if and only if (a) the usual condition of expected utility game theory holds,

$$0 = (a - c) q + (b - d)(1 - q), \qquad (5)$$

and (b) the following condition is true:

$$a + c = b + d. \qquad (6)$$

This lemma shows that 2 × 2 μ–σ games do not have more mixed equilibria than an equivalent standard 2 × 2 game. Being a mixed strategy equilibrium in the standard game is a necessary but not sufficient condition for being an equilibrium in the equivalent μ–σ game. The second condition (6) has to be fulfilled as well; therefore, many μ–σ games will not have any equilibrium.

Condition (6) has an insightful interpretation. It requires that, given any pure strategy of the other player, the sum of all material payoffs that the player is able to achieve over all his strategies is constant, i.e., independent of the pure strategy that is chosen given the randomization of the other player. Regardless of the other player’s choice, it is only the slice of the cake and not the size of the cake that is determined by the player’s own actions.

Proof. We apply Lemma 1, which implies for the 2 × 2 game that expected value and variance have to be the same for any variation in the probability p with which player 1 chooses up. This leads to the following two conditions:

$$\begin{aligned} E[p] &= a p q + b p (1 - q) + c (1 - p) q + d (1 - p)(1 - q) = const, \\ Var[p] &= a^{2} p q + b^{2} p (1 - q) + c^{2} (1 - p) q + d^{2} (1 - p)(1 - q) - (E[p])^{2} = const. \end{aligned}$$

The constant expected value implies

$$(a - c) q + (b - d)(1 - q) = 0.$$

A non-degenerate mixed equilibrium requires $0 < q < 1$, and thus $a \neq c$ and $b \neq d$. Differentiating the variance condition with respect to p and using that $E[p]$ is constant gives

$$(a^{2} - c^{2}) q + (b^{2} - d^{2})(1 - q) = 0.$$

Factoring the squares and substituting $(b - d)(1 - q) = -(a - c) q$ from the first condition yields $(a - c) q \, [(a + c) - (b + d)] = 0$, which gives the second condition for the existence of a mixed equilibrium, $a + c = b + d$. □
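Conditions (5) and (6) are easy to test mechanically. The helper below is a sketch with a hypothetical name; it takes the row player's payoffs a, b, c, d laid out as in the expected-value formula of the proof (a, b for up; c, d for down) and returns the opponent's probability q that makes a mixed best response possible, or None. The matching-pennies payoffs ±1 and the second example are assumptions chosen for illustration.

```python
from fractions import Fraction


def mixed_best_response_q(a, b, c, d):
    """Check conditions (5) and (6) for the row player of a 2 x 2 game.

    Returns the probability q in (0, 1) of the opponent playing "left"
    for which both conditions hold, or None if no such q exists.
    """
    if a + c != b + d:                 # condition (6) fails
        return None
    denom = (a - c) - (b - d)
    if denom == 0:                     # (5) has no unique solution for q
        return None
    # Solve (a - c) q + (b - d) (1 - q) = 0 for q
    q = Fraction(d - b, denom)
    return q if 0 < q < 1 else None


print(mixed_best_response_q(1, -1, -1, 1))   # matching pennies
print(mixed_best_response_q(3, 0, 0, 1))     # violates condition (6)
```

For matching pennies the helper returns q = 1/2; for the second payoff table it returns None because a + c = 3 differs from b + d = 1, even though the standard expected-utility indifference condition (5) alone could be met at q = 1/4.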

We next characterize the games where mixed equilibria do survive.

Theorem 1 (Mixed Equilibria in 2 × 2 games). A mixed equilibrium in 2 × 2 μ–σ games with linear utility functions exists if and only if

(i): The candidate for equilibrium is a mixed equilibrium of a standard (expected utility) game with utility payoffs equal to the monetary payoffs of the μ–σ game;
(ii): For each strategy of the other player, the sum of monetary payoffs of a player is the same for the strategies available to the player.

Proof. That the two conditions imply the existence of a mixed strategy equilibrium follows directly from Lemma 3, in particular Equations (5) and (6). □

In the following we discuss special cases. We start by analyzing zero-sum games.

Theorem 2 (2 × 2 zero-sum games). The only 2 × 2 μ–σ zero-sum game with an equilibrium in non-degenerate mixed strategies is matching pennies.

Proof. We denote the material payoffs as given by Figure 4.

Given the previous results, we know that the following two conditions have to be fulfilled in a mixed strategy equilibrium:

$$a + c = b + d \qquad (7)$$

$$α + β = γ + δ \qquad (8)$$

Given that we study zero-sum games, we know

$$a + α = b + β = c + γ = d + δ = C.$$

Substituting the last equation into (7) gives us

$$C - α + C - γ = C - β + C - δ.$$

Adding this equation to (8) yields $β = γ$, $b = c$, $a = d$, and $α = δ$; therefore, Figure 5 represents this game. □
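For matching pennies, the exception in Lemma 1 can be observed directly: when the opponent mixes with probability 1/2, every own mixing probability yields the same point in the μ–$σ^{2}$ diagram. The sketch below assumes the standard payoffs +1 and −1 for the row player (the figure is not reproduced here, so these numbers are an assumption), and the helper name is hypothetical.

```python
from fractions import Fraction as F


def point(p, q, payoffs):
    """(mu, sigma^2) of the row player's payoff in a 2 x 2 game."""
    probs = (p * q, p * (1 - q), (1 - p) * q, (1 - p) * (1 - q))
    mu = sum(w * x for w, x in zip(probs, payoffs))
    sigma2 = sum(w * x * x for w, x in zip(probs, payoffs)) - mu ** 2
    return mu, sigma2


# Assumed matching-pennies payoffs with a = d = 1 and b = c = -1
pennies = (F(1), F(-1), F(-1), F(1))

# Collect the (mu, sigma^2) points for p = 0, 0.1, ..., 1 when q = 1/2
points = {point(F(k, 10), F(1, 2), pennies) for k in range(11)}
print(points)
```

All eleven strategies collapse to the single point (0, 1), so the player bears the same variance whatever he does and mixing carries no extra cost.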

Our next result concerns non-zero-sum games. It is a well-known, if disturbing, result in μ–σ theory that a portfolio with higher payoffs is not necessarily preferred by an investor (preferences need not be monotone).¹⁰ Therefore, it is not clear that dominated strategies in μ–σ games cannot be equilibrium strategies. In 2 × 2 games we can show the following result.

Theorem 3 (2 × 2 games with dominated strategies). If a strategy in a 2 × 2 game is dominated in monetary payoffs, then no mixed equilibrium of the 2 × 2 μ–σ game exists.

Proof. This follows immediately from the fact that any equilibrium of the μ–σ game must be an equilibrium in the equivalent expected utility framework. In expected utility games, dominated strategies never receive a positive probability weight. □

While this result seems obvious at first, it is less so if one considers that choosing the action that is dominant in monetary payoffs may lead to a higher variance than the dominated action, which could overcompensate for the dominated action’s loss in payoff. As the result shows, this cannot be the case. These results have a set of noteworthy implications:

(i): Only coordination games and games without a pure strategy equilibrium in the expected utility framework can have mixed strategy equilibria. This follows from solving (5) for q and substituting (6). Given $q \in (0, 1)$, either one action dominates the other or players prefer payoffs in two diagonal corners. Theorem 3 rules out the former, leaving the cases where both players either prefer the same two diagonal corners (coordination games) or prefer different corners (games without a pure strategy equilibrium in standard games).
(ii): Battle of the sexes μ–σ games do not have a mixed strategy equilibrium unless players, in case of miscoordination, receive an additional payoff equal to the difference in their payoffs between the preferred and the alternative equilibrium. This can be seen from Figure 6. A mixed strategy equilibrium exists iff (6) is satisfied. This is equivalent to $a = b + c$.

3.3. N×M Games with Linear Utility

Mixed equilibria in 2 × 2 μ–σ games only exist if an additional constraint on payoffs holds that ensures that these strategies are a best response. In this section we show that for N × M games a mixed equilibrium only exists if the game is degenerate. To do so, we show that any best response avoids mixing unless the material payoffs for this player are constant over all possible outcomes of the game.

Theorem 4 (best response with N × M games). In an N × M μ–σ game ($M, N > 2$), in any mixed equilibrium players randomize over at most two pure strategies, unless the payoffs to the player are constant (independent of his choice).

Proof. Again, we study the best responses of players. From Lemma 1 we know that all actions that a player chooses with positive probability have to be represented by the same point in the μ–σ diagram. To illustrate our argument, let us assume that a player randomizes over three actions, while the other player randomizes over only two. To this end, consider the 3 × 2 game with material payoffs given in Figure 7.

Let $p_{1}$, $p_{2}$, and $1 - p_{1} - p_{2}$ be the probabilities that the player chooses up, middle, and down, respectively. The condition for a constant expected value in this case is

$$\begin{aligned} const = E[p_{1}, p_{2}] ={} & a p_{1} q + b p_{1} (1 - q) + c p_{2} q + d p_{2} (1 - q) \\ & + e (1 - p_{1} - p_{2}) q + f (1 - p_{1} - p_{2})(1 - q). \end{aligned}$$

This is a condition on two variables, which gives us two constraints:

$$\begin{aligned} 0 &= (a - e) q + (b - f)(1 - q) \\ 0 &= (c - e) q + (d - f)(1 - q). \end{aligned}$$

Combined, they also imply $(a - c) q + (b - d)(1 - q) = 0$. Any nondegenerate mixed equilibrium implies that $q \neq 0$, which immediately implies $a \neq e \neq c$.

Furthermore, the variance needs to be constant:

$$\begin{aligned} const = Var[p_{1}, p_{2}] ={} & a^{2} p_{1} q + b^{2} p_{1} (1 - q) + c^{2} p_{2} q + d^{2} p_{2} (1 - q) \\ & + e^{2} (1 - p_{1} - p_{2}) q + f^{2} (1 - p_{1} - p_{2})(1 - q) - (E[p_{1}, p_{2}])^{2}. \end{aligned}$$

This is a condition on two variables, and using derivatives it can be reduced to two equations,

$$\begin{aligned} 0 &= (a^{2} - e^{2}) q + (b^{2} - f^{2})(1 - q) \\ 0 &= (c^{2} - e^{2}) q + (d^{2} - f^{2})(1 - q), \end{aligned}$$

and thus $0 = (a^{2} - c^{2}) q + (b^{2} - d^{2})(1 - q)$. Solving implies

$$\begin{aligned} a + e &= b + f \\ c + e &= d + f \\ a + c &= b + d \end{aligned}$$

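These three equations imply $a = b$, $c = d$, and $e = f$; the constraints from the constant expected value then reduce to $a = c = e$ (and hence $b = d = f$), which contradicts the condition $a \neq e \neq c$ derived above. □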

3.4. A Game with Nonlinear Utility Functions

The results of the previous section depend heavily on the fact that we restricted ourselves to linear μ–σ utility functions. If we consider other utility functions, an equilibrium in mixed strategies may well exist although the restrictive condition (6) is not met. To show this, we consider the following utility function:

$$V(μ, σ^{2}) = -\frac{1}{μ} - \frac{1}{2} σ^{2}.$$

Its indifference curves are convex and monotone functions in the μ–$σ^{2}$ diagram.

We now look at a game where the material payoffs are given by Figure 8. We can show that $(0.5, 0.5)$ is a mixed equilibrium of the game. Notice that in classical game theory (where utilities are given by Figure 8) an equilibrium would be given by $(0.75, 0.25)$.

Assume that the row player chooses $q = 0.5$. Then the utility of the column player is given by

$$V(μ(p), σ^{2}(p)) = -\frac{1}{8} - \frac{3}{2} p + \frac{1}{2} p^{2} - \frac{1}{\frac{1}{2} + p}.$$

On $[0, 1]$, this function attains its maximum at $p = \frac{1}{2}$. Hence, the best response is a mixed strategy. With the same reasoning we can show that $q = \frac{1}{2}$ is the best response to the other player’s strategy $p = \frac{1}{2}$.
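The claim about the maximum can be checked numerically. The sketch below evaluates the utility function displayed above (for q = 0.5) on a fine grid over [0, 1]; the function name and the grid resolution are arbitrary choices made for this illustration.

```python
def column_utility(p):
    """Utility from the text for q = 0.5, as a function of the own mixing probability p."""
    return -1 / 8 - 1.5 * p + 0.5 * p ** 2 - 1 / (0.5 + p)


# Grid search over [0, 1] in steps of 0.001
grid = [k / 1000 for k in range(1001)]
best_p = max(grid, key=column_utility)

print(best_p, column_utility(best_p))
print(column_utility(0.0), column_utility(1.0))  # the two pure strategies
```

The grid maximum is at p = 0.5 with utility −1.75, which exceeds the utility of both pure strategies. This matches the first-order condition $-\frac{3}{2} + p + (\frac{1}{2} + p)^{-2} = 0$, which holds at $p = \frac{1}{2}$.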

4. Conclusions

We applied μ–σ utility to game theory. Using monetary games, we discussed how equilibrium predictions, in particular with respect to the existence of mixed equilibria in static games, change if behavior can be described by preferences depending on the mean and variance of random payoffs. This is an alternative to models of generalized expected utility, which relax the assumption of linearity in probabilities that is the basis of von Neumann and Morgenstern’s expected utility model. While generalized expected utility models still maintain the assumption that terminal utilities are independent of the way the respective endpoint is reached, μ–σ theory allows us to capture endogenous uncertainty caused by mixed strategies of players. In the case of 2 × 2 games we were able to show that mixed strategy equilibria survive in a μ–σ game under a set of additional restrictions. Thus, the set of mixed equilibria in a μ–σ game is a subset of the mixed equilibria of the equivalent game where the monetary payoffs are interpreted as utility payoffs.

Our analysis is based on the interpretation of mixed strategies as randomization by players over their actions, and not as beliefs about the actions of the other player or about the composition of a population of other players playing pure strategies from which the other player is randomly drawn.¹¹ In this case mixed equilibria only survive in μ–σ games if there is a substantial gain from randomizing, for example, because allowing the other player to predict one’s behavior comes at a first-order cost, as in the case of zero-sum games.

We believe this analysis can help to capture the experimentally observed aversion against mixing by players. While the μ–σ model is a very specific abstraction and somewhat arbitrary, its prominence in finance and its capability to capture uncertainty endogenous to the play of the game made it, for us, a worthwhile starting point to reconsider equilibria when one departs from using utility functions that can be characterized by Choquet integrals.

Author Contributions

Methodology, writing, and formal analysis: Both authors contributed equally. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Von Neumann, J.; Morgenstern, O. Theory of Games and Economic Behavior; Princeton University Press: Princeton, NJ, USA, 1947.
2. Allais, M. Le Comportement de l’Homme Rationnel devant le Risque. Econometrica 1953, 21, 503–546.
3. Ellsberg, D. Risk, Ambiguity, and the Savage Axioms. Q. J. Econ. 1963, 75, 643–669.
4. Quiggin, J. A Theory of Anticipated Utility. J. Econ. Behav. Organ. 1982, 3, 225–243.
5. Yaari, M.E. The Dual Theory of Choice under Risk. Econometrica 1987, 55, 95–115.
6. Segal, U. The Ellsberg Paradox and Risk Aversion: An Anticipated utility Approach. Int. Econ. Rev. 1987, 28, 175–202.
7. Cochrane, J.H. Asset Pricing, 2nd ed.; Princeton University Press: Princeton, NJ, USA, 2005.
8. Fama, G.; French, K. The cross section of expected returns. J. Financ. 1992, 47, 427–465.
9. Markowitz, H.M. Portfolio Selection. J. Financ. 1952, 7, 77–91.
10. Lajeri-Chaherli, F.; Nielsen, L.-T. Parametric characterizations of risk aversion and prudence. Econ. Theory 2000, 15, 469–476.
11. Lajeri-Chaherli, F. More on Properness: The Case of Mean-Variance Preferences. Geneva Risk Insur. Rev. 2002, 27, 49–60.
12. Meyer, J. Two Moment Decision Models and Expected Utility Maximization. Am. Econ. Rev. 1987, 77, 421–430.
13. Crawford, V.P. Equilibrium without Independence. J. Econ. Theory 1990, 50, 127–154.
14. Chen, H.-C.; Neilson, W.W. Pure-strategy equilibria with non-expected utility players. Theory Decis. 1999, 46, 19–209.
15. Rabin, M. Incorporating Fairness Into Game Theory and Economics. Am. Econ. Rev. 1993, 83, 1281–1302.
16. Battigalli, P.; Dufwenberg, M. Dynamic Psychological Games. J. Econ. Theory 2009, 144, 1–35.
17. Choquet, G. Theory of capacities. Ann. l’Inst. Fourier 1953, 5, 131–295.
18. Löffler, A. Variance aversion implies μ-σ-criterion. J. Econ. Theory 1996, 69, 532–539.
19. Battigalli, P.; Samuelson, L.; Van Huyck, J. Optimization Incentives and Coordination Failure in Laboratory Stag Hunt Games. Econometrica 2001, 69, 749–764.
20. Al-Ubaydli, O.; Jones, G.; Weel, J. Patience, cognitive skill, and coordination in the repeated stag hunt. J. Neurosci. Psychol. Econ. 2013, 6, 71–96.
21. Agranov, M.; Healy, P.J.; Nielsen, K. “Non-Random Randomization”, Working Paper. 2020. Available online: https://ssrn.com/abstract=3544929 (accessed on 11 January 2021).
22. Raiffa, H. Risk, Ambiguity, and the Savage Axioms: Comment. Q. J. Econ. 1961, 75, 690–694.
23. Eichberger, J.; Kelsey, D. Uncertainty Aversion and Preference for Randomization. J. Econ. Theory 1996, 71, 31–43.
24. Harsanyi, J.C.; Selten, R. A General Theory of Equilibrium Selection in Games; MIT Press: Cambridge, MA, USA, 1988.
25. Dekel, E.; Safra, Z.; Segal, U. Existence and dynamic consistency of Nash equilibrium with non-expected utility preferences. J. Econ. Theory 1991, 55, 229–246.
26. Geanakoplos, J.; Pearce, D.; Stacchetti, E. Psychological Games and Sequential Rationality. Games Econ. Behav. 1989, 1, 60–79.
27. Nielsen, L.T. Portfolio selection in the mean-variance model: A note. J. Financ. 1987, 42, 1371–1376.

1.	See, for example, [10,11,12].
2.	References [13,14,15,16] also consider monetary or material games.
3.	See [17].
4.	See [18] for an axiomatic foundation of $μ$ – $σ$ utility functions based on preference axioms. It is easy to see that $μ$ – $σ$ utility functions are not a special case of a Choquet integral with respect to some capacity: Choquet integrals are homogeneous of degree one, a property that many $μ$ – $σ$ functions (for example, $μ - σ^{2}$) do not possess.
5.	However, notice that [21] found very high rates of randomization in their experiments.
6.	See the discussion in [10].
7.	This is analogous to the definition of the Arrow–Pratt measure; see [10,12].
8.	Note that this utility function shares with CARA expected utility functions the characteristic that the preference over risk is independent of the wealth or income level.
9.	Using suitable numbers, one can already show that $μ - σ^{2}$ has upper contour sets that are not convex.
10.	This particular feature of $μ$ – $σ$ preferences is well known; see, for example, [27]. The fact that the CAPM as an application of $μ$ – $σ$ is still used today shows that the non-monotonicity is not regarded as a major problem in finance.
11.	With the population interpretation, selection criteria, particularly payoff dominance [24], become important. Payoff dominance does not select mixed equilibria in coordination games, as these minimize the expected payoff, and it is difficult to argue why a population’s composition should yield this result. Applied to coordination games, $μ$ – $σ$ theory gives a similar prediction.

Figure 1. Mixing two lotteries $α_{i}$ and $α_{i}^{'}$ with the same utility decreases the utility in a $μ$ – $σ$ game.

Figure 2. Mixtures of three pairs of strategies $(α_{i}, α_{i}^{'})$, $(β_{i}, α_{i}^{'})$ and $(γ_{i}, α_{i}^{'})$ in the $μ$ – $σ$ diagram.

Figure 3. Material payoffs for best response function.

Figure 4. Material payoffs in the game of Theorem 2.

Figure 5. Matching pennies: the only $μ$ – $σ$ zero-sum game with (non-degenerate) mixed strategies.

Figure 6. Battle of sexes with mixed strategy equilibrium ($a > b$).

Figure 7. Material payoffs for best response function in a 3 × 2 game in the proof of Theorem 4.

Figure 8. Material payoffs of a game with nonlinear utility.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Dulleck, U.; Löffler, A. μ–σ Games. Games 2021, 12, 5. https://doi.org/10.3390/g12010005
