1. Introduction
Multi-player evolutionary games are simple models that can be employed to study a wide range of systems across several diverse disciplines [
1,
2,
3,
4,
5,
6,
7,
8,
9]. In these mathematical models identical players are located at the nodes of a network structure (e.g., at the sites of a lattice), they repeatedly play the same two-player game against their neighbors, and are allowed to change their strategy according to some evolutionary update rule. The pair interaction is defined by an
payoff matrix [
10] if the players can choose among
n options (henceforth called strategies). The large number of payoff parameters causes difficulties in systematic investigations and in the classification of possible behaviors as well. These difficulties can be reduced by recognizing that two games are equivalent if their payoff matrices can be transformed into each other by exchanging some strategy labels.
The linear decomposition of payoff matrices [
11,
12,
13,
14] can easily reveal such symmetry relations, provides a classification scheme for payoff matrices, and allows us to distinguish fundamentally different types of elementary games [
15,
16]. This approach generalizes the two-strategy coordination game that enforces players choosing the same strategy. These latter models are equivalent to the kinetic Ising model if
and the logit rule controls the evolution [
17,
18,
19,
20,
21,
22,
23,
24,
25,
26,
27]. For
n strategies, generalized coordination games are composed of two-strategy coordination subgames between all possible strategy pairs [
16]. When the logit rule is used [
28,
29,
30], some lattice versions of these models become equivalent to spin systems or lattice gas models (e.g., Potts models [
31] or the Ashkin-Teller model [
32]) that exhibit thermodynamic behavior [
33,
34,
35]. In “real” coordination games the maximal value of the payoff matrix is located in its main diagonal, and the corresponding strategy is preferred by all players. Consequently, the strategy distribution observed at low noises can be considered as an ordered phase that transforms into a disordered strategy arrangement if the noise is increased.
So far, we have tacitly considered cases where the maximal payoff is unique or degenerate in such a way that equivalent phases possess similar features and symmetries at low noises. What happens when the latter conditions are not satisfied? Which of the possible equilibria will be preferred? For equal average payoffs in the ordered strategy arrangements, it will be shown that these systems select one of those ordered phases that have higher entropy. Furthermore, entropy can have a similar role even when the payoffs of the ordered phases are tuned away from being equal.
In this paper we investigate the stabilization effects of entropy in a five-strategy coordination game where the subgames of the first two and the last three strategies are identical to a ferromagnetic Ising and a three-state Potts model, respectively. It is also assumed that the players receive zero income if they choose strategies belonging to different subgames. Using Monte Carlo simulations and cluster variation methods, we show that one of the “Ising phases” is stabilized by its higher entropy at low noises if the payoffs are identical in all five ordered strategy arrangements. Furthermore, this system undergoes an Ising-type phase transition when the noise is increased. Evidently, any small extra payoff in the “Potts phase” can stabilize it in the low noise limit. In the rest of the paper we describe the model in detail and quantitatively study its phase transitions for several payoff values.
2. The Model and Its General Features
Throughout this paper we study evolutionary games on an
-site square lattice with periodic boundary conditions [
6,
13,
36]. There is one player located at each site, all of whom repeatedly play the same two-player game with their four nearest neighbors and may choose one of
n available pure strategies to play in all four games. If we represent these strategies with the
n Cartesian unit vectors of an
n-dimensional Euclidean space, then the income of the player at site
x can be calculated as
Here
denotes the strategy chosen by the player at site
x, the summation runs over this player’s nearest neighbor sites (
), and the element
of the
payoff matrix is the income of the player playing the
i-th strategy against his/her opponent selecting the
j-th strategy [
10].
This paper is focused on the study of a five-strategy game in which subgames associated with its first two (henceforth strategy 1 or 2) and last three strategies (labelled by numbers 3, 4, and 5) are an Ising and a three-state Potts model, respectively. Opposing players whose strategies belong to different subgames both receive zero payoff. We note again that we use pure coordination-type games for which the sum of payoffs is zero in each row and column of the payoff matrix. Consequently, the elements of our Potts payoff matrix are shifted by a uniform irrelevant constant in comparison with the standard diagonal Potts payoff matrix [
31,
37]. If the Ising subgame’s payoff rewarding coordination is chosen to be the unit of currency and
is the relative strength of the Potts subgame, then the payoff matrix takes the following form:
This payoff matrix contains a coordination-type (Ising) interaction between strategies 1 and 2 with unit strength while the strategy pairs (3,4), (3,5), and (4,5) are coordinated similarly with a strength of
[
16]. Our analysis is restricted to
.
The present game is a potential game because its payoff matrix is symmetric (
), and its corresponding potential matrix is
[
13]. In this evolutionary game the potential
of the whole system, a function of strategy configuration
, can be expressed as a sum of the pair potentials. Namely,
This quantity resembles the negative potential energy (Hamiltonian) of a physical system. Maximal total payoff is achieved if all players choose either strategy 1 or 2 (called an Ising phase) if
. For
, however, the total payoff reaches its maximum in one of three ordered Potts phases when all players uniformly choose strategy 3 (or 4 or 5).
If the evolution of strategy profile
is controlled by the so-called logit rule, then a randomly selected player may change his/her strategy
to any available pure strategy
with payoff-dependent probability
where the parameter
K quantifies external noise (or temperature) affecting the decision-making abilities of players. This rule exponentially favors higher payoffs, provided that the opponents do not modify their strategies [
28,
29,
30,
35].
The main advantage of the application of the logit rule is that the repetition of the above elementary strategy changes will drive potential games into the Boltzmann distribution [
33,
34] when strategy configuration
is realized with probability
The macroscopic behavior of this system can be quantified by the average proportions of players following the ith strategy (). The general features of this evolutionary game share many similarities with those of the constituent Ising and Potts models. At high temperatures (K) the system evolves into a disordered state in which strategies are chosen by almost equal shares of the population, or more precisely, all five strategy frequencies approach when K goes to infinity. On the other hand, strategy choices are homogenized in the low-noise limit, and we find the system in one of the competing ordered phases. When , the Potts phase prevails in which all players follow one of the Potts strategies (e.g., strategy 3, 4, or 5). In the following, we will assign label 3 to the strategy that forms the Potts phase as . In the other case () one of the Ising strategies dominates the low-temperature regime, which we will call strategy 1 without any loss of generality.
4. Results and Discussion
First, we sketch the general features that are already captured by mean-field approximation, and then move on to highlight finer details revealed by pair approximation and Monte Carlo simulations. From the Monte Carlo results, we extract the critical exponents of the system’s order-disorder phase transitions. Finally, we argue that the first-order transitions between the ordered phases are driven by entropy.
Figure 2a shows the equilibrium strategy frequencies derived within the framework of mean-field approximation when the Ising and Potts components are equally strong (
). Here the curves represent all possible solutions of Equation (
10), stable solutions are distinguished by thick lines. This two-player game has five equal-payoff Nash-equilibria and the multi-agent system has five degenerate ground states, one for each pure strategy when the game is played on a square lattice. Henceforth, as mentioned earlier, we assume that in the two-fold degenerate Ising phase strategy 1 dominates the system at low noises (
if
) while strategy 3 plays the same role among the three-fold degenerate Potts phases.
Figure 2a illustrates that the thermodynamic potential
of the Ising phases is greater than
of the Potts phases at all temperatures. Raising the noise level lowers the population gap between majority and minority strategies in the ordered states, the Ising state loses its stability at a critical point
, and above this critical noise level the system evolves into a disordered phase, where all five strategies are present with the same probability, that is,
. Notice, that the latter disordered strategy distribution also exists at low noises as an unstable solution. We note, furthermore, that mean-field theory predicts the presence of a first-order discontinuity in the Potts phase which is observed neither in the dynamical pair approximation approach nor in Monte Carlo simulations.
Evidently, the increase of
stabilizes the Potts phases and modifies their noise-dependence as illustrated in
Figure 2b–d. According to mean-field theory the Potts phases are stable at low noises if
or
(see
Figure 2b,c), and these ordered states transform into one of the Ising phases at
. The ordered Ising phase exhibits similar behavior as above and transforms continuously into the disordered phase at
if
K is increased. The phase transition between the Potts and the Ising phases is of the first order and it is driven by the higher entropy of the Ising phases.
In agreement with expectations,
increases monotonically with
, moves toward
, and shrinks the width of the temperature region where the Ising phase is stable. Eventually, at
, the predicted critical noise
reaches
, and the Ising phase disappears. For
the system has only one type of ordered phases and one first-order transition, as pictured in
Figure 2d.
Mean-field approaches neglect the correlations between (fixed) neighbors, but their predictions may be applied to well-mixed populations. The common feature of the present games is that the average potential is zero in the disordered phase at the level of the mean-field approximation. Moreover, in the Ising (Potts) phase the Potts (Ising) strategies do not contribute to the average potential. Pair approximation, however, takes into consideration the relevant nearest neighbor correlations and gives more adequate predictions, as shown by comparisons with Monte Carlo data in
Figure 3.
The predictions of mean-field theory are justified by both pair approximation and Monte Carlo simulations at low temperatures that typically cause point defects in the ordered phases. The latter methods, however, predict a differently structured disordered phase where Ising strategies are chosen slightly more often than Potts strategies due to the correlations favoring the choice of identical neighboring strategies. It is worth mentioning that in the analytic continuation of the disordered solutions the equal frequencies of Potts strategies tend to zero as .
In general, cluster variation methods overestimate the critical temperatures of continuous order-disorder transitions but give a more accurate approximation of first-order transition points between ordered phases. This expectation is confirmed by comparing the plots of
Figure 3. The effects of long range correlations and fluctuations become relevant in the vicinity of the critical point and modify the power law behavior as illustrated by these results [
43].
In order to characterize the continuous transitions, we have investigated the critical algebraic behaviors hidden in the quantities
.
Figure 4 shows Monte Carlo simulation data for the Ising magnetization
if
and
. For the quantitative analysis of the vanishing Potts phase we used a similar order parameter,
, which vanishes continuously at the critical point for
and
[
37,
44]. All four sets of Monte Carlo data seem to follow power laws. The critical exponents were estimated by fitting algebraic functions to the relevant data points closest to their respective critical temperatures. The estimated critical exponent (
) for the Ising magnetization turned out to be
[
] for
and
[
] for
that convincingly approximates the theoretical result (
) obtained by Onsager [
18] for the two-dimensional Ising model. These results support the robustness of Ising-type behavior in composite games with independent elementary coordination components (cf. Reference [
15,
16] for a counterexample). If
, the critical exponent for
was found to be
[
] that is remarkably close to
characteristic to the two-dimensional three-state Potts model [
37]. For
, however, the Potts phase transforms into the disordered one with a decidedly different exponent, namely
at
.
Why is the Ising state still preferred near the order-disorder transition point when
? Why is its free energy still higher than that of the Potts state?
Figure 5 clearly shows that the contribution of entropy (
) to the free energy has a very important role in stabilizing the Ising state. In both ordered states there are four minority strategies that are distributed differently in the two cases. The Potts phase has two higher- and two lower-frequency minority states as opposed to the three to one ratio observed in the Ising phase. Consequently, when compared at a fixed noise level, the Ising phase is less ordered, thus its entropy is higher and also grows faster as the noise level approaches
from below; hence the free energy of the Ising phase grows faster and overtakes that of the Potts state. This phenomenon creates a kind of social dilemma insofar as the Potts phase with its higher average payoff is not the system’s stable equilibrium steady state. In other words, the presence of a competing higher-entropy state can prevent the community from maximizing its total payoff.
The stabilization effect of entropy can also be observed in elementary coordination games [
15,
45] that form a basis in the subspace of coordination-type games [
12]. An
n-strategy elementary coordination game is an extension of the original coordination game [
3,
9,
46,
47,
48] in which beside two coordinated strategies
additional neutral strategies are also available, which provide zero payoff regardless of the opponent’s strategy. Notice that this game is exactly the Ising component of the game defined by the payoff matrix in Equation (
2). If this game is played on a square lattice, then the system undergoes an order-disorder phase transition as the noise level is increased across a critical value. Below the transition point the frequency of one of the coordinated strategies starts to grow, while the number of players following other strategies is reduced. The proportion of players following the other coordinated strategy drops faster than the proportion of those who play one of the neutral strategies. In the disordered phase all strategies are similarly frequent, both coordinated strategies are played with equal and slightly higher probabilities than their neutral counterparts. The order and the critical noise level of this phase transition is determined by the number of neutral strategies. If
n is greater than the threshold
[
15], then the system exhibits a first-order phase transition, whereas it becomes continuous if fewer strategies are available. The more neutral strategies there are, the smaller the critical temperature turns out to be. To explain these results one should observe how increasing the number of neutral strategies affects the entropy content in the two phases. Deep within the ordered phase, it is easy to see that the competing high-entropy state’s thermodynamical potential can grow beyond limit at a fixed noise level as more and more additional strategies are introduced. The ordered state may maintain its stability by either increasing its average payoff via further homogenization, which decreases entropy and is capped by the limiting case of uniform cooperation, or by raising its entropy through dismantling order that in turn diminishes average payoff and drives the system toward the disordered phase. As a result, the stability region of the high-entropy disordered phase becomes larger when additional neutral strategies are introduced into an elementary coordination game. A similar entropy-based stabilization is characteristic of high-entropy alloys [
49,
50,
51], a promising family of materials for several technical purposes.
If decoupling of independent game components holds at the mean-field level, then the above-described results should give us an idea about what happens in games made up of an Ising and an (
)-strategy Potts subgame, just like the one in Equation (
2). We should expect to see competing Ising and Potts phases for low enough temperatures and the entropic stabilization of the Ising state near the transition temperature when
is not too high. As
n is increased, critical temperatures should become lower because of the faster increase in the entropy content of the disordered phase.
5. Conclusions
In this paper, we have studied a five-strategy evolutionary potential game that pits an Ising-type and a three-state Potts-type coordination game against each other. This interaction combines the two games as independent subgames of the first two and last three strategies, respectively. The application of the logit strategy update rule has allowed us to utilize various well-known concepts and methods of statistical physics. The systematic investigation of the spatial system is restricted to a model where the players are located at the sites of a square lattice and repeatedly play the five-strategy game against their nearest neighbors. Due to this choice, the observed properties of the stationary states can be compared with universal features characteristic of the original Ising and Potts models.
The game is defined by a single parameter quantifying the ratio of payoffs in the ordered Ising and Potts phases. It is found that both of the two degenerate Ising phases are stable at low noises if . Their higher entropy favors the Ising phases over the Potts phases if . The contribution of entropy, however, vanishes at low noises, therefore the advantage of the Ising phases can be compensated if the Potts phases provide slightly higher payoffs (). In the latter cases one can observe a first-order phase transition from the Potts phase to the Ising phase if the noise level (temperature) is increased. The critical noise level of this first-order transition increases with until eventually the Ising phase (along with the first-order Potts–Ising transition) disappears because the Ising phase loses its metastability to the disordered state before the Potts phase transforms into the same disordered phase.
Independently of the preferred ordered phases at low noises, the system undergoes a continuous (critical) order-disorder phase transition. The quantitative analysis of the order parameters indicates that in most of the cases these continuous transitions preserve the universal behavior of the corresponding (Ising or Potts) model. Additionally, we have observed non-universal behavior in a narrow range of parameters where the interplay between these two different transitions seems to be relevant. It is worth mentioning that similar non-universal critical phase transitions were also reported in several versions of the Ashkin-Teller model [
32,
52,
53,
54] which are also combinations of elementary coordination games [
16]. The systematic analysis of the non-universal phase transitions goes beyond the scope of the present work. At the same time, we think that the concept of matrix decomposition could prove to be a general frame for the identification of multi-dimensional order parameters as well as for the parametrization of interactions when the tensor renormalization group approach [
55,
56] is used for the investigation of critical phase transitions.
We have found that the first-order transition from the Potts state to the Ising state is accompanied by a loss of average income for . The stability of the Ising phase resembles the tragedy of the commons, where the community as a whole does not maximize its total payoff. In this situation the social dilemma is caused by the higher entropy content of the Ising state. In other words, the number of microscopic states in the Ising phase significantly exceeds those which belong to the Potts phase. It is expected that a similar phenomenon can occur in many other social systems where the increasing number of noise-dependent mistakes in the competing ordered phases can compensate for the advantage of higher average payoffs.