1. Introduction
Since the seventeenth century, mankind’s thinking has been profoundly affected by the views of survival of the fittest by Darwin in natural selection and rational egoism in economics. With fierce competition in the economic, political, military, and other areas since the industrial revolutions, especially after the brutal World War I and World War II, competition has become the main focus of people’s thinking. Moreover, Bentham’s utilitarianism and Hobbes’s Law of the Jungle have become the philosophical basis of the competition concept. However, biologists find that there are altruistic behaviors in many species. Recent research also indicates that if there is no cooperation in ancient times, the anthropoid ape could not evolve into human beings, and human beings also could not reach the top of the food chain today. Consequently, it is very important to explore the origin and the evolution of cooperation of human beings.
Science has announced 125 most challenging scientific problems, among which “how cooperation in human beings evolves” is one of the two major problems in social science that will drive the basic scientific research and determine the future direction of scientific research in the next hundred years [
1]. Many researchers are interested in cooperation of human beings, including anthropologists, sociologists, economists, biologists, and neuroscientists. These researchers aim to reveal the contributing factors to human cooperation from different perspectives.
Classical economics explains altruistic behavior of human beings from ethics. David Hume argues that human’s morals and emotions come from sympathy [
2]. Adam Smith analyzed the origin of sympathy and its different manifestations. He used ethics as the basis of the origin of human cooperation and altruistic behavior. However, ethics itself is also a result of cooperative altruism. Based on the hypothesis of rational egoism, neoclassical economics derives a comprehensive theoretical system from self-interested individual behavior by deductive approach. Neoclassical economics argues that self-interested individual is willing to sacrifice short-term personal interests in order to benefit more in the future, which leads to cooperative altruistic behavior. The Reputation Theory is such an example. Nonetheless, the Cooperation Theory of neoclassical economics based on self-interests cannot explain the numerous instances of pure altruism, i.e., sacrifice without compensation. Based on the rational self-interested hypothesis, the subgame perfect Nash equilibrium leads to the conclusion that playing “prisoner’s dilemma” for finite times cannot engender cooperative behavior.
Fortunately, there are still biologists who rescue us from the moral dilemma of economics. Biological research finds that the altruistic behavior is one of the important factors to promote cooperation and improve competitiveness in the early stage of human beings. Biologists divide altruistic behavior into kin altruism, reciprocal altruism, and pure altruism. Reciprocal altruism, which can be divided into direct reciprocal altruism and indirect reciprocal altruism, is a kind of non-kin altruism. Direct reciprocity refers to the mechanism in which the donor gets favor from the recipient, i.e., “you scratch my back and I will scratch yours,” while indirect reciprocity refers to the mechanism in which the donor does not obtain favor from the recipient but from an irrelevant third party who can at the same time harvest good reputation. Alexander proposed the idea of indirect reciprocity for the first time in 1987 [
3]. Many scholars believe that indirect reciprocity is the origin of human ethics, and the cooperation of human beings in the process of evolution is based on indirect reciprocity. The study of indirect reciprocity emerged from biology at first but has since been widely used in other disciplines, including sociology, economics and anthropology [
4,
5].
To help understand how cooperation in human beings evolves, in this paper, we build an economic game model with three types of players: the cooperator, the defector, and the discriminator. In each round of the game, the cooperator chooses cooperation and the defection chooses non-cooperation or defection. Starting with the second round, the discriminator chooses cooperation if his partner chooses cooperation in the last round and chooses defection if his partner chooses defection in the last round. Hence, the game is based on indirect reciprocity.
We model and analyze the human cooperation model based on indirect reciprocity in this paper partly because we aim to understand how we can cooperate more effectively to achieve environmental protection. It is well known that there have been a number of urgent environmental issues such as pollution and climate change. For example, of the 100 million metric tons of plastic produced every year, about 10 million end up in the oceans. In terms of electronic waste, about an average of 50 million tons is produced every year [
6]. A big reason for this to happen is that a significant proportion of people and organizations have low regard for environmental protection. Furthermore, when they choose to litter or at least not to recycle, they face insufficient penalties. Admittedly, with a current world population of about 7.3 billion, it might be impossible for everyone not to be a defector in the game of environmental protection [
7]. On the other hand, this research can shed a light on how the different types of players will evolve as far as environmental protection is concerned. In particular, we aim to provide argument for the importance for government and the society as a whole to provide incentives to people to take actions for environmental protection.
The rest of this paper is organized as follows. In
Section 2, we provide a concise literature review. We then set up our main model and analyze it in
Section 3. In
Section 4, we extend our main model and show that the results from our main model are robust. Finally, in
Section 4, we summarize our results and discuss their implications for environmental protection.
2. Literature Review
This paper focus on explaining human cooperative behavior using an economic game model based on indirect reciprocity and altruism behavior. In our review of the related literature, we first focus on research on kin-based altruism behavior, then on research on non-kin third-party altruism behavior. For an existing study to be included in our review, the study should have a focus on theoretical explanation of human altruism cooperation. Note that such a study may be from diverse research areas such as biology, neuroscience, and economics.
In nature, kin altruism is the most common type of altruism and is considered to be a kind of “innate preference” for the animals. Hamilton, a British biologist, is the first person who explained kin altruism behavior in biology utilizing game theory. However, he could not explain non-kin altruism in human society. Human beings are the only species in nature who have non-kin altruism behavior. What is the origin of non-kin altruism behavior and how does it evolve? Some scholars explain non-kin altruism behavior from biological evolution mechanisms. The famous Santa Fe School considers that the altruism behavior of early humans is the result of internalization of social norms, and the harsh natural environment forces humans to extend altruism from kin individuals to non-kin individuals in order to raise the species fitness [
8]. Biological evolution mechanism includes group selection theory and individual selection theory. The group selection theory posits that natural selection mechanism occurs on the level of biological species. If it can improve a species’ fitness, then altruism behavior can stay with the evolution of that species group. [
9] explains the altruism of human from the preference of prosocial other-regarding. They presume that when humans act, they consider not only the consequences for themselves (self-regarding) but also the consequences for others (other-regarding). The pressure of group selection impels human to have prosocial features. Neurophysiology finds that the rational selection of human beings is deeply affected by emotions such as sympathy and guilt. Taking the feelings of others into account, human beings will adjust their behavior. The individual selection theory holds that natural selection mechanisms only arises on the level of individuals. In other words, individuals only give consideration to their own interests in the process of evolution. In the famous book
The Selfish Gene, Richard Dawkins points out that all living things are selfish in essence. Even if there is cooperation between them, the only reason is for more benefit in the future. [
10] argues that the reason of altruism is the surplus obtained from cooperation. Even self-interested individuals will take altruistic actions in order to establish an efficient cooperative order and improve the competitive advantage of humans in the biological world.
There is strong reciprocal altruism among human non-kin altruistic behavior. Strong reciprocal altruism means that one takes action that is only beneficial for others but is costly for himself or herself. It is also called “altruistic punishment.” [
11] explains the reason of altruistic punishment from the angle of group selection. They believe that human can build up a social order of cooperation and punishment by copying and learning the behavior of the majority and other successful people. [
8] also points out that although the altruistic punishment will reduce the fitness of the punisher himself, other members of the group will benefit more if all of them comply with social norms and punish the violator. Some neuron economists think that there is a so-called “opium reward area” in human’s brain. When the area is stimulated, it will produce a self-incentive mechanism, which makes people act with strong reciprocal altruism.
Because of the cost of altruistic punishment, it is easy to lead to the “second-order free rider problem,” which can make the punishment mechanism fail [
12]. In order to solve the dilemma of the second-order free rider, [
13] proposes a coordination penalty model, which shows that if the number of individuals in a population conforming to social norms increases, and the cost of altruistic punishment decreases, then altruistic punishment behavior will be common in the population. [
14] also argue that plenty of altruistic punishment behavior can overcome the problem of second-order free rider in the process of human evolution, and hence achieve human cooperation. [
15] demonstrates by a public goods game that if the reward of public goods reaches a certain degree, stable punishment can be a permanent threat to defection and can effectively maintain the order of human cooperation. He also holds that social justice is the prerequisite to solve the social dilemma, and the sense of justice in human nature is the outcome of internalization of social norms.
Reciprocal altruism is another non-kin altruism, including direct reciprocal altruism and indirect reciprocal altruism. Direct reciprocity can explain the repeated cooperation game among non-kin players, but it cannot explain third-party reciprocal cooperation, i.e., indirect reciprocal cooperation. [
16] challenges the traditional direct reciprocity model based on prisoner’s dilemma by setting up the first indirect reciprocity model. They incorporate reputation into their model and analyze its evolutionarily stable condition. [
17] proposes that the moral signal in human social networks can boost indirect reciprocal cooperation. Some researchers investigated the stable evolutionary result of the standing strategy under the condition of indirect reciprocity [
18]. [
19] compares the evolutionary equilibria of three games: the prisoner dilemma, the stag hunt game, and the snowdrift game. They find that under the condition of indirect reciprocity, the influences of ethics, reputation, and other mechanisms on final evolutionary equilibrium can be very complicated depending on the initial game conditions. However, [
20] finds that there is a need of a small number of defectors in a group to encourage other group members to punish the second-order free riders. This can promote cooperative behavior within the group. Therefore, the existence of bad actors is also valuable for the evolution of human, just as the moderate existence of native predators can eventually raise the fitness of a prey species.
In this paper, we set up and analyze a game of indirect reciprocity which includes three types of actors: cooperator, discriminator, and defector. In the game, the discriminator will punish the defector. We demonstrate that the game does not have a stable evolution equilibrium, which proves that there is also the “one-third law” in the evolution of indirect reciprocity. That is, in the process of human evolution, not only cooperators and discriminators but also defectors are needed. In order to further verify the effect of altruistic punishment, we extend our main model so that in the extended model, players can observe each other’s choices of the last two rounds. In this extended model, our analysis shows that the proportion of players who will choose cooperation increases starting from the third round. However, the payoffs of the cooperators and the defectors stay the same, while the payoff of the discriminators may or may not increase.
3. Main Model and Analysis
Suppose there are three types of players in a game, including the cooperator, the defector and the discriminator. In each round, a player’s choice can be observed by others, but the player can only remember the choice from the last round. The initial percentages of cooperator, defector, and discriminator in the population are
,
, and
, respectively, and it is common information. In every round, the cooperator chooses cooperation, and the defector chooses non-cooperation. The choice of discriminator depends on the choice of his partner in the last round. If the partner’s choice is cooperation in last round, then the discriminator chooses to cooperate in this round. If the partner’s choice is defection in the last round, the discriminator also chooses to defect in this round. If two players both choose to cooperate, they will obtain benefit
and incur cost
, where
. If one chooses to cooperate and the other chooses to defect, the cooperator’s benefit is
and cost is
, and the defector’s benefit also is
but cost is 0, where
. We assume that
is in proportion of
, i.e.,
. If two players choose noncooperation, their benefits and costs are zero. See
Table 1 for a summary of the potential payoffs.
Before our detailed analysis, we summarize the basic steps in our analysis and the similarities and differences between our model and existing models reviewed in the previous section. As for the basic steps of our analysis, we first derive the total payoffs of the cooperator, the defector, and the discriminator. Based on these payoffs, we will obtain the dynamic changes of the proportions of these types of player. Then we investigate if there is a stable equilibrium point in this economic game. As can be seen in our following analysis, consistent with existing models reviewed in the literature, our model assume that a play can only observe another player’s choices of two rounds: the current round and the previous round. However, different from existing studies, our model explicitly obtains the dynamics of the proportions of the three player types and find that in the long term, there is no stable equilibrium point in this game.
A player only can observe his partner’s last round choice. In the first round, there is no information about the game, so it assumes that the proportion of discriminator who chooses cooperation in the first round is . From the second round, the discriminator chooses cooperation if his partner chooses cooperation in the last round and chooses defection if his partner chooses defection in the last round. Hence, it is a downstream indirect reciprocity. Note that upstream indirect reciprocity and downstream indirect reciprocity are defined as follow. Supposing there are three players: A, B, and C. Upstream indirect reciprocity means if A helps B first, then B will help C next. Downstream indirect reciprocity means if A helps B first, then C will help B after observing B’s behavior.
We use
to denote the proportion of players who will choose cooperation in the
nth round. Then we have:
and
. Furthermore, we can obtain the following equation:
In the first round, for the cooperator, the chance that his/her opponent cooperates is
, and the expected revenue is
. The chance that his/her opponent defects is 1 −
, and the expected revenue is
. Hence, the cooperator’s expected revenue is
where we use the relationship
in the derivation. Similarly, we can obtain that the discriminator’s revenue is
and the defector’s revenue is
. In the
nth round
, the cooperator’s revenue is
, and the defector’s revenue is
. The proportion of the discriminator who chose cooperation in the last round is
, and his revenue in this round is
. The proportion of the discriminator who chooses noncooperation in the last round is
, and his revenue in this round is
. Therefore, the discriminator’s expected revenue in this round is
. According to Equation (1), the discriminator’s expected revenue can be simplified as
. At the end of the
nth round, the total payoff of the cooperator, the defector and the discriminator will be as follows:
where
Let
,
,
, then
The mean payoff of the three players is
. Substituting
,
,
into the equations, we have the following:
The dynamics of the cooperator, the defector and the discriminator are as follows:
indicates the dynamic change of the proportions of the three player types as the game evolves. Solving the equations of
and
, we can obtain the values of
and
. According to
, we can then calculate
. In this way, we can calculate the stable equilibrium points. Because the equations are nonlinear and analytically not solvable, we use numerical simulation with Matlab to obtain
,
, and
.
In our simulations, we use the following values for the parameters:
,
,
,
, and
. Employing Matlab programming, we obtain two solutions of
and
. However, because of the requirements of
and
, there is no real solution in the interior of the definition domain, which means there is no stable equilibrium point. In other words, the dynamic system is chaotic. The conclusion conforms to the “one-third law,” which means that if there are three strategies in the evolution, those three strategies will be dominant in turn [
21]. Bowles, S. et al also finds that there is no stable evolutionary equilibrium if a group consists of only cooperators [
8]. A group will be extinct if the group consists of only defectors. However, a group that includes some discriminators can survive in the competition of evolution. Perhaps, as Kauffman stated, life exists on the edge of chaos. In the same logic, evolutionary equilibrium based on indirect reciprocity exists on the edge of chaos.
In the absence of the cooperator, we have
,
, and
. And the stable equilibrium point is
. Using the following relationship,
we can find
. When there is only defectors and discriminators in the game, there is no stable equilibrium point in the interior of definition domain, i.e.,
. However, the equilibrium point will move to
on the edge line of
. The equilibrium point tends to move from
to
(as in
in
Figure 1). Under the condition of no cooperator, the defector cannot benefit from the cooperator and will be punished by the discriminator, so the defector will gradually disappear, and the discriminator will win.
In the absence of defector, we have
,
, and
. Thus, the stable equilibrium point is
. According to the equation
we can obtain the real roots of
as −0.92 and 1.03. As
, we can conclude that there is no stable equilibrium point in the interior of definition domain when there are only cooperators and discriminators in game. The equilibrium points will tend to either
or
. In other words, either the cooperator or the discriminator will win on the edge of
. If the cooperator wins, the equilibrium point tends from
to
(
). If the discriminator wins, the equilibrium point tends from
to
(
). When there is no defector, the discriminator does not need to punish the defector and their strategies are consistent. Hence, the points on the edge of
all can be the evolutionary equilibrium.
In the absence of discriminator, , , the stable equilibrium point is . If , we can find the solution , and . Hence, there is no winner and the stable equilibrium point is . Using the parameter values stated before, the stable equilibrium point is , which means the defector and the cooperator are well-matched in strength. If , we can find the solution , and , which means the defector will win and the stable equilibrium point is . If , we can find the solution of and , which means the cooperator will win and the stable equilibrium point is . When there is no discriminator, the equilibrium of the dynamic evolutionary game depends on the value of . In the static game, we can find that if and , then mixed Nash equilibrium exists for the game and the dynamic evolutionary game’s equilibrium point is . If and , then , which means the importance of cooperation diminishes. The static game’s Nash equilibrium is (defection, defection) and the dynamic evolutionary game’s equilibrium point is . If and , then , which means the cooperation is quite important. The static game’s Nash equilibrium is (cooperation, cooperation) and the dynamic evolutionary game’s equilibrium point is .