1. Introduction
A sanction (from Latin “sanction”—strictest decree) is a measure of influence applied to an economic agent (an individual or a legal entity, industry, or the state) and entailing certain consequences. Sanctions may be of an economic, legal, or social nature. In terms of form, a sanction may be expressed as a prohibition, restriction of operations, fines, and more. Depending on the nature of violated rights, it is customary to distinguish such legal sanctions as criminal, administrative, property, and international legal sanctions. At the same time, according to the consequences’ nature, legal norms’ sanctions can be both negative and positive. The first one implies the application of penalties, and the second one—actions of encouragement. Game Theory is actively used as a tool for modeling socio-economic processes in general and sanctions processes in particular, so, for example, in the research [
1] based on a game-theoretic approach, the concept of a model of criminal sanctions and a model of criminal punishment system, corresponding to fundamental requirements of game theory, taking into account Nash’s equilibrium is offered. The author concludes that “the game-theoretic approach to the formation of both individual criminal law sanctions and the system of criminal law sanctions is quite applicable and complements the formal legal logic of construction of criminal law sanctions and their system, allows to avoid defects in the construction of criminal law sanctions and helps to eliminate legal uncertainty in sentencing”.
It is well known that social sanctions are commonly understood as measures of encouragement and punishment that stimulate an individual to comply with social norms. Sanctions can be formal (medals, diplomas, scholarships, fines, and others) and informal (praise, ridicule, boycott, and others), positive and negative. The research on social sanctions is beyond the scope of this work. Further, we will consider economic sanctions in detail, as much as possible, within the framework of this section.
The first economic sanctions recorded in written sources were imposed by the Athenian Maritime Union (Delos Symmachus) on the city of Megara (part of the Peloponnesian Union) in 432 BC in order to stop the practice of receiving runaway Athenian enslaved people in that city and plowing the sacred border territories. They are known as “Megarian psephism”. The effectiveness of the sanctions was not obvious. On the one hand, the Megarian merchants suffered significant losses, but on the other hand, they were forced to turn to their allies (primarily Sparta) for military support. As a result of the Peloponnesian War, Athens suffered a crushing defeat, and the Athenian alliance was destroyed. Studying the above fact and using it as an example, G. Tsebelis, in his article “Are Sanctions Effective? A Game-Theoretic Analysis” [
2], reasoned that “Although economic sanctions have been quite frequent in the twentieth century, a close examination of the low success rate (33 out of 83 cases) indicates that sender countries are unable to select the appropriate cases. Moreover, analysts sometimes offer contradictory advice for such selection”. This research provides a game-theoretic explanation of these phenomena. Six different game-theoretic scenarios lead to the same equilibrium outcome. It is a mixed strategy equilibrium. The success ratio is the outcome of the selection of mixed strategies by sender and receiver countries. Under a wide range of (specified) circumstances, the size of the sanction has no impact on the behavior of the target country. Finally, some empirical implications of the game-theoretic analysis are compared to existing empirical generalizations, and further implications for empirical research are discussed.
In the Middle Ages in Europe, economic sanctions were primarily local and short-lived because of the constantly changing configuration of trade and military alliances and the changing interests of individual rulers and influential individuals. In the 19th century, the primary tool of economic sanctions was naval blockades—measures to prevent a country’s maritime trade with other countries without a declaration of war. Between 1827 (the first known naval blockade) and 1914, 21 blockades were recorded against Turkey, Portugal, the Netherlands, Colombia, Panama, Mexico, Argentina, and El Salvador. The organizers of the blockades were mainly Great Britain (12 times) and France (11 times), but also Italy and Germany (three times each), Russia and Austria (twice each), and Chile [
3].
Economic sanctions became widespread in the twentieth century with the development of international trade relations. Before World War II, Yugoslavia, Greece, Bolivia, Paraguay, and Italy were subject to collective economic sanctions. During the Cold War, sanctions were largely ineffective because they were not supported by either the Western or Eastern blocs of countries (with the United States and the Soviet Union as leaders, respectively). Unified sanctions, supported by both blocs, were imposed only twice: on Rhodesia and South Africa, and according to international relations experts, in both cases, were not effective enough.
One of the best-known examples of collective economic sanctions at the time was the restriction of deliveries of “strategic” goods and technologies, primarily military and computer technology, to socialist countries. The Coordination Committee (CoCom) was created in 1949 specifically to control exports to the Eastern Bloc countries, which included 17 states, while another six countries cooperated with the committee without formally being part of it. The committee ceased its activities in 1994. The most famous example of long-term unilateral sanctions is the U.S. embargo against Cuba, which began in 1960–1962 and continues today. U.S. companies are prohibited from any economic contact with Cuba without special permission, including in third countries. According to Cuban authorities, direct damage from the embargo amounted to about USD 1 trillion in current prices. Nevertheless, the goal of U.S. economic sanctions—establishing democracy in Cuba—has not been achieved. Since 1990, the UN has made greater use of international economic sanctions against various states. They have been subjected to Iraq (since 1990), Yugoslavia (1991–2001), Somalia (since 1992), Libya (1992–2003), Liberia (since 1992), Angola (1993–2002), Haiti (1993–1994), Rwanda (1994–2008), Sierra Leone (since 1997), Afghanistan (since 1999), Eritrea and Ethiopia (since 2000), DR Congo (since 2003), Cǒte d’Ivoire (since 2004), Sudan (since 2004), Lebanon (since 2005), Iran (since 2006), and DPRK (since 2006). The sanctions are mostly partial and restrict weapons and military equipment supplies to these countries. In some cases, foreign assets are frozen [
4,
5].
Global economic sanctions have been repeatedly imposed on several states. Sanctions, whether international or unilateral, currently apply to 24 states worldwide. However, experience has shown that states subject to sanctions have almost always found ways to minimize their damage or use them to their advantage [
6].
Many publications are devoted to economic sanctions and the application of the game-theoretic apparatus in their study. Let us note the research of Marc V. Simon “When sanctions can work: Economic sanctions and the theory of moves” [
7]; Shidiqi, Khalifany Ash and Pradiptyo, Rimawan “A Game Theoretical Analysis of Economic Sanctions” [
8]; Karimi, Mohammad and Maleki, Abbas and Haieri Yazdi, Asieh “How the Possibility of a Fight-Back Strategy Affects the Consequences of a Sanction’s Regime” [
9]; Onder, Mehmet “The Impact of Decision-Makers on Economic Sanctions: A Game Theoretical Perspective” [
10].
In contrast to other research, this article offers an economical and legal substantiation of theoretical and game constructions modeling the process of application of sanctions and countersanctions, using in general terms the idea of systemic balance of three macrosystems: economic, legal, and social [
11]. In a particular case, the economic-legal substantiation of the proposed theoretical-game model of the balance of sanctions and countersanctions is based on the use of the legal concept of sanctions as a component of the definition of legal responsibility of subjects, i.e., in practical terms, the regulator implements the principle of inevitability of legal responsibility, in particular, sanctions for offense and crime (can be a departure from the established in the legal prescriptions rules, rules of social and economic relations), which is manifested in the application in the description of real socio-economic processes, various concepts of static and active equilibriums.
The current national market system is based on the neoliberal economic doctrine. In differentiated forms, it covers all public relations spheres and appears in decision processes at all complex control and controlled systems levels. Various concepts of static [
12] and active equilibria [
13] are adopted to balance controlled systems [
14,
15] in game-theoretic economic–mathematical modeling. If an analytically constructed differential game describes decision-making in a complex system, then, according to leading researchers, equilibrium as an acceptable solution of a differential game should have the property of stability [
16,
17,
18]. In a practical interpretation, stability means no player will increase their payoff by any unilateral deviation from equilibrium.
Within the neoliberal economic doctrine, the well-known solution proposed by J. Nash [
19,
20,
21,
22] meets this requirement in many situations. (In 1994, J. Nash, J. Harsanyi and R. Selten were awarded the Nobel Prize in Economic Sciences “for the pioneering analysis of equilibria in the theory of non-cooperative games”). Note that this equilibrium does not always exist under certain conditions and (or) has several negative properties. For example, Nash equilibria can be internally and externally unstable. The mathematical problem of ensuring the stability of equilibria can be solved using active equilibria, e.g., the classical equilibrium in threats and counterthreats or the equilibrium in objections and counter-objections [
23,
24], simultaneously with requiring efficiency (Pareto maximality) [
25,
26,
27].
The economic and legal justification of game-theoretic models generally uses the idea of system balancedness: complex controlled systems correlate through interaction, and balancedness means that the legal order of public relations corresponds to the laws and trends of economic development. In a particular case, the economic and legal justification of the equilibrium in sanctions and countersanctions is based on the legal concept of sanctions defining the legal responsibility of subjects (agents). In practical terms, the adjuster implements the principle of inevitable legal liability (particularly sanctions) for an offense and crime (any deviation from the behavioral rules established by legal regulations, e.g., any deviation from the equilibrium mentioned above).
3. Results
To construct a game-theoretic model of equilibrium in sanctions and countersanctions, we consider a noncooperative linear-quadratic differential
N-player game in normal form described by an ordered quadruple
In the game
, the set of players (e.g., market participants) is
, where
. The dynamics of the controlled system
(the interacting subjects of market activity, also called agents) obey the vector linear differential equation
with the following notations:
is the
n-dimensional state vector of the system
is a finite time interval of the game with a fixed terminal time instant
is the control action of player
i ;
is a pair determining a current position in the game
; finally,
as an initial position, where
. The matrix
of dimensions
is assumed to have continuous elements on
which is denoted by
.
The strategy
of player
i will be identified with an
n-dimensional vector function
, and this fact will be denoted by
. Then, the strategy set of player
i can be written as
Thus, player i chooses their (strategy by specifying a matrix of dimensions from the space .
The game evolves over time under market competition in the following way. Without forming coalitions with other players, each player
i chooses a particular strategy
, which yields
a strategy profile of the game. Next, each player finds the solution
of System (1) with
i.e.,
The system of linear homogeneous differential Equation (2) with continuous coefficients on
has a continuous solution
that is extendable to
. Then, each player constructs the realization of their strategy
and the corresponding realization of the strategy profile
, which consists of the
N continuous
n-dimensional vectors
on
. The payoff function of player
i is a quadratic functional
defined on the continuous pairs
,
. Without loss of generality, let the constants matrices
and
of dimensions
be symmetric. The prime denotes transposition:
is a row vector. The value of the functional (3) is called the payoff of player
i. The neoliberal economic doctrine assumes that each player in the game
seeks to maximize their payoff only.
This research aims to find a rather general class of noncooperative linear-quadratic differential
N-player games in normal form
that have no Nash equilibrium but simultaneously have an equilibrium in sanctions and countersanctions. To this effect, we will associate with the game
the
N-criteria dynamic choice problem
Here, the controlled dynamic system
coincides with (1); the set of alternatives
coincides with the set of strategy profiles
of the game
; the
N criteria
are given by (3). The DM’s goal in the problem
is to choose an alternative
for which the
N criteria (3) will take the maximum possible values. V. Pareto proposed a conventional approach to such problems in 1909; see [
38,
39].
Note two results, which are immediate from Definition 1.
Definition 1. An alternative is said to be Pareto-maximal in the problem if and the system of inequalitieswith at least one strict inequality, is inconsistent. In this case, the vector is called a Pareto maximum in the problem Note two results, which are immediate from Definition 1.
Property 1. for at least one number and .
Property 2. If the conditionholds for constants , then the alternative is Pareto-maximal in the problem Here, indicates the bracketed expression from (4) with replaced by Consider two concepts of equilibrium for the game , where
Definition 2. A pair is called a Nash equilibrium of the game iffor any ( denotes a zero vector of dimension n). Now we construct the equilibrium in sanctions and countersanctions.
Let
be some fixed strategy profile of the game
Player 1 is said to impose a sanction to the strategy profile
if there exists their strategy
such that
An existing sanction is not necessarily implemented: it means the threat of coercion. Recall that the role of a sanction is revealed through the legal responsibility of players: sanctions make them refrain from violating the established game rules and are implemented in case of “frustration”. In terms of game theory, implementing the sanction is beneficial to Player 1: according to (5), their individual payoff will increase compared to the previous strategy profile
The complex of punitive measures taken by one party against the other in response to sanctions is manifested in countersanctions. Player 2 is said to impose
an incomplete countersanction to a sanction
of Player 1 if there exists a strategy
such that
Player 2 is said to impose
a complete countersanction to
if there exists a strategy
such that Inequality (6) is satisfied simultaneously with
Incomplete and complete countersanctions of other players to a sanction are formalized by analogy.
In the presence of an incomplete countersanction, Player 2 can choose their strategy for making the payoff of Player 1 (who imposes an original sanction) equal to a value not exceeding their original payoff in the strategy profile ; see (6). (Note that he (it) may even reduce the payoff of Player 1!) Therefore, the presence of an incomplete countersanction negates the implementation of a sanction. In addition, a complete countersanction motivates Player 2 to choose because their payoff in the resulting strategy prole yielded by implementing the sanction and countersanction will increase compared to the strategy prole yielded by implementing the sanction A sanction of player i to a strategy profile and a (complete) countersanction of one of the other players are defined by analogy.
If at least one of the other players has a countersanction to each sanction imposed by any player to , then it makes no sense for him to implement the sanction: due to the countersanction of another player, their payoff will not increase (but it may even decrease!).
Definition 3. A strategy profile is called an active equilibrium [
13]
of the game if, for any initial position : - 1.
The alternative is Pareto-maximal in the N-criteria dynamic choice problem .
- 2.
At least one of the other players has an incomplete countersanction to each sanction of any player.
Definition 4. A pair is called an equilibrium in sanctions and countersanctions in the differential N-player game if, for any initial position :
- 1.
The alternative is Pareto-maximal in the N-criteria dynamic choice problem .
- 2.
At least one of the other players has an complete countersanction to each sanction of any player.
As before,
and
From Definitions 3 and 4, it follows that any equilibrium in sanctions and countersanctions is simultaneously an active equilibrium. Active equilibria and equilibria in sanctions and countersanctions are based on threats and counterthreats, well known in game theory [
24]. They have all the positive properties of Nash equilibria [
18]. More specifically:
They are stable against the deviations of an individual player.
They satisfy individual rationality.
They coincide with the saddle point in the case of zero-sum two-player games.
At the same time, these equilibria are free from the following disadvantages of Nash equilibrium [
18]:
They exist in several cases when there is no Nash equilibrium (e.g., in the game ).
Unlike Nash equilibrium, they are unimprovable and internally stable due to Pareto maximality.
The presence of a Nash equilibrium in the game implies the existence of certain types of unimprovable equilibria in which the payoffs of all players are no smaller than in the Nash equilibrium.
The best Nash equilibria (in the sense of Pareto maximality) are equilibria in sanctions and countersanctions.
Let us emphasize again: the requirement of efficiency (Pareto maximality) has been incorporated into Definitions 3 and 4 to eliminate some negative properties of Nash equilibrium, such as the internal and external instability of the set of Nash equilibria.
N.N. Krasovskii [
40] formalized the concepts of players’ strategies and the motions of a dynamic system induced by them for a two-player zero-sum positional differential game. The constructions underlying the positive properties above are valid for a more general class––noncooperative positional differential games [
41].
Note that under economic sanctions [
42,
43], the methodology for constructing active equilibria, particularly the concept of equilibrium in sanctions and countersanctions, is of utmost importance for economic–mathematical modeling of decision processes and applications.
Hereinafter, the notation means that a quadratic form is negative definite (positive definite, respectively).
Consider the auxiliary
N-criteria static problem
in which the DM chooses an alternative
for simultaneously maximizing all components of a vector criterion
For this problem, Definition 1 can be reformulated as follows: an alternative is Pareto-maximal in if the system of inequalities , with at least one strict inequality, is inconsistent.
We present some auxiliary properties of the quadratic forms with constant coefficients (the elements of a matrix of dimensions ) and the components of an n-dimensional vector
Lemma 1. With the change of coefficients , a quadratic form is reduced to the form , where the matrix of dimensions is symmetric, i.e., .
Without loss of generality, all quadratic forms below are supposed to have symmetric matrices.
Lemma 2. If , then all n roots of the characteristic equation are real and positive , where denotes an identity matrix of dimensions [44]. Let
be the greatest root under consideration. Then
Since , for all n roots of the characteristic equation are negative
Let
be the greatest (smallest by magnitude) root among them. By analogy with (9), we have
Without loss of generality, consider the N-criteria choice problem in which a sanction is imposed by Player 1 and a countersanction by Player 2. (The players are numbered subjectively).
The next result follows from Property 2.
Lemma 3. Assume that in the problem :
1. The symmetric matrices
of dimensions satisfy the inequalities
and 2. The nonzero matrixof dimensions is singular, i.e., its determinant is Then, the system of strict homogeneous linear inequalitieshas a positive solution where and are the greatest roots of the characteristic equations and respectively Proof. To construct a Pareto-maximal strategy profile in the game
, we will apply Property 2 and the linear convolution
of the criteria (8) with positive coefficients
. For (8), we construct a quadratic form from the components of the
Nn-dimensional vector
The numbers
can be chosen so that all
will become negative and
To prove this fact, we associate with (12) the system of strict homogeneous linear inequalities
This system is obtained from (12) by discarding all negative terms, except for
in the first inequality and
in the second one. Adding negative terms only strengthens strict inequalities. Hence, for
and
, any positive solutions of System (14) are also positive for (12). Then, the quadratic form
will be negative definite with respect to the
Nn-dimensional vector
if System (12) (or System (14)) has the positive solution
, that is, all numbers. Let us establish this result for System (14). To this end, we find
from the first two strict inequalities of (14):
If
, then for
and
, there exists a positive number
such that
This inequality holds under
, e.g., for
From the third inequality of (14), for
and
(15), we obtain
e.g., by letting
Continuing the considerations above, using the subsequent inequalities of (14) and the calculated values
we finally arrive at the recurrent formula
□
Remark 1. By analogy with Lemma 3, we have the following result. Assume that in the problem , the symmetric matrices of dimensions and positive numbers and are such that for and
Then, forand given by Lemma 3, the quadratic formbecomes negative definite; the constants are given by (13). Really, and in this expression satisfy the strict inequalities (12).
Note that in addition to the solutions (Lemma 3 and Remark 1), the system of strict inequalities (12) has a continuum of other solutions. As demonstrated below, each of them induces a specific equilibrium in sanctions and countersanctions of the differential game under the conditions .
Remark 2. Positive solutions of (12) can be found using S.N. Chernikov’s approach [35]. To avoid cumbersome transformations and notations dictated by this approach, we propose an original method for proving Lemma 3. Lemma 4. The solutions of the system where satisfy the nontrivial propertyhere denotes a zero vector from the space .
Proof. Assume on the contrary that such that . In other words, at the time instant , two different solutions of the system are passing through the position the trivial one and the nontrivial one induced by the nonzero initial condition . This obviously contradicts the existence and uniqueness theorem for the solution of a matrix linear differential equation with continuous coefficients. □
Proposition 1. Assume that in the differential game , Then, a Pareto-maximal alternative in the N-criteria choice problem has the formwhereis a continuous symmetric matrix of dimensions on the time interval ; the negative constants are given by (13);and the other numbers are calculated by the recurrent formulaswhere is the greatest root of the characteristic equation and is the greatest root of the characteristic equation denotes an identity matrix of dimensions ; finally, means the fundamental matrix of the system Proof. We construct a Pareto-maximal alternative
using Lemma 3 (formula (4)) and dynamic programming [
16]. Due to Property 2, the application of dynamic programming reduces to two stages as follows.
First stage. For the problem
, find
positive numbers
a continuously differentiable scalar function
and
Nn-dimensional vector functions
such that
Then, using the scalar function
find
Nn-dimensional vector functions
from
for any
and
. The functions
in (22) exist under the following sufficient conditions: for all
where (as before)
denotes an
n-dimensional zero vector from the space
and
by Lemma 3.
From (23), it follows that
The second stage. Find the solution
, of the partial differential equation
with the boundary condition
where
. In other words, for all
and all
Consequently, the symmetric matrix
of dimensions
satisfies the Riccati matrix differential equation
where
denotes a zero matrix of dimensions
.
As is well known [
16], the solution
of the Riccati matrix differential equation has the form (18). (Here, the implication
has been taken into account). Formula (18), in combination with another implication
finally yields (17). Thus, a Pareto-maximal alternative
in the multicriteria choice problem
is given by (17) and (18).
Now we construct the Pareto-maximal payoffs
using dynamic programming and [
44]. □
Proposition 2. Let Conditions (16) of Proposition 1 be valid. Assume that for the differential game , there are N scalar continuously differentiable functions such that:
- 1.
- 2.
The system of N partial differential equations
has the solution
Then, for any initial position In (25),are continuous matrices of dimensions ; the matrix is given by (17) and (18); the numbers are given by (13);are symmetric matrices of dimensions ; finally, denotes the fundamental matrix of the homogeneous system Proof. We construct the
N scalar functions
where
are the
n-dimensional vector functions given by (17) and (18).
Let us find the solution
of the system of
N partial differential equations
as the quadratic form
We will establish two facts as follows.
First, the solution of Systems (27) and (28) has the property
where the strategy profile
has the form (17) and (18). Really, if
is a strategy profile from (12)–(14), then by (27) and (28), the solution
of the system
for
will be
Integrating both sides of this identity from
to
subject to the boundary conditions (28) yields
Hence, Property (29) is proved.
Second, the solution of System (29) has the form
where a symmetric matrix
of dimensions
is given by (26). Really, substituting
into (28) leads to (29) if
is the solution of the matrix linear inhomogeneous differential equation
A direct substitution of (26) into equation (29) shows that this symmetric matrix of dimensions is the desired solution. The proof of Proposition 2 is complete.
Remark 3. Propositions 1 and 2 considered together finally yield the following explicit form of the Pareto-maximal solution of the game .
Assume that in the differential game :
- 1.
The symmetric constant matrices and of dimensions are such that - 2.
Then, for all whereare symmetric matrices of dimensions ; the matrices and of dimensions are the fundamental matrices of the systems
and
, respectively;the numbers and are given by (20); the negative numbers are given by (13); and are the greatest roots of the characteristic equations and , respectively Next, let us proceed to the propositions that
These items (1) and (2) involve the definiteness of quadratic forms figuring in the integral terms of the payoff functions (3). From this point onwards, assume that Conditions (11) are satisfied. Hence, there exists a Pareto-maximal alternative
in the
N-criteria choice problem
.
Lemma 5. Let the payoff function (3) be such that . Then, for a Pareto-maximal strategy profile alternative in the game there exists a constant such that, for all and the strategy of Player 1, the inequalitywill hold for any initial positions .
Proof. According to Proposition 2, there exists a Bellman function
such that
where the symmetric matrix
of dimensions
is continuous on
and has the form (26)
.
Consider the strategy
of Player 1, in which the numerical parameter
will be determined below. Due to the symmetry of the matrix
and
,
where
denotes the Euclidean norm, and
is the smallest root of the characteristic equation
; see [
45].
We take the symmetric matrix
of dimensions
from (18) and the strategies
of players
, respectively, from (17). We introduce the scalar function
The matrix
in curly brackets is symmetric and has the form
where
is a symmetric and continuous matrix of dimensions
.
The elements of the matrices
and
are continuous on
and hence uniformly bounded on the compact set
. The factor
enters only the diagonal elements of the matrix
. Recall that
is the
smallest root of the characteristic equation
, where
denotes an identity matrix of dimensions
. Therefore, the constant
can be chosen sufficiently great for making all principal minors of the matrix
positive
. (This fact will be proved below). According to Lemma 2 and [
46], the quadratic form
is positive definite for all
and all constants
Now we show the existence of a constant
such that, for all
the quadratic form
is positive definite for all
and
Note that the matrix
of dimensions
is symmetric. By Sylvester’s criterion, the quadratic form
is positive definite if all principal minors
of the matrix
are positive. The minors
are located in the first
r rows and first
r columns of the matrix
:
They must be positive
and
Expanding the determinants
and rearranging the terms in the descending order of the power of the parameter
, we obtain
where
(constant), and the other coefficients
are continuous on the compact set
, hence being uniformly bounded. This uniform boundedness guarantees the existence of
such that
Let us demonstrate that if
then
In other words, for a sufficiently great value
, the sign of the polynomial
is determined by the sign of its leading term. Really,
Replacing
in this inequality by a greater value
yields
Thus
, and
, we have
Well, for a sufficiently great value , the sign of the polynomial is determined by the sign of its leading term. Finally, for each , we calculate and let
Then, for
, it follows that
We denote by
,
, the solution of the vector differential equation
Due to Lemma 2, the implication
and (32),
Integrating both sides of (32) from
to
subject to the boundary condition
and
gives:
This result, in combination with the equality , finally proves Lemma 5. □
Remark 4. Consider the inner optimization problem in the game : for fixed strategies of players , respectively, and for any , find subject to System (1). Lemma 5 claims that for and , this problem maximization problem has no solution. Really, whatever strategy is chosen by Player 1, there always exists another strategy of this player such that When choosing an appropriate solution of the game , this result allows directly eliminating those concepts of equilibrium that involve the maximization of the payoff function of Player 1 with respect to (For example, if , the concept of Nash equilibrium should not be used as the solution of the game ).
Thus, under Conditions (11), the differential game has no Nash equilibrium. At the same time, the strategy implements the sanction of Player 1 to the Pareto-maximal (efficient) strategy profile ; see (5). In the lemmas below, the initial position is fixed and coincides with the one from Lemma 5; in addition, the sanction strategy of Player 1 has a constant scalar . Recall that Conditions (11) are assumed to hold without special mention. Well, Lemma 5 establishes the following result.
Proposition 3. Assume that in the game , at least one of the constant symmetric matrices of dimensions is positive definite. Then, this game has no Nash equilibrium, i.e., there does not exist a strategy satisfying Definition 2.
Note that:
- 1.
The condition with a fixed number breaks only the i th equality of Definition 2. This is enough for the absence of a Nash equilibrium in the game . If for all , then the N equalities of Definition 2 will be violated.
- 2.
The equivalence
is obvious. (Here,—D means that all elements of the matrix D are multiplied by –1). Then, Lemma 5 also implies the following.
Lemma 6. Let the payoff function (3) be such that . Then, there exists a constant such that, for all and the strategy of Player 2, In other words, the strategy implements in the game an incomplete countersanction to the sanction of Player 1.
Proof. With some obvious modifications, the proof is immediate from Lemma 5.
Let a Bellman function
, be constructed, satisfying
By analogy with Lemmas 5 and 6, we will establish another important result below. Recall that an initial position , a continuous matrix of dimensions , and an incomplete countersanction strategy figuring in Lemmas 5 and 6 are assumed to be fixed, and Conditions (11) are assumed to hold. □
Lemma 7. The condition implies the existence of a value such that, for all and the strategy , In other words, the strategy of Player 2 implements a complete countersanction, jointly with , to the sanction of Player 1 to .
The sanctions of another player and countersanctions of the other players are constructed similarly.
Theorem 1. Assume that the game satisfies Conditions (16). Then, the (N + 1)-tupleis an equilibrium in sanctions and countersanctions of differential game where:the constants are given by (13); where and and and are the smallest and greatest roots of the equations and , respectively; denotes the fundamental matrix of the system ; finally, the symmetric matrices are given by (26). Proof. The absence of a Nash equilibrium in the game
and the presence of a sanction
imposed by Player 1 to a Pareto-maximal alternative
in the
N-criteria choice problem
immediately follow from
; see Remark 4. The existence of a Pareto-maximal alternative and Pareto-maximal outcomes in
(including their explicit forms in this case) has been established by Propositions 2 and 3, respectively. The condition
allows constructing an incomplete countersanction
of Player 2 to the sanctions of Player 1 (Lemma 6). The condition
and Lemma 7 enable transforming the incomplete countersanction
of Player 2 into the complete one
. The requirement
simultaneously implies the absence of a Nash equilibrium
is not achieved
) and the ability of Player 2 to design analytically a sanction
to
in the game
:
The condition
and Lemma 7 guarantee the existence of an incomplete countersanction
of Player 1 to the sanction
of Player 2:
Finally, the Pareto maximality of
and Property 1 lead to
Due to
and Lemma 5, there exists a
such that
The countersanction to the sanctions imposed by players to is designed by analogy.
Thus, one of the other players in the game always has a complete countersanction to a sanction imposed by any player to the Pareto-maximal strategy profile . The proof of Theorem 1 is complete. □