1. Introduction
The games of inspection and corruption are popular topics in game theory, with numerous applications including the problems of illegal logging, the sales of products with substandard quality, tax evasion, and arms control, see, e.g., monographs [
1,
2,
3,
4] and reviews [
5,
6].
Let us mention specifically the papers, where these problems were considered from an evolutionary point of view. In [
7], very simple evolutionary stable strategies were applied to the setting of corruption games. In [
8], an evolutionary approach was suggested specifically in the setting of illegal logging, where it was developed in the classical evolutionary setting of pairwise games. Namely, it was assumed that arbitrary pairs of players can hire inspectors that can control their pairwise agreement. In [
9], the inspection games were developed specifically in the framework of tax payers stressing the dynamics of information spreading.
In [
10], a new generalized evolutionary approach to various classes of models was developed, the pressure and resistance framework, where a large number
N of small players is playing against a distinguished major player, or a principle,
I, who can influence the payoffs by a parameter
b. Small players are supposed to have a finite collection of strategies
so that the state of the system is given by a vector
, where
is a fraction of players adhering to the strategy
i. A strategy
i yields payoff
depending on a state
x and strategy
b of
I. The evolution of the distribution of strategies
x develops in the direction of a better response to
I by a certain Markov chain. It was shown in [
10] that, as
, (i) these Markov processes converge weakly to the deterministic process described by the kinetic equation (or generalized replicator dynamics RD):
and (ii) the rest points of this dynamics represent approximate Nash equilibria for the stationary game of
N players with payoff
. When payoffs
are quadratic in
x, this kinetic equation reduces to the standard replicator dynamics of the evolutionary game theory (see, e.g., [
11]), which were used in [
7,
8] for corruption games. When
, denoting by
and zero the payoffs of the first and second strategies, respectively (only the difference of payoffs is of interest for the dynamics, and thus fixing the second payoff as zero we do not lose generality), system (
1) clearly reduces to the single equation
where
x is the fraction of players adhering to the first strategy.
In [
12], this approach was developed for the mechanism design of governmental regulations in a setting of the two-level hierarchy, where a local inspector
I can be corrupted and is further controlled by the higher authority
A. The main tools for the mechanism design were the fine function for violations
(amount of fine for the value
r of illegal profit detected) and the fine function
for a corrupted inspector (amount of fine for the value
r of bribes detected). It was shown that if the fine functions (both for criminal businesses and corrupted inspectors) were proportional to the level of violations (which is a common practice in certain regulations), that is, if
, with some constants
f,
F, which have the meaning of nominal fines for a unit of illegal profit, then the stable rest points of the dynamics (or Nash equilibria of related games) support the maximal possible level of both corruption and violation. The situation changes if a convex fine is introduced, namely the power-type functions
,
, with
and positive constants
f,
F. In particular, it was shown that with quadratic fine functions one can effectively control the level of violations, which was referred to as the principle of quadratic fine. The main examples represent illegal logging, selling of substandard quality products, and tax evasion. The necessity to analyze a two-level hierarchy arises from the question ’Who will guard the guardians?’ of the Nobel Prize lecture of L. Hurwicz [
13].
In [
12], the levels of illegal profit
r of violators and the level of bribes
(fraction of illegal profit asked by a corrupted inspector as a bribe from a violator for not disclosing the violation) were considered as given exogenously, which was of course only a rough first step for the corresponding real-life modeling. In this paper, we aim at addressing the following crucial questions: (i) what levels of illegal profit
r of violators and what level of bribes
α of an inspector are feasible, that is, realizable in stable equilibria, and (ii) what
α can be optimal for corrupted inspectors that aim at maximizing their total payoff. We present a detailed analysis for the simplest situation only. We also suggest several reasonable extensions, which we aim to analyze in subsequent publications.
The mathematical results obtained in this paper are not complicated. We specifically avoided lengthy expressions whenever possible. Our aim was to give a concise and very clear presentation of the starting point for the program of research on two-level corruption–inspection evolutionary games leading to results, which have clear interpretation and qualitative description that can be explained to a layman. Let us give a brief qualitative description of the results obtained here. We show that, for linear fines, corruption and violation are profitable under upper bounds for the coefficients f, F, which are around the inverse powers of the fraction of agents investigated by the authority A. When these bounds hold, the feasible illegal profit r must be higher than a certain level (small theft is not effective), but has no upper bound (rob as much as possible). The possible profitable values of the bribe fraction α lie in a certain interval depending on the parameters of the model. The effective profit of corrupted inspectors turns out to be a convex function of α, and the maximal profit is attained on the unique value α. This value lies usually somewhere in the middle of the feasible interval (a corrupted inspector should not be too greedy when aiming at sustainable bribe levels). In the case of convex fines, we have similar upper bounds for the coefficients f, F yielding necessary conditions of corruption. However, when these conditions hold, possible illegal profit r of violators is bounded from above and below. For the quadratic fines , possible r lie in a reasonably thin layer around the inverses of nominal fine values F and f, which is quite intuitive. For , these bounds increase with decreasing γ and tend to ∞, as . Optimal values of α usually lie in the middle of the feasible interval, like in the case of linear fines.
Remark 1. Of course, quadratic (or other power-type) fines represent only mathematical idealization. In practice, they can be realized approximately by piecewise linear approximations.
The further content of the paper is as follows. In the next section, we describe our model, also improving the presentation and the assumptions of paper [
12] and indicating its certain shortcomings. The next two sections give our main results on feasible pairs
and optimal
α in the case of linear and convex fine functions. In
Section 5, we discuss several extensions of the simple model considered here. The main point is that, under the present assumptions, the domain of corruption is independent of the dynamic variable of the kinetic equations and we obtain just two separate equations: in the domain of corruption and otherwise. In more advanced modeling, the domain of corruption becomes dependent on the dynamic variable (can be an interval below a certain effective frontier of corruption
, or above it, depending on details of models and asymptotic behavior of the parameters) leading to a switching dynamics with discontinuous coefficients. Only for these switching cases does the principle of quadratic fine (as stated in [
12]) reveals its full meaning. Rough numerical illustrations are given in
Section 6. In
Section 7, some conclusions are drawn.
2. Main Model
We assume that there is a large number M of firms or agents, having two strategies: violate the law and refrain from violating. Violation leads to some illegal gain r.
Remark 2. In [12], we looked at a more complicated model with different levels of r given exogenously. Here, we are looking for levels r that are feasible in an equilibrium. We denote by v the fraction of violating agents so that is their total number, and the average level of violation is .
A local inspector
I is given a budget that is to be used to check the agents so that the violation of each violator is detected with some probability
p that may depend on
v. It is natural to assume that the budget for local crime detection must increase when the crime level increases. The simplest dependence of this kind is linear:
with some non-negative constants
and
β such that
, which we shall adopt.
If an inspector is honest (strategy H), then a detected violator is deprived of the illegal gain
r and is fined, the fine (paid to the government) being a fixed function
, referred to as the
fine function, which represents the main mechanism design of the government. It was suggested in [
12] to use power fine function
with some
and
. It was found that using
(and better even
) is essentially more effective than
, as it allows ultimately to limit the scales of illegal gains in an equilibrium. We shall make this observation more precise here. If an inspector is corrupted (strategy C), she just demands for herself the part
of the illegal gain, with some constant
.
Additionally, there is a central authority A, which aims at fighting corruption and illegal gains of violators by carrying out independent investigations. However, A invests much less budget in local inquiries, so that the probability of detection of a violation is given by some δ, which is smaller than .
Remark 3. One can interpret the “investigation power” p of inspectors in an intuitively similar way, denoting by p the fraction of all agents that are randomly chosen from M and thoroughly checked (with violation detected with probability 1). The same remark concerns the parameter δ. However, formally, this interpretation would lead to different distributions, one would have to work with hypergeometric distributions instead of much simpler binomial distributions that are sufficient under our present interpretation of the parameters .
Remark 4. In [12], it was assumed that A makes just one random check (corresponding to in our present notations), but the parameter p was misleadingly used with a different meaning in some places leading to a slight distortion of the model. If
A detects a violating agent that was not detected by the local inspector
I, then the agent just pays the fine
. If
A detects a violator that was previously detected by
I and paid the bribe
to
I, then the detected agent pays the prescribed fine
, but
I is also punished. This punishment is assumed to take the standard form used in models of corruption games (see [
5,
6]):
I has to return the illegal profit
and pays the fine, given by some prescribed function of
, the
fine function for corruption, which is the second major mechanism design of the government. This function can be the same as the fine function for usual violators but can be different. We shall assume that it has the same power type
with possibly another constant
F. Moreover,
I is supposed to have some standard salary
w, but if detected in corruption, she obtains a lower level job with some reserve salary
and without further possibility for corruption.
Therefore, for a violator, the probability to be detected by an inspector is
p, and the probability to be detected by the authority
A is
δ. These events are assumed independent. Therefore, the average payoff for a violator under an honest inspector equals
and under a corrupted inspector
For these payoffs, we can write down the generalized replicator dynamics or kinetic Equation (
2), which is quite intuitive: the fraction of violators increases when a violation is profitable. Namely, we have
under a corrupted inspector and as
under an honest inspector.
Hence, the candidates for (nontrivial) rest points of the dynamics are obtained by solving the equations
and
, respectively. They are equal
and
respectively. These rest points are seen to be always stable. We call them candidates for rest points, as they have to belong to the interval (0, 1) in order to actually be such rest points.
Remark 5. It is shown in [10] that the rest points of such kinetic equations represent an ϵ-Nash equilibria for the corresponding game of M players, independently of any dynamics. Let us look at the average payoffs of an inspector I. If she is honest, then it is just w. If she were corrupted, but there were no A, the average payoff would be .
With the presence of
A the additional (to
w) payoff to
I consists of three terms: possible change of
w to
, possible fine and not detected bribes. Thus, it is equal to
All these probabilities are obtained from the standard binomial distribution. The average number of bribes and the average number of detected bribes are
and
, respectively. The probability
of the corruption being detected (at least one detected bribe) equals
One may use the first-order approximation
to this probability. This is an upper bound for
, which is reasonable whenever
is much less than 1. In this approximation, the additional payoff of a corrupted agent equals
The corruption is profitable for inspectors
I if this additional payoff is positive, that is, inside the
corruption domain (that does not depend on
p and
v) given by the condition
It is seen directly that if
(proportional fine), then the domain is empty whenever
. And otherwise the corruption domain is given by the condition
yielding no upper bound for an illegal payoff. If
, the corruption domain is bounded for any other parameters!
3. Results for Linear Fines
For
the formulas above simplify essentially. We obtain
The key point of the linear fine is that these values do not depend on r. Consequently, whenever unbounded r are allowed in equilibrium, their average values or are also unbounded.
Theorem 1. The necessary conditions for corruption in a stable equilibrium are If they hold, the necessary and sufficient condition is written down as This interval of the values of α is not empty if Proof. For the existence of corruption in a stable equilibrium the domain of corruption must be nonempty and
must be positive for at least some
α yielding (
15). Moreover, the fraction
must belong to the domain of corruption and ensure that
implying (
16). Evidently, interval (
16) is not empty exactly when (
17) holds. □
If the necessary conditions (
15) for corruption are fulfilled (which holds for sufficiently small
δ), then the average level of illegal profit in equilibrium (per violating agent) is
, which is unbounded.
If
, then
, and therefore there is neither corruption nor illegal profit in equilibrium. If
, then there is no corruption, but illegal profit may exist in equilibrium whenever
in which case its average is
.
Let us look at the dependence of the additional payoff of inspectors
I on
α. Its behavior comes out of two effects: By (
10), this payoff in the equilibrium is given by the expression
The behavior of this payoff comes out as the result of two opposite effects: it is increasing in
α for every
v, but
is a decreasing function of
α.
Theorem 2. The maximum of payoff (18) on interval (16) is attained on the point , where Proof. Substituting the value of
yields
, and therefore
Payoff (
20) is a quadratic function of
with roots at the points
Hence, its maximum is attained at the point
, as claimed. □
This result shows that, in order to maximize their payoff, inspector
I must not aim (unless 1 is the upper bound for interval (
16)) at the highest bribe
α, but has to choose it reasonably in the middle.
4. Results for Quadratic and More General Convex Fines
In case
, Formulas (
7) and (
8) yield
and
The domain of corruption is given by the quadratic inequality
Hence, a necessary condition for the possibility of corruption is that the discriminant of the corresponding quadratic equation is positive:
If this holds, then (
23) is fulfilled whenever
with
the roots of the corresponding quadratic equation.
On the other hand, the condition
yields the necessary conditions
Theorem 3. The necessary conditions for the existence of corruption in a stable equilibrium are the inequalitiesand If conditions (27) and (28) hold, then the corruption in equilibrium is possible for r satisfyingand α lying in the interval Proof. Condition (
30) follows from (
25) and (
26). Condition (
29) is an evident necessary and sufficient condition for interval (
30) to be nonempty.
The second condition (
29) rewrites as
Numbers
r satisfying the last inequality exist whenever there exist real roots
of the corresponding quadratic equation, which in turn occurs whenever the discriminant is positive, that is, when (
28) holds. □
Remark 6. (i) In the theorem, we did not specify precisely the conditions for the set given by (29) to be nonempty. This is the condition for the interval given by the first condition of (29) and to have a nonempty intersection. Explicit formulas are not much revealing here. However, some simple sufficient conditions can be written. For instance, under a natural assumption , interval (29) is not empty and it contains the interval . (ii) Condition (29) is a bit cumbersome and not intuitive. The detailed analysis allows one to write down some simple sufficient conditions. For instance, it holds if either (punishment for bribes is not essentially lower than for just illegal profit), or The main point of Theorem 3 is that conditions (
27) and (
28) yield very restrictive bounds for the parameters of the model allowing for the corruption in a stable equilibrium. And even when they are fulfilled, the possible levels of illegal profit
r are bounded (unlike the case of linear fines) from above and from below. In fact, they belong to a reasonably thin layer around the values
and
. Roughly speaking, the illegal profit
r in equilibrium is proportional to the average fractions of the inverse of nominal fines (per unit of illegal profit) inspected by
A, which is quite intuitive.
The analog of Theorem 2 can be obtained, but it involves more complicated (and not much revealing) expressions. Qualitative behavior is similar, but it depends on rather nontrivial conditions on the main parameters, which distinguishes the cases when it is profitable for inspectors
I to choose the highest possible value of
α or to stay somewhere in the middle of interval (
30).
The results for
are similar (with more cumbersome formulas). The domain of corruption is given by the inequality
The extremal point of the convex function on the l.h.s. of this inequality is
Hence, the set of
α satisfying (
31) is not empty whenever
or
This is the necessary condition for the existence of corruption in a stable equilibrium that extends (
24) to arbitrary
γ.
The condition
yields
All other conditions generalize accordingly. The qualitative result is that, under the necessary conditions of corruption, the illegal profit
r in equilibrium is roughly around the inverses
and
of nominal (per unit of illegal profit) fines inspected by
A, but now (unlike the case
) taken in power
, which is more than 1 for
and tends to
∞ for
.
5. Variants of the Model
The general statement about the noneffectiveness of linear fines seems to be sufficiently robust under the variants of the model. However, details may differ essentially under various modifications of the assumptions of the model. Above, we looked carefully at the simplest possible setting. Let us point out some quite reasonable modifications giving brief comments on essential differences and leaving details for further work.
1. Within our present assumptions, the corruption domain (
11) does not depend on
v, and thus we have just two separate generalized replicator dynamics (
5) and (
6). One can argue that if the “investigation power”
p of inspectors is supposed to increase in
v (as we assume in (
3)), it would be natural to assume the “investigation power”
δ of authority
A increasing in the same way. Then the corruption domain (
11) would depend on
v: it would be given by the interval
with some effective frontier of corruption
. Therefore, corruption would be effective for not very large
v, reflecting the intuition that, for larger
v, the probability of discovering corruption would be high enough to overweight the benefits of bribes. In the case of a
v-dependent domain of corruption, the analysis of stable equilibria becomes more involved, as the replicator dynamics are not separated, but becomes switching at the effective frontier of corruption
. Such a switching was analyzed in [
12], though with some minor errors in modeling. We expect that only for
v-dependent
δ the special effectiveness of
(as compared to other
) is revealed (see more on this point below in variant 3).
2. When
, the approximation
to
is obviously not appropriate. For large
on can use the Poisson approximation yielding
Therefore, up to exponentially small terms,
. This means that corruption would be most probably detected, but an inspector
I may not bother much about it, due to the immense profit from the corruption. In this approximation, the corruption domain is given by the condition
Here, the corruption domain is again v-dependent, but is given by an interval of the upper values , with some effective frontier of corruption . In this case, the corruption is profitable for larger v, unlike the previous case. This behavior reflects the intuition that, if I does not care essentially about being detected and fined, I just wishes to suck out as much as possible bribes from violating agents.
3. If
is large, but detection is not desirable for an inspector,
I may introduce another tool of control to reduce this probability. Namely,
I can choose to take bribes only for a fraction
ω of detected violators. If
ω is small enough to make
much smaller than 1, approximation
is valid for the probability of detection so that the additional illegal payoff to
I equals
and the corruption domain is again given by (
11). However, introduction of
ω changes
and
. Namely, in this case, (
4) changes to
This leads to essential modifications of the analysis because the rest point
obtains the term of order
in the denominator (unlike our (
21)). This means that the average illegal payoff
in equilibrium would be of order
for large
r, which is universally bounded for
(not for
) expressing the principle of quadratic fines, as declared in [
12].
6. Numeric Examples
In this section, we give some numeric examples with two objectives: to demonstrate (i) the domain of applicability of our main approximation and (ii) the linear—quadratic fine function dichotomy, that is, why quadratic fine yields more effective control over the illegal profit of agents and corruption of inspectors. The game parameters proposed in the examples are socially averaged and applicable to the real situations.
In our main model, see (
10), we have taken approximation
for the probability of a corrupted inspector to be detected. This approximation is reasonable for small
.
To illustrate this approximation further, let us take
If , then and , which seems to be a reasonable approximation (6% error). If , then the approximation to equals 0.8. This is close to 1 and differs essentially from the exact value .
When
M increases, we expect approximation (
33) to become more appropriate. In fact, when
,
, while the Poisson approximation is 0.798. For this range of parameters, one can employ the second variant of the model, sketched in the previous section.
Let us now illustrate our results with some rough estimates. Let us start with linear fine functions, with and .
Remark 7. These numbers are not taken totally out of the blue. In Russia, tax avoidance is fined usually by linear fine functions with or and in the UK by linear fine functions with f having value up to 2 (with variations depending on various circumstances). The fines for corruption vary in much broader intervals. Under certain circumstances, Russian law uses linear fine functions with F between 10 and 20.
By Theorem 1, to avoid illegal profit and corruption with such tools of mechanism design, authority A has to have efficiency δ higher than about 10%. If it is less than this value, equilibrium will support arbitrarily large illegal profit r, as long as it is essentially higher as some multiple of (the salary of inspectors I). Even if (efficiency of about 50%) stable equilibria do not support corruption of inspectors, they still do support unbounded illegal profit of agents.
On the other hand, if one would use the quadratic fine functions with, say, the same f and F, one would obtain, by Theorem 3, the (roughly) condition ensuring the absence of corruption in equilibria. But if this (restrictive) bound does not hold, corruption and illegal profit r will be possible, but in a controllable way. Namely, r will be bounded at least by , and will actually belong to a rather restrictive domain. Exact intervals for r and optimal values of bribes α can be obtained from Theorems 1 and 3.