Article

Exploitation of a Productive Asset in the Presence of Strategic Behavior and Pollution Externalities

1
GERAD, HEC Montréal, 3000 Côte-Sainte-Catherine, Montreal, QC H3T 2A7, Canada
2
Chair in Game Theory and Management, 3000 Côte-Sainte-Catherine, Montreal, QC H3T 2A7, Canada
*
Author to whom correspondence should be addressed.
Mathematics 2020, 8(10), 1682; https://doi.org/10.3390/math8101682
Submission received: 28 August 2020 / Revised: 25 September 2020 / Accepted: 26 September 2020 / Published: 1 October 2020
(This article belongs to the Special Issue Mathematical Game Theory 2021)

Abstract

We study the strategic behavior of firms competing in the exploitation of a common-access productive asset, in the presence of pollution externalities. We consider a differential game with two state variables (asset stock and pollution stock), and by using a piecewise-linear approximation of the nonlinear asset growth function, we provide a tractable characterization of the symmetric feedback–Nash equilibrium with asymptotically stable steady state(s). The results show that the firm’s strategy takes three forms depending on the pair of state variables and that different options for the model parameters lead to contrasting outcomes in both the short- and long-run equilibria.

1. Introduction

There is an extensive literature in economics, operations research, game theory, and dynamic optimization dealing with the exploitation of renewable resources (e.g., a fishery or a forest). One main question shared by all parties, namely firms, governments (regulators), and citizens, is how to exploit these resources in a sustainable way. In the vast dynamic games literature, where players (firms and regulators) interact strategically over time, the models have very often focused on the resource itself, without any other considerations. Typically, these models start with a dynamic system describing the evolution of the stock, and next, they characterize the equilibrium strategies under different assumptions related (i) to the information structure (e.g., open-loop, feedback, or closed-loop with or without memory) and (ii) to the players’ behavior (cooperative or noncooperative).
There is ample evidence showing that the evolution of renewable resources depends not only on natural variations and on human intervention (harvesting, deforestation, etc.) but also on the accumulated pollution. An illustrative example is the recent discovery, devastating for biomass, of over 5 trillion pieces of plastic, weighing a total of 250,000 tons, afloat on the Pacific Ocean. More than half of this island of plastic is made of fishing gear, i.e., is the result of resource exploitation [1]. In addition to the adverse effects of accumulated pollution on the resources, the literature has documented various direct economic costs borne by firms operating in these industries. For instance, the fishing industry faces costs arising from the need to repair or replace gear that has been damaged or lost because of encounters with abandoned, lost, or otherwise discarded fishing gear (ALDFG) [2,3]. The impacts of ALDFG on fisheries also include hazards to navigation and safety at sea, which increase fuel costs and reduce fishing time, and loss of earnings due to reduced or contaminated catches, including ghost fishing [4,5]. It is estimated that the cost of marine debris damage to the fishing industry in 2008 was US$364 million for the 21 Pacific Rim economies [6], and the derelict pot removal program between 2008 and 2014 in the largest estuary of the United States, the Chesapeake Bay, led to an additional harvest valued at US$21.3 million, a 27% increase above that which would have occurred without removals [7].
Scientists from various disciplines have proposed models to integrate the influence of other (state) variables—e.g., pollution stock, marine environmental quality, and habitat quality—in their analysis (e.g., [8]), omitting, however, the strategic interactions between the various parties involved. In this paper, we consider an oligopoly exploiting a renewable resource (a productive asset) and contribute to the literature by having a more realistic model where strategic behavior and pollution externalities are present. Indeed, strategic thinking has often been ignored in large-scale ecosystem models and in representative-agent frameworks and pollution has been disregarded in games of renewable-resource exploitation.
To represent the habitat’s limited carrying capacity, the rate of growth of the productive asset is typically modeled as a nonlinear, inverted U-shaped function of the asset stock (see, e.g., [9]). The author of [10,11] approximated this nonlinear rate of growth by an inverted-V function, which allowed for tractable characterization of the equilibrium strategies and payoffs. In this work, which is a revised version of [12], we adopt a differential game framework that extends the productive-asset model in [11] by introducing a second state variable, namely pollution stock, while retaining his approximation approach. The game is played by n identical firms competing à la Cournot over an infinite planning horizon. Each player aims to maximize their stream of discounted payoff, taking into account the market structure, the initial size of each stock, and their dynamics. For the present work, we abstract from the effects of pollution on the reproductive capacity of the asset and consider only the direct economic costs of pollution incurred by the firms as negative externality. The problem is technically an infinite horizon linear-quadratic n-player differential game with two state variables and one control variable for each player, which influences the two dynamic processes.
We characterize fully analytically the symmetric feedback–Nash equilibrium, where the firm’s strategy is a piecewise-linear function in the two state variables, for which the shape depends on the position of the game in the state space. Thus, the contribution of this work includes obtaining a closed-form solution in a model with two state variables and a piecewise-linear resource growth function. We show that the state space is divided into three regions, namely scarcity, abundance, and no exploitation. Three equilibrium cases are identified and are shown to depend on the relationship between the asset growth rate, the discount rate, and the pollution decay rate. This equilibrium is sustainable; that is, for any given pair of initial asset–pollution stocks, it converges to a stable steady state with positive exploitation and positive values of state variables. When there are at least two players, the equilibrium includes either one stable or two locally stable steady states; however, when the industry is monopolistic, the optimal solution includes only one stable steady state.
Our work belongs quite naturally to the literature on the exploitation of productive assets and the management of resources under pollution control. Early contributions in this area include [13,14], both of which explored the set of equilibria in a dynamic game framework. The characterization of the equilibrium in [10,11] showed that the firm’s strategy and the value function take piecewise forms, which depend on the asset stock level. There can be a unique steady state or multiple steady states depending on the asset growth rate, and the decision of a single firm to unilaterally decrease its exploitation may result in a decrease in the asset stock. These contributions led to extensive research on various topics related to the strategic exploitation of common-access resources that also took into account the nonlinearity of the growth rule. The literature has examined issues such as optimal taxation [15], losses from competition [16], the role of property rights, and convergence to the Cournot equilibrium [17,18]. More recently, [19] showed results on pre-emption, voracity, and exhaustion. The authors of [20] showed that nonlinear feedback strategies are unstable in a dynamic duopoly game with renewable resource exploitation. The effects of mergers were analyzed in [21], the impact of social status concern in these industries was studied in [22], and [23] investigated the incentives in a duopoly by considering a finite planning horizon.
In parallel to this dynamic-games literature on resources, a significant literature has dealt with the strategic behavior of agents under pollution externalities (see the surveys in [24,25]). The pioneering contributions are [26,27], where both noncooperative and cooperative solutions were characterized and contrasted. Many papers have ensued, focusing on issues such as taxation (e.g., [28,29]), sustainability and uncertainty (e.g., [30,31]), international environmental agreements (e.g., [32,33]), and technical change and R&D (e.g., [34]).
Some studies have integrated pollution accumulation in the exploitation of a renewable resource (see, e.g., [35,36,37]). However, these contributions have not accounted for strategic behavior or, when they have, they omitted the feature that the resource growth rate is not linear. In work closely related to ours, [38] considered a similar model with two state variables, namely biomass stock and pollution stock. They studied a sustainable cooperative agreement in an open-access fishery and characterized the symmetric noncooperative and cooperative solutions. They focused only on an interior solution that covers one part of the solution (which we name the scarcity region). Moreover, they considered the damage of pollution on the resource dynamics together with the damage on welfare, whereas in the present paper we study the effect of taking into account the direct economic damage on the firms’ profits. In addition, we characterize the equilibrium for the whole state space and study in detail the boundary cases and the values of parameters that allow for a feedback–Nash equilibrium.
In a nutshell, our paper attempts to integrate within the same framework the exploitation of a productive asset in the presence of pollution externalities and strategic behavior. We approximate the nonlinear growth function by a piecewise linear function, which allows us to have a tractable linear-quadratic dynamic game, for which the equilibria can be characterized analytically.
The rest of the paper is organized as follows: Section 2 introduces the model, Section 3 characterizes the equilibrium and shows its properties, and Section 4 concludes.

2. The Model

We consider an n-player infinite-horizon differential game, with the asset stock $S$ and the pollution stock $Z$ being the two state variables. The model extends the framework in [11] by introducing pollution externalities in the firms’ decision-making problem. At each date $t \in [0, +\infty)$, $n$ firms exploit the common-access asset in quantities $q_i(t)$, $i = 1, \ldots, n$, and compete à la Cournot. We assume that the exploitation strategy $q_i(t)$ is nonnegative and bounded above, that is, $0 \le q_i(t) \le M S(t)$ with the positive constant $M$ sufficiently large (see [14,39], which considered similar constraints). We consider the transformation rate of the asset to the final product to be one-to-one and the unit cost of exploitation to be zero for simplicity. The price $p$ is determined by the linear inverse-demand function $p(Q) = a - bQ$, where $Q = \sum_{i=1}^{n} q_i$ is the total quantity supplied and where $a > 0$ and $b > 0$.
The growth rate of the productive asset (e.g., a fishery, a forest, etc.) is assumed to be a nonlinear, inverted-U-shaped function of the asset stock. Following the literature, we adopt the following piecewise-linear approximation:
$$ f(S) = \begin{cases} \delta S & \text{if } S \le S_y, \\ \delta (S_{max} - S) & \text{if } S > S_y, \end{cases} \tag{1} $$
where $\delta > 0$ denotes the intrinsic growth rate of the asset and $S_y$ is the level of asset that leads to the so-called maximum sustainable yield ($S_y = S_{max}/2$), with $S_{max} > 0$ being the carrying capacity of the habitat. Note that $f(S) = 0$ if $S \in \{0, S_{max}\}$, $f(S) > 0$ if $S \in (0, S_{max})$, and $f(S) < 0$ if $S > S_{max}$. Taking into account an asset growth function that includes pollution (i.e., $f(S, Z)$) would allow us to capture the negative impacts of pollution on the asset stock. This is considered in [38], which presents the closed-form solutions for the noncooperative and cooperative cases for a part of the interior solution. In this work, we refrain from adding pollution and use the function $f(S)$ to keep the presentation of results simpler.
Taking into account the firms’ exploitation, the change in the asset stock at date $t$ is governed by the following differential equation:
$$ \frac{dS(t)}{dt} = \dot{S}(t) = f(S(t)) - \sum_{i=1}^{n} q_i(t). \tag{2} $$
The firms’ activities generate emissions as a by-product and add up to the pollution stock, which evolves over time as follows:
$$ \frac{dZ(t)}{dt} = \dot{Z}(t) = \alpha \sum_{i=1}^{n} q_i(t) - k Z(t), \tag{3} $$
where $\alpha > 0$ denotes the amount of emissions resulting from exploiting a unit of asset, and $k > 0$ is the pollution decay rate.
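As a minimal numerical sketch of the growth function and the two state equations, the following Python snippet implements $f(S)$ and a single explicit Euler step. The function names, the Euler discretization, and the parameter values are illustrative assumptions, not part of the original model description.

```python
def growth(S, delta, S_max):
    """Piecewise-linear (inverted-V) growth function f(S)."""
    S_y = S_max / 2.0  # stock level giving the maximum sustainable yield
    return delta * S if S <= S_y else delta * (S_max - S)


def euler_step(S, Z, q_total, dt, delta, S_max, alpha, k):
    """One explicit Euler step of the asset and pollution dynamics.

    q_total is the total exploitation sum_i q_i at the current date.
    The Euler scheme is an illustrative choice, not the authors' method.
    """
    dS = growth(S, delta, S_max) - q_total  # asset: growth minus total harvest
    dZ = alpha * q_total - k * Z            # pollution: emissions minus decay
    return S + dt * dS, Z + dt * dZ


if __name__ == "__main__":
    # Hypothetical parameter values, chosen only for illustration.
    S, Z = 10.0, 5.0
    for _ in range(100):
        S, Z = euler_step(S, Z, q_total=0.0, dt=0.01,
                          delta=0.2, S_max=100.0, alpha=1.0, k=0.3)
    print(S, Z)  # with no exploitation the asset grows and pollution decays
```

With zero exploitation, the asset stock rises and the pollution stock falls, which is the behavior the model attributes to the no-exploitation region.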
Denote by $d(Z)$ the (symmetric) damage cost of player $i$. We suppose that this environmental cost is convex increasing in the pollution stock $Z$ and satisfies the property $d(0) = 0$. For tractability, we adopt the quadratic functional form $d(Z) = \frac{\phi}{2} Z^2$, where $\phi > 0$.
Assuming that each player maximizes their discounted stream of profit, the optimization problem of player i is then as follows:
$$ \max_{q_i(t)} \int_{t=0}^{+\infty} e^{-rt} \left[ p\left( \sum_{i=1}^{n} q_i(t) \right) q_i(t) - d(Z(t)) \right] dt, \quad \text{subject to (2), (3), and } 0 \le q_i(t) \le M S(t), \text{ with } S(0) = S_0 > 0, \; Z(0) = Z_0 \ge 0 \text{ given}, \tag{4} $$
where $r > 0$ denotes the common discount rate.

3. The Equilibrium

We consider the equilibrium where firms use feedback information in their decision-making. This is a subgame-perfect equilibrium (also called a Markovian Perfect Nash Equilibrium (MPNE)), in which the firm’s strategy is state dependent and strongly time consistent [40,41]. Denote by $V_i(S(t), Z(t))$ the value function of firm $i$, which is the discounted sum of profits that the firm obtains in the game starting in state $(S(t), Z(t))$. Unless an ambiguity arises, we shall from now on omit the time argument. Introduce the Hamilton–Jacobi–Bellman (HJB) equation associated with firm $i$’s maximization problem, that is,
$$ r V_i(S, Z) = \max_{q_i} \left\{ p\left( \sum_{i=1}^{n} q_i \right) q_i - d(Z) + \frac{\partial V_i(S, Z)}{\partial S} \left[ f(S) - \sum_{i=1}^{n} q_i \right] + \frac{\partial V_i(S, Z)}{\partial Z} \left[ \alpha \sum_{i=1}^{n} q_i - k Z \right] \right\}, \tag{5} $$
for $i = 1, \ldots, n$, where the partial derivative $\partial V_i(S, Z)/\partial S$ represents the shadow price (or value) of the asset stock (also called scarcity rent) and $\partial V_i(S, Z)/\partial Z$ denotes the shadow value of the pollution stock. Taking into account the nonnegativity restriction on $q_i$ in problem (4), maximizing the right-hand side of (5) yields the following condition:
$$ q_i \ge 0; \qquad a - b \sum_{i=1}^{n} q_i - b q_i \le \frac{\partial V_i(S, Z)}{\partial S} - \alpha \frac{\partial V_i(S, Z)}{\partial Z}, \tag{6} $$
with at least one of the inequalities holding with equality. The condition (6) must hold together with the terminal condition $\lim_{t \to +\infty} e^{-rt} V(S(t), Z(t)) = 0$ for every admissible trajectory. The left-hand side of the second inequality in (6) is the marginal revenue of firm $i$ for given quantities of competitors. The right-hand side represents the marginal (opportunity) cost of a unit of exploitation, which comprises the shadow prices of the asset stock and pollution stock. Since we are considering costless exploitation, the opportunity cost consists only of the shadow prices of the state variables. We focus on a symmetric equilibrium in which all firms have the same value function and exploit the same quantities ($V_i(S, Z) = V(S, Z)$ and $q_i = q$ for all $i = 1, \ldots, n$). Consequently, (6) becomes
$$ q^*(S, Z) = \max\left\{ 0, \; \frac{1}{b(n+1)} \left( a - \frac{\partial V(S, Z)}{\partial S} + \alpha \frac{\partial V(S, Z)}{\partial Z} \right) \right\}, \tag{7} $$
for $i = 1, \ldots, n$. Condition (7) results in two possibilities for the equilibrium strategy, that is, $q^*(S, Z) > 0$ or $q^*(S, Z) = 0$. There may exist different cases where $q^*(S, Z) > 0$ depending on the signs of the partial derivatives of the function $V(S, Z)$. We consider the cases in which the asset has a scarcity rent ($\partial V(S, Z)/\partial S > 0$) or not ($\partial V(S, Z)/\partial S = 0$), and we consider all possible cases for the effect of pollution on the value of the firm; thus, $\operatorname{sign}(\partial V(S, Z)/\partial Z)$ is unrestricted. By using (7), we write these cases in the following definition:
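Definition 1.
The three regions are as follows:
  • Scarcity region ($R_S$): $q^*(S, Z) > 0$ with $\partial V(S, Z)/\partial S > 0$:
    $$ R_S = \left\{ (S, Z) \;\middle|\; a > \frac{\partial V(S, Z)}{\partial S} - \alpha \frac{\partial V(S, Z)}{\partial Z} \ \text{and} \ \frac{\partial V(S, Z)}{\partial S} > 0 \right\}. $$
  • Abundance region ($R_A$): $q^*(S, Z) > 0$ with $\partial V(S, Z)/\partial S = 0$:
    $$ R_A = \left\{ (S, Z) \;\middle|\; a > -\alpha \frac{\partial V(S, Z)}{\partial Z} \ \text{and} \ \frac{\partial V(S, Z)}{\partial S} = 0 \right\}. $$
  • No-exploitation region ($R_0$): $q^*(S, Z) = 0$:
    $$ R_0 = \left\{ (S, Z) \;\middle|\; a \le \frac{\partial V(S, Z)}{\partial S} - \alpha \frac{\partial V(S, Z)}{\partial Z} \right\}. $$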
In region $R_S$, the pair of state variables is such that it is profitable to exploit the asset, and the asset has a scarcity rent. The firms view the level of the asset stock as scarce, and they consider their impact on the asset stock in their strategy. In region $R_A$, the asset stock is so high that an additional unit in the stock brings no value to firms. Players consider only the pollution externality as an intertemporal effect of exploitation, and the asset stock does not play a role in their decision. In region $R_0$, the marginal revenue of an initial asset supply (given by the price $p(0) = a$) is lower than its marginal cost, which depends on the shadow prices of the asset stock and pollution stock. Hence, exploitation is not dynamically profitable and the equilibrium strategy is to wait for the asset to replenish and for pollution to decline.
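As a brief sketch of the step from (6) to (7): in a symmetric equilibrium with $q_i = q > 0$, the second inequality in (6) binds, and solving it for $q$ gives the interior branch of (7). The regions of Definition 1 then follow from the sign of that expression and of the scarcity rent. The shorthand $m(S, Z)$ is introduced here only for illustration.

```latex
% Symmetric interior solution: (6) holds with equality and \sum_i q_i = n q
a - b n q - b q = \frac{\partial V}{\partial S} - \alpha \frac{\partial V}{\partial Z}
\;\Longrightarrow\;
q = \frac{a + m(S,Z)}{b(n+1)},
\qquad
m(S,Z) := -\frac{\partial V}{\partial S} + \alpha \frac{\partial V}{\partial Z}.

% Corner solution: if a + m(S,Z) \le 0, the constraint q \ge 0 binds, so
q^*(S,Z) = \max\left\{ 0, \; \frac{a + m(S,Z)}{b(n+1)} \right\}.
```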
Since the function V is not known, it is not clear beforehand whether any of these three cases exist. To proceed with an analytically tractable characterization, we focus on the strategies that are linear functions of the state variables. Furthermore, we introduce the following assumptions:
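Assumption 1.
$$ (a) \ \frac{r(1+n^2)}{2} < \delta \le r + k \ \text{and} \ \phi < \phi_1, \qquad (b) \ r + k < \delta < k \, \frac{1+n^2}{n^2-1} \ \text{and} \ \phi < \phi_2, $$
where the terms $\phi_1$ and $\phi_2$ are given by
$$ \phi_1 = \frac{b (k+r) \left( 2\delta - (n^2+1) r \right) \left( \delta \left( 2 k n^2 + (n^2+1) r \right) - (n^2+1) r (k+r) \right)}{2 \left( \delta \alpha n (n-1) \right)^2}, $$
$$ \phi_2 = \frac{b k (k+r) \left( k (n^2+1) - \delta (n^2-1) \right) \left( (n^2+1)(k+r) + \delta (n^2-1) \right)}{2 \left( \delta \alpha n (n-1) \right)^2}, $$
where we have $\phi_2 < \phi_1$ for $n > 1$ and $r + k < \delta < k \, \frac{1+n^2}{n^2-1}$.
Assumption 2.
$$ \frac{4 a b (1+n^2)(k+r)(k+\delta)}{\delta \left( \lambda + b(1+n) r \right) \left( \lambda + b(1+n)(2\delta - r) \right)} < S_y, $$
where $\lambda = \sqrt{\left( b(n+1)(2k+r) \right)^2 + 8 n^2 \alpha^2 b \phi}$.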
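The two assumptions can be checked numerically for a candidate calibration. The sketch below is a hedged illustration: the function name `check_assumptions` and the parameter values in the usage example are hypothetical, Assumption 1 is read as "(a) or (b) holds", and the formulas require $n \ge 2$ because $\phi_1$ and $\phi_2$ divide by $n - 1$.

```python
import math


def check_assumptions(a, b, n, r, k, delta, alpha, phi, S_max):
    """Return (assumption_1_holds, assumption_2_holds) for n >= 2 firms.

    Assumption 1 is interpreted here as: part (a) or part (b) holds.
    """
    n2 = n * n
    denom = 2.0 * (delta * alpha * n * (n - 1)) ** 2
    phi_1 = (b * (k + r) * (2 * delta - (n2 + 1) * r)
             * (delta * (2 * k * n2 + (n2 + 1) * r) - (n2 + 1) * r * (k + r))
             ) / denom
    phi_2 = (b * k * (k + r) * (k * (n2 + 1) - delta * (n2 - 1))
             * ((n2 + 1) * (k + r) + delta * (n2 - 1))
             ) / denom

    # Part (a): moderate growth rate; part (b): high growth rate.
    part_a = (r * (1 + n2) / 2 < delta <= r + k) and (phi < phi_1)
    part_b = (r + k < delta < k * (1 + n2) / (n2 - 1)) and (phi < phi_2)

    # Assumption 2: the threshold on the left must lie below S_y = S_max / 2.
    lam = math.sqrt((b * (n + 1) * (2 * k + r)) ** 2
                    + 8 * n2 * alpha ** 2 * b * phi)
    lhs = (4 * a * b * (1 + n2) * (k + r) * (k + delta)
           / (delta * (lam + b * (1 + n) * r)
              * (lam + b * (1 + n) * (2 * delta - r))))
    return (part_a or part_b), (lhs < S_max / 2)


if __name__ == "__main__":
    # Hypothetical duopoly calibration, for illustration only.
    print(check_assumptions(a=10.0, b=1.0, n=2, r=0.05, k=0.3,
                            delta=0.2, alpha=1.0, phi=0.01, S_max=100.0))
```

Raising the damage parameter `phi` or lowering `S_max` in this calibration eventually violates Assumption 1 or Assumption 2, respectively.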
The assumptions above require the intrinsic growth rate of the asset ($\delta$) to be sufficiently high (which is the same condition required in [11]) but also bounded from above with a threshold that depends on the pollution decay rate ($k$) and the number of firms ($n$). Another restriction is imposed on the marginal damage parameter $\phi$, which is required to be sufficiently low. For high values of $\delta$, the condition on the parameter $\phi$ becomes stricter. We will discuss the roles of these restrictions in more detail in Section 3.1, where we study the properties of the equilibrium. In the following theorem, we characterize the symmetric equilibrium:
Theorem 1.
Suppose that Assumptions 1 and 2 are satisfied.
(a) Equilibrium strategies:
The strategy profile $\left( q_1(t), \ldots, q_n(t) \right) = \left( q^*(S(t), Z(t)), \ldots, q^*(S(t), Z(t)) \right)$ for $t \in [0, +\infty)$, where
$$ q^*(S, Z) = \begin{cases} (a + c_0 + c_S S + c_Z Z) / \left( b(n+1) \right) & \text{if } (S, Z) \in R_S, \\ (a + \bar{c}_0 + \bar{c}_Z Z) / \left( b(n+1) \right) & \text{if } (S, Z) \in R_A, \\ 0 & \text{if } (S, Z) \in R_0, \end{cases} \tag{11} $$
constitutes a symmetric feedback–Nash equilibrium.
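The terms $c_S$, $c_Z$, $c_0$ and $\bar{c}_S$, $\bar{c}_Z$, $\bar{c}_0$, which depend on the exogenous model parameters, are written as follows:
$$ c_S = \frac{(n+1)}{4 n^2} \frac{(2\delta - r)}{(\delta + k)} \left( \lambda + b(n+1)(2\delta - r) \right), \tag{12} $$
$$ c_Z = \frac{(n+1)}{4 n^2 \alpha} \frac{(\delta - r - k)}{(\delta + k)} \left( \lambda - b(n+1)(2k + r) \right), \tag{13} $$
$$ c_0 = \frac{a (n^2+1) \left( c_S (k+r) + \alpha c_Z (\delta - r) \right)}{b (n+1)^2 (k+r)(\delta - r) - 2 n^2 \left( c_S (k+r) + \alpha c_Z (\delta - r) \right)}, \tag{14} $$
and
$$ \bar{c}_S = 0, \tag{15} $$
$$ \bar{c}_Z = -\frac{(n+1)}{4 n^2 \alpha} \left( \lambda - b(n+1)(2k + r) \right), \tag{16} $$
$$ \bar{c}_0 = \frac{a (n^2+1) \alpha \bar{c}_Z}{b (n+1)^2 (k+r) - 2 n^2 \alpha \bar{c}_Z}, \tag{17} $$
where the term $\lambda$ is given by
$$ \lambda = \sqrt{\left( b(n+1)(2k+r) \right)^2 + 8 n^2 \alpha^2 b \phi}. \tag{18} $$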
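The regions $R_S$, $R_A$, and $R_0$ are written as follows:
$$ R_S = \left\{ (S, Z) \;\middle|\; S > -\frac{a + c_0}{c_S} - \frac{c_Z}{c_S} Z \ \text{and} \ S < \frac{(2\delta - r)}{(k + r - \delta)} \frac{c_Z}{c_S} Z - \frac{(2\delta - r)}{(\delta - r)} \frac{a(1+n^2) + 2 n^2 c_0}{2 n^2 c_S} \right\}, \tag{19} $$
$$ R_A = \left\{ (S, Z) \;\middle|\; Z < -\frac{a + \bar{c}_0}{\bar{c}_Z} \ \text{and} \ S \ge \frac{(2\delta - r)}{(k + r - \delta)} \frac{c_Z}{c_S} Z - \frac{(2\delta - r)}{c_S} \frac{a(1+n^2) + 2 n^2 c_0}{2 n^2 (\delta - r)} \right\}, \tag{20} $$
$$ R_0 = \left\{ (S, Z) \;\middle|\; S \le -\frac{a + c_0}{c_S} - \frac{c_Z}{c_S} Z \ \text{or} \ Z \ge -\frac{a + \bar{c}_0}{\bar{c}_Z} \right\}. \tag{21} $$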
(b) Value functions:
The discounted sum of profits obtained by each firm is given by the following value function:
$$ V_i(S, Z) = V(S, Z) = \begin{cases} W(S, Z) & \text{if } (S, Z) \in R_S, \\ \bar{V}(Z) & \text{if } (S, Z) \in R_A, \\ V_0(S, Z) & \text{if } (S, Z) \in R_0, \end{cases} \tag{22} $$
for $i = 1, \ldots, n$, which is continuously differentiable $\forall (S, Z) \in \mathbb{R}_+^2$.
The function $W(S, Z)$ is written as follows:
$$ W(S, Z) = A + \frac{B}{2} S^2 + C S + \frac{D}{2} Z^2 + E Z + F S Z, \tag{23} $$
where
$$ A = \frac{(a + c_0)(a + n^2 c_0)}{b (n+1)^2 r}, \qquad B = \frac{2 n^2 c_S^2}{b (n+1)^2 (r - 2\delta)}, \tag{24} $$
$$ C = \frac{c_S \left( a (n^2+1) + 2 n^2 c_0 \right)}{b (n+1)^2 (r - \delta)}, \qquad D = \frac{2 n^2 c_Z^2 - b (n+1)^2 \phi}{b (n+1)^2 (2k + r)}, \tag{25} $$
$$ E = \frac{c_Z \left( a (n^2+1) + 2 n^2 c_0 \right)}{b (n+1)^2 (k + r)}, \qquad F = \frac{2 n^2 c_Z c_S}{b (n+1)^2 (k + r - \delta)}. \tag{26} $$
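The function $\bar{V}(Z)$ is written as follows:
$$ \bar{V}(Z) = \bar{A} + \frac{\bar{D}}{2} Z^2 + \bar{E} Z, \tag{27} $$
where
$$ \bar{A} = \frac{(a + \bar{c}_0)(a + n^2 \bar{c}_0)}{b (n+1)^2 r}, \qquad \bar{D} = \frac{\bar{c}_Z}{\alpha}, \qquad \bar{E} = \frac{\bar{c}_0}{\alpha}. \tag{28} $$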
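The function $V_0(S, Z)$ is written as follows:
$$ V_0(S, Z) = \left( \frac{Z}{\hat{Z}(S, Z)} \right)^{-\frac{r}{k}} \Theta\left( S \left( \frac{Z}{\hat{Z}(S, Z)} \right)^{\frac{\delta}{k}}, \; \hat{Z}(S, Z) \right) - \frac{\phi Z^2}{2 (2k + r)} \left[ 1 - \left( \frac{Z}{\hat{Z}(S, Z)} \right)^{-\frac{2k + r}{k}} \right], \tag{29} $$
with
$$ \Theta(S, Z) = \begin{cases} W(S, Z) & \text{if } (S, Z) \in R_0^W, \\ \bar{V}(Z) & \text{if } (S, Z) \in R_0^{\bar{V}}, \end{cases} \tag{30} $$
and the function $\hat{Z}(S, Z)$ is defined implicitly by the following system of equations:
$$ \hat{S}(S, Z) = S \left( \frac{Z}{\hat{Z}(S, Z)} \right)^{\frac{\delta}{k}}, \qquad \hat{Z}(S, Z) = -\frac{c_S}{c_Z} \hat{S}(S, Z) - \frac{a + c_0}{c_Z}. \tag{31} $$
The partitions of region $R_0$, denoted by $R_0^W$ and $R_0^{\bar{V}}$, are written as follows:
$$ R_0^W = \left\{ (S, Z) \in R_0 \;\middle|\; Z < Z_3 \ \text{or} \ Z_3 \le Z < \Psi(S) \right\}, \tag{32} $$
$$ R_0^{\bar{V}} = \left\{ (S, Z) \in R_0 \;\middle|\; Z \ge Z_3 \ \text{and} \ Z \ge \Psi(S) \right\}, \tag{33} $$
where the curve $Z = \Psi(S)$ denotes the boundary between $R_0^W$ and $R_0^{\bar{V}}$ given by
$$ \Psi(S) = Z_3 \left( \frac{S_3}{S} \right)^{\frac{k}{\delta}}, \tag{34} $$
with the constants $S_3$ and $Z_3$ given as follows:
$$ S_3 = \frac{2a \left[ 2 b k (n^2+1)(k+r) - b \delta (n^2-1) r - \delta \lambda (n-1) \right]}{\delta \left( b(n+1) r + \lambda \right) \left( \lambda + b(n+1)(2\delta - r) \right)}, \tag{35} $$
$$ Z_3 = \frac{2 a \alpha \left[ b \left( 2 k (n^2+1) + r (3 n^2+1) \right) + \lambda (n-1) \right]}{\left( \lambda + b(n+1) r \right) \left( \lambda - b(n+1)(2k + r) \right)}. \tag{36} $$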
Proof. 
The long proof is built throughout the paper, and the details are provided in Appendix A, which has several subsections. The road map to complete the proof of the theorem is as follows:
  • In Appendix A.1, we state some preliminaries and introduce the methodological approach.
  • In Appendix A.2, we study the case $q^*(S, Z) > 0$. By guessing a piecewise-quadratic form for the value function and by applying the undetermined coefficient method, we obtain the functions $W(S, Z)$ and $\bar{V}(Z)$ and the solutions associated with their coefficients. Then, we analyze the boundary cases and their positions in the $(S, Z)$ plane.
  • Lemma A1 in Appendix A.2.1 shows that, under Assumptions 1 and 2, the function $W(S, Z)$ satisfies the HJB equation $\forall (S, Z) \in R_S$ and that the strategy profile $q_i = q^*$, $\forall i$, satisfies (7) with $q^*(S, Z) > 0$ and $\partial V(S, Z)/\partial S > 0$.
  • Lemma A2 in Appendix A.2.2 shows that, under Assumptions 1 and 2, the function $V(S, Z)$, equal to $W(S, Z)$ if $(S, Z) \in R_S$ and to $\bar{V}(Z)$ if $(S, Z) \in R_A$, is continuously differentiable in $S$ and $Z$ and satisfies the HJB equation $\forall (S, Z) \in R_S \cup R_A$. The strategy profile $q_i = q^*$, $\forall i$, satisfies (7) with $q^*(S, Z) > 0$ and $\partial V(S, Z)/\partial S \ge 0$.
  • Appendix A.3 looks at the case $q^* = 0$. Lemma A3 obtains the function $V_0(S, Z)$ and shows that it is continuously differentiable in $S$ and $Z$, $\forall (S, Z) \in R_0$, and on the boundaries of $R_0$.
  • Combining these results, we conclude that the piecewise function $V(S, Z)$ given in (22) satisfies HJB Equation (5) and that the strategy profile $q_i = q^*$, $\forall i$, satisfies the condition in (7) $\forall (S, Z) \in \mathbb{R}_+^2$ and constitutes a feedback–Nash equilibrium.
 □
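The form of $V_0$ in part (b) of the theorem can be read as the value of waiting until the state leaves $R_0$. The following is a sketch of that reasoning, assuming the trajectory stays on the increasing branch of $f$ (that is, $S \le S_y$) while in $R_0$; the exit time $T$ is notation introduced here for illustration.

```latex
% With q^* = 0, the states evolve as \dot S = \delta S and \dot Z = -kZ:
S(\tau) = S e^{\delta \tau}, \qquad Z(\tau) = Z e^{-k \tau}.

% Exit from R_0 at time T, at the point (\hat S, \hat Z):
\hat Z = Z e^{-kT}
\;\Rightarrow\;
e^{-rT} = \left( \frac{Z}{\hat Z} \right)^{-r/k},
\qquad
\hat S = S \left( \frac{Z}{\hat Z} \right)^{\delta/k}.

% Only the damage cost accrues before T; the continuation value is \Theta:
V_0(S,Z) = -\int_0^T e^{-r\tau} \, \frac{\phi}{2} Z^2 e^{-2k\tau} \, d\tau
           + e^{-rT} \, \Theta(\hat S, \hat Z),
\qquad
\int_0^T e^{-(2k+r)\tau} \, d\tau
  = \frac{1}{2k+r} \left[ 1 - \left( \frac{Z}{\hat Z} \right)^{-\frac{2k+r}{k}} \right].
```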
Theorem 1 characterizes the symmetric equilibrium strategy of firms for any given pair $(S, Z)$. The strategy $q^*(S, Z)$ is a piecewise-linear function in $S$ and $Z$ with coefficients $(c_0, c_S, c_Z)$ and $(\bar{c}_0, \bar{c}_Z)$ that correspond to the coefficients of the marginal cost function given in (6) (see (A7) in Appendix A.2).
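To illustrate how the piecewise-linear strategy can be evaluated, the sketch below computes the coefficients stated in Theorem 1(a) and returns $q^*(S, Z)$ as a Python function. The function name, the helper variables, and the calibration in the usage example are hypothetical; the sketch assumes $\phi > 0$, $\delta \neq r$, parameters satisfying Assumptions 1 and 2, and states with $S \le S_y$ inside $R_S$.

```python
import math


def equilibrium_strategy(a, b, n, r, k, delta, alpha, phi):
    """Return q*(S, Z) built from the coefficients in Theorem 1(a).

    Assumes phi > 0 and delta != r; not a substitute for the proof.
    """
    n2, N = n * n, n + 1
    L = b * N * (2 * k + r)
    lam = math.sqrt(L ** 2 + 8 * n2 * alpha ** 2 * b * phi)       # lambda
    c_S = (N * (2 * delta - r) * (lam + b * N * (2 * delta - r))
           / (4 * n2 * (delta + k)))
    # Ratio c_Z / (k + r - delta); stays finite when delta = r + k (Case 2).
    cz_ratio = -N * (lam - L) / (4 * n2 * alpha * (delta + k))
    c_Z = cz_ratio * (k + r - delta)
    G = c_S * (k + r) + alpha * c_Z * (delta - r)
    c_0 = (a * (n2 + 1) * G
           / (b * N ** 2 * (k + r) * (delta - r) - 2 * n2 * G))
    cb_Z = -N * (lam - L) / (4 * n2 * alpha)                      # abundance slope
    cb_0 = (a * (n2 + 1) * alpha * cb_Z
            / (b * N ** 2 * (k + r) - 2 * n2 * alpha * cb_Z))
    K = a * (1 + n2) + 2 * n2 * c_0
    Z_0A = -(a + cb_0) / cb_Z                                     # pollution threshold

    def q_star(S, Z):
        # Boundary where q* = 0 in the scarcity region.
        s_low = -(a + c_0) / c_S - (c_Z / c_S) * Z
        # Boundary where the scarcity rent dW/dS vanishes.
        s_high = ((2 * delta - r) * cz_ratio * Z / c_S
                  - (2 * delta - r) * K / ((delta - r) * 2 * n2 * c_S))
        if s_low < S < s_high:                  # scarcity region R_S
            return (a + c_0 + c_S * S + c_Z * Z) / (b * N)
        if S >= s_high and Z < Z_0A:            # abundance region R_A
            return (a + cb_0 + cb_Z * Z) / (b * N)
        return 0.0                              # no-exploitation region R_0

    return q_star


if __name__ == "__main__":
    # Hypothetical duopoly calibration with delta < r + k (Case 1).
    q = equilibrium_strategy(a=10.0, b=1.0, n=2, r=0.05, k=0.3,
                             delta=0.2, alpha=1.0, phi=0.01)
    for state in [(0.0, 0.0), (10.0, 0.0), (40.0, 0.0), (10.0, 1000.0)]:
        print(state, q(*state))
```

Writing the ratio $c_Z / (k + r - \delta)$ separately is a design choice of this sketch: it avoids a division by zero in the boundary expression when $\delta = r + k$.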
For further analysis, we define the boundary lines between the regions as follows:
Definition 2.
Let $Z = Z_{ij}(S)$ denote the boundary line in the $(S, Z)$ plane between the regions $R_i$ and $R_j$ as a function of $S$. We have
(i) The boundary line between $R_S$ and $R_0$:
$$ Z = Z_{0S}(S) = -\frac{c_S}{c_Z} S - \frac{a + c_0}{c_Z}. \tag{37} $$
(ii) The boundary line between $R_S$ and $R_A$:
$$ Z = Z_{SA}(S) = \frac{(k + r - \delta)}{(2\delta - r)} \frac{c_S}{c_Z} S + \frac{(k + r - \delta) \left( a(1+n^2) + 2 n^2 c_0 \right)}{2 n^2 (\delta - r) c_Z}. \tag{38} $$
(iii) The boundary line between $R_A$ and $R_0$:
$$ Z = Z_{0A} = -\frac{a + \bar{c}_0}{\bar{c}_Z}. \tag{39} $$
We now proceed with studying the general properties of the equilibrium in the next subsection.

3.1. The Properties of the Equilibrium

We briefly explain the methods used for obtaining the equilibrium strategies and then investigate their properties. Using a linear-quadratic model with the piecewise-linear approximation of the asset growth function enables us to guess the form of the value function as a polynomial of degree 2 in $S$ and $Z$ within an interior solution. We obtain the six-dimensional equation system associated with the coefficients of $W(S, Z)$ and then reduce it into a system of two equations in $(c_S, c_Z)$ given in (A5) and (A6). This system yields four solutions: two include $\partial W(S, Z)/\partial S \neq 0$ and the other two include $\partial W(S, Z)/\partial S = 0$. Among all solutions, only one pair makes it possible to characterize a feedback–Nash equilibrium with a stable steady state.
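To make the undetermined-coefficient step concrete, the following sketch substitutes the interior strategy into the HJB Equation (5) for $S \le S_y$. The shorthand $m$ and the subscript notation $W_S = \partial W/\partial S$, $W_Z = \partial W/\partial Z$ are introduced here; the matching shown is a reconstruction consistent with the coefficients of $W$ stated in Theorem 1, not a reproduction of Appendix A.

```latex
% Interior strategy and shorthand (m is notation introduced here)
q = \frac{a + m}{b(n+1)}, \qquad
m := c_0 + c_S S + c_Z Z = -W_S + \alpha W_Z .

% Substituting into (5) with f(S) = \delta S collapses the profit and control terms:
r W = \frac{(a + m)(a + n^2 m)}{b(n+1)^2} - \frac{\phi}{2} Z^2
      + \delta S \, W_S - k Z \, W_Z .

% Matching powers of S and Z with
% W = A + \tfrac{B}{2} S^2 + C S + \tfrac{D}{2} Z^2 + E Z + F S Z, for example:
S^2: \; \left( \tfrac{r}{2} - \delta \right) B = \frac{n^2 c_S^2}{b(n+1)^2},
\qquad
SZ: \; (k + r - \delta) F = \frac{2 n^2 c_S c_Z}{b(n+1)^2} .

% The definition of m closes the system:
c_S = -B + \alpha F, \qquad c_Z = -F + \alpha D, \qquad c_0 = -C + \alpha E .
```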
We use the solutions for the strategies and the value function to derive the analytical formulation of the case $R_S$, which is given in (19). We obtain the linear functions associated with its boundary cases, where $q^*(S, Z) = 0$ and $\partial W(S, Z)/\partial S = 0$, and then study their positions in the $(S, Z)$ plane by analyzing the signs of $(c_S, c_Z)$ given in Equations (12) and (13), which result in three cases that differ in the relationship among the dynamic model parameters, i.e., $\operatorname{sign}(\delta - r - k)$. In all cases, $c_S > 0$ and $\bar{c}_Z < 0$, but the sign of $c_Z$ differs, i.e., $\operatorname{sign}(c_Z) = \operatorname{sign}(\delta - r - k)$. Using these results, for $\delta > r/2$, we write the following cases:
Case 1: $\delta < r + k$: $c_S > 0$, $c_Z < 0$, $\bar{c}_Z < 0$,
Case 2: $\delta = r + k$: $c_S > 0$, $c_Z = 0$, $\bar{c}_Z < 0$,
Case 3: $\delta > r + k$: $c_S > 0$, $c_Z > 0$, $\bar{c}_Z < 0$.
Since $c_Z$ is the coefficient associated with the level of pollution in the equilibrium strategy in region $R_S$, this difference leads to contrasting results in the equilibrium responses to pollution in $R_S$, which will be discussed in detail below.
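The sign pattern of $c_Z$ across the three cases can be checked directly from the expressions for $\lambda$ and $c_Z$ in Theorem 1; the short derivation below is a sketch of that check for $\phi > 0$.

```latex
% For \phi > 0 the square root strictly exceeds its first term:
\lambda = \sqrt{\big( b(n+1)(2k+r) \big)^2 + 8 n^2 \alpha^2 b \phi}
\;>\; b(n+1)(2k+r),

% so the last factor of c_Z is positive and the sign is set by (\delta - r - k):
c_Z = \frac{(n+1)(\delta - r - k)}{4 n^2 \alpha (\delta + k)}
      \big( \lambda - b(n+1)(2k+r) \big)
\;\Rightarrow\;
\operatorname{sign}(c_Z) = \operatorname{sign}(\delta - r - k).
```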
We analyze the properties of the equilibrium by using the diagram in Figure 1, which shows the shapes and positions of the regions given in Definition 1 in the $(S, Z)$ plane.
(i) There are four regions: the value function takes a different form in each region.
(ii) $q^* > 0$ in $R_S \cup R_A$, and $q^* = 0$ in $R_0^W \cup R_0^{\bar{V}}$.
(iii) $\partial V/\partial S > 0$ in $R_S \cup R_0^W$, and $\partial V/\partial S = 0$ in $R_A \cup R_0^{\bar{V}}$.
(iv) The boundary case $Z = Z_{0S}(S)$ given in (37) denotes the threshold where $q^* = 0$ and $\partial W/\partial S > 0$, beyond which the firms voluntarily cease exploitation and wait for the asset to replenish and pollution to decline. The sign of its slope depends on $\operatorname{sign}(r + k - \delta)$.
(v) The boundary case $Z = Z_{SA}(S)$ given in (38) denotes the threshold where $q^* > 0$ and $\partial W/\partial S = 0$. It decreases in $S$ in all cases.
(vi) The boundary $Z = Z_{0A}$ given in (39), which is a positive constant, denotes the threshold level of pollution, where $q^* = 0$ and $\partial V/\partial S = 0$. The firms refrain from any exploitation if the level of pollution is above this threshold.
(vii) The boundary cases of $R_S$ intersect with the $Z = 0$ axis at points $(S_1, 0)$ and $(S_2, 0)$, and their intersection point is denoted by $(S_3, Z_3)$, where the closed-form solutions of $S_1$, $S_2$, and $(S_3, Z_3)$ are given in (A9), (A10), (35), and (36). When Assumptions 1 and 2 are satisfied, the ordering of these points is given as follows: $0 < S_1 < S_2 < S_y$ and $S_3 > 0$, $Z_3 > 0$.
The qualitative properties of equilibrium in region $R_A$ remain the same for all values of parameters that satisfy Assumptions 1 and 2. This is because the firm’s strategy and the value function in this region do not depend on the intrinsic growth rate ($\delta$). Within this region, the equilibrium behavior is that of the dynamic oligopoly with pollution externalities, where higher pollution induces the firms to exploit less of the asset ($\partial q^*/\partial Z < 0$). Since the slope of the boundary satisfies $dZ_{SA}(S)/dS < 0$ in all cases, for a fixed point on this boundary, $\partial V/\partial S > 0$ for lower levels of pollution and $\partial V/\partial S = 0$ if pollution is high.
By contrast, the behavior in region $R_S$ differs depending on $\operatorname{sign}(\delta - r - k)$. In the following, we show the properties of the equilibrium strategies in each case:
Case 1: $\delta < r + k$, $c_S > 0$ and $c_Z < 0$ (Figure 1 and Figure 2a): Equilibrium exploitation is faster if the asset stock is larger ($\partial q^*/\partial S > 0$), and it is slower if the level of pollution is higher ($\partial q^*/\partial Z < 0$). Since $dZ_{0S}(S)/dS > 0$, for a fixed pair of state variables on this threshold, firms will exploit the resource if pollution is sufficiently low but will not exploit if pollution is too high.
Case 2: $\delta = r + k$, $c_S > 0$ and $c_Z = 0$: The firm’s strategy includes only the asset stock and is independent of the level of pollution. The boundary case where $q^* = 0$ and $\partial W/\partial S > 0$ becomes a constant $S = S_1$.
Case 3: $\delta > r + k$, $c_S > 0$ and $c_Z > 0$ (Figure 2b): In this case, the slope of the threshold $Z = Z_{0S}(S)$ becomes negative. Firms exploit the asset faster if the level of pollution is higher ($\partial q^*/\partial Z > 0$). Since $dZ_{0S}(S)/dS < 0$, for a fixed point on this threshold, firms exploit the asset if the level of pollution is high and do not exploit it if pollution is low. The opportunity cost of exploitation given in (6) decreases in pollution.
The results presented above highlight the differences in equilibrium behavior among the three cases. For a relatively slow growing asset, the response to pollution is to decrease exploitation; however, if the asset growth rate is sufficiently large, the equilibrium response to pollution is reversed, i.e., to exploit more under higher pollution.
Since our model extends [11], it is useful to compare and contrast the results. Naturally, our solutions share many similarities with [11]. The three regions and the piecewise-linear strategies we obtain are also found in [11], where they depend only on the level of asset stock. This is indeed the consequence of considering a piecewise-linear growth function. Moreover, the solution we present is equivalent to the one given in [11] in the limit case, where the damage function parameter $\phi \to 0$ (see Remark A1 in Appendix A.2). Furthermore, as the damage cost parameter $\phi$ increases, the equilibrium behavior is more affected by the level of pollution. Our results show that, if the pollution level is too high, the firms do not exploit the asset for any given level of asset stock. Also, for a given set of other parameters, above a level of $\phi$ given in Assumption 1 ($\phi > \phi_1$ or $\phi > \phi_2$ depending on $\delta$, $k$, and $r$), we cannot characterize a feedback–Nash equilibrium for the whole state space. Last but not least, the different types of equilibrium responses in Cases 1 to 3 appear due to the presence of pollution in the problem.
As mentioned above, when firms do not have information on the pollution externality or when they do not take it into account in their decision-making ($\phi \to 0$), the equilibrium behavior is as given in [11]. One question is whether the firms benefit from taking into account the damage cost $d(Z)$ in their profits. When there are direct economic costs due to pollution but a firm disregards them, the discounted sum of profits the firm obtains is lower. We elaborate on this by discussing two cases: in the first, one firm chooses another strategy while all other firms stick with the feedback–Nash strategy with $d(Z)$ in their profit function; in the second, all firms disregard the damage in their profit. In the first case, in which all other firms choose the feedback–Nash strategy ($q^*$ given in Equation (11)), one firm deviating to another strategy (denoted by $\tilde q$ with $\tilde q \neq q^*$) would not obtain a higher value, which is immediate because the equilibrium we characterize is subgame-perfect (i.e., $V_i(q_i^*, q_{-i}^*) \geq V_i(\tilde q_i, q_{-i}^*)$, where $q_{-i}^*$ denotes the strategies of all firms except firm $i$). In the second case, suppose that all firms disregard the damage in their profit and play the symmetric equilibrium strategy of the game without pollution, which is given in [11]. Then one firm can deviate and make larger profits by taking $d(Z)$ into account, since the no-pollution strategy is not a maximizer of the problem with $d(Z)$; thus, it would not be an equilibrium. For these reasons, it is beneficial for firms to take into account the costs arising from accumulated pollution in their decision-making process.
We can now discuss the roles that Assumptions 1 and 2 play in the characterization of the equilibrium. The first part of Assumption 1, which refers to the case $\delta < r + k$, ensures that the asset quantity $S_1 > 0$. In addition, having an increasing no-exploitation boundary (see Figure 2a) leads to the result that the equilibrium behavior is not to exploit the asset for too small values of asset stock. In the second part of Assumption 1 ($\delta > r + k$), ensuring $S_1 > 0$ is not sufficient to characterize a feedback–Nash equilibrium. The reason is that the no-exploitation boundary is decreasing (see Figure 2b), and if $S_3 < 0$, then there may exist equilibrium trajectories leading to asset exhaustion. Hence, we modify the condition to ensure $S_3 > 0$. Then, in both cases, Assumption 1 guarantees that the equilibrium behavior around the asset level $S = 0$ is not to exploit the asset, which makes the asset growth positive ($\dot S > 0$) in the neighborhood of the $S = 0$ axis for all levels of pollution; thus, asset exhaustion never occurs in equilibrium. Furthermore, Assumption 2 ensures that the value function is continuously differentiable for all $(S, Z)$, particularly at $S = S_y = S_{max}/2$, where the asset growth function in (2) is not continuously differentiable. Also note that, for a given parameter calibration that satisfies Assumption 1, the carrying capacity ($S_{max}$) can be set such that Assumption 2 is satisfied as well.
Lastly, we turn to the case where the equilibrium strategy is to not exploit the asset. To characterize the value function in region $R_0$ given in (21), we use the fact that, in this region, $q_i = 0$ for all $i \in \{1, \dots, n\}$; thus, the asset stock grows at rate $\delta$ and the pollution stock declines at rate $k$ without any intervention. At a certain date $\hat t$, depending on the initial state $(S(0), Z(0))$, the pair $(S(t), Z(t))$ reaches one of the boundaries at which firms begin exploitation. The value of the game starting at a point in $R_0$ depends on the boundary point associated with that starting point, which can be computed. The following steps are taken in Appendix A.3:
(i) 
We define an implicit function denoted by $(\hat S(S,Z), \hat Z(S,Z))$, written in the system of equations in (31), which yields the point (and the date) at which the firms launch their exploitation for $(S,Z) \in R_0$.
(ii) 
Using this function, we obtain the curve denoted by $Z = \Psi(S)$ given in (34), associated with the intersection point of the three regions $(S_3, Z_3)$, where $W(S_3, Z_3) = \bar V(Z_3)$.
(iii) 
This curve enables us to partition region $R_0$ into two parts, denoted by $R_0^{W}$ and $R_0^{\bar V}$ and given in (32) and (33), such that the boundary for launching exploitation is known for a given $(S,Z) \in R_0$.
Having obtained the boundary point associated with every point $(S,Z) \in R_0$, in Lemma A3, we find $V_0(S,Z)$ that satisfies the HJB equation with $q_i^* = 0$, $\forall i$, and show its continuity. The function $(\hat S(\cdot), \hat Z(\cdot))$ does not have an analytical solution for $(S,Z) \in R_0^{W}$, except in the special cases $\delta = k$ and $\delta = r + k$ (see Remark A2 in Appendix A.3); nevertheless, the characterization of $V_0(S,Z)$ remains tractable. For any other parameter setting, this function has to be computed numerically in order to obtain the value of a point in $R_0^{W}$. In the other partition, $R_0^{\bar V}$, the value function has an analytical form.
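The three steps above can be sketched numerically. The following is a minimal illustration, not the authors' implementation. It assumes the no-exploitation dynamics $S(t) = S e^{\delta t}$ and $Z(t) = Z e^{-kt}$, a linear boundary $Z_{0S}(S) = -(a + c_0 + c_S S)/c_Z$, and a partition curve of the form $\Psi(S) = Z_3 (S_3/S)^{k/\delta}$ passing through $(S_3, Z_3)$. All numerical values and function names are hypothetical and chosen for the case $\delta < r + k$; they are not computed from Equations (12)–(16).

```python
import math

# Hypothetical parameters (illustration only; not calibrated to the paper).
DELTA, K = 0.5, 0.3                      # asset growth rate, pollution decay rate
A_PLUS_C0, C_S, C_Z = -2.0, 4.0, -1.0    # a + c_0, c_S, c_Z (with c_Z < 0)
S3, Z3 = 1.0, 2.0                        # intersection point of the three regions


def z_boundary(s):
    """No-exploitation boundary of R_S: the line on which q* = 0."""
    return -(A_PLUS_C0 + C_S * s) / C_Z


def psi(s):
    """Initial states whose uncontrolled path passes through (S3, Z3)."""
    return Z3 * (S3 / s) ** (K / DELTA)


def hitting_point(s, z, t_max=50.0, tol=1e-12):
    """Return (s_hat, z_hat, t_hat) for a state (s, z) assumed to lie in R_0."""
    if z >= psi(s):
        # The path reaches the horizontal boundary Z = Z3 first: closed form.
        t_hat = math.log(z / Z3) / K
        return s * (z / Z3) ** (DELTA / K), Z3, t_hat
    # Otherwise solve Z e^{-k t} = Z_0S(S e^{delta t}) for t by bisection.
    gap = lambda t: z * math.exp(-K * t) - z_boundary(s * math.exp(DELTA * t))
    lo, hi = 0.0, t_max
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    t_hat = 0.5 * (lo + hi)
    return s * math.exp(DELTA * t_hat), z * math.exp(-K * t_hat), t_hat
```

For example, `hitting_point(0.4, 0.5)` follows the bisection branch and returns a point on the sloped boundary, while `hitting_point(2.0, 4.0)` uses the closed form and lands on $Z = Z_3$. The bisection assumes that the gap between the path and the boundary changes sign exactly once on $[0, t_{max}]$, which holds for the increasing boundary used here.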

3.2. The Equilibrium Dynamics, Steady States, and Stability

In order to analyze the stable steady states of the equilibrium, we derive the sets of points such that $\dot S(t) = 0$ and $\dot Z(t) = 0$, respectively. Then, we obtain the steady states and study their stability by using the methods provided in [42]. The results are shown in the following theorem:
Theorem 2.
Under Assumptions 1 and 2, the steady state(s) may be single or multiple depending on the parameters.
(i) 
The steady state in region R S is denoted by ξ S = ( s S , z S ) and written as follows:
s S = k n a + c 0 b δ k ( n + 1 ) n α δ c Z + k c S , = 2 a k b δ 2 k n 2 + 1 + n 2 + 3 r + 2 b n 2 + 1 r ( k + r ) + δ λ ( n 1 ) δ ( b ( n + 1 ) r + λ ) b δ ( 2 k ( n 1 ) + r ( n + 1 ) ) b ( n + 1 ) r 2 λ ( δ r ) ,
z S = α δ n a + c 0 n α δ c Z + k c S b δ k ( n + 1 ) , = 2 a α b δ 2 k n 2 + 1 + n 2 + 3 r 2 b n 2 + 1 r ( k + r ) δ λ ( n 1 ) ( b ( n + 1 ) r + λ ) b δ ( 2 k ( n 1 ) + r ( n + 1 ) ) + b ( n + 1 ) r 2 + λ ( δ r ) .
The steady state $\xi_S$ always exists, and it is asymptotically stable for $n \geq 2$.
(ii) 
The steady states in region R A are denoted by ξ A 1 = ( s A 1 , z A 1 ) and ξ A 2 = ( s A 2 , z A 2 ) and written as follows:
s A 1 = k n ( a + c ¯ 0 ) δ ( b k ( n + 1 ) n α c ¯ Z ) , = 2 a k b 2 k n 2 + 1 + 3 n 2 r + r + λ ( n 1 ) δ ( b ( n + 1 ) r + λ ) ( 2 b k ( n 1 ) b ( n + 1 ) r + λ ) ,
s A 2 = S m a x k n ( a + c ¯ 0 ) δ ( b k ( n + 1 ) n α c ¯ Z ) , = S m a x 2 a k b 2 k n 2 + 1 + 3 n 2 r + r + λ ( n 1 ) δ ( b ( n + 1 ) r + λ ) ( 2 b k ( n 1 ) b ( n + 1 ) r + λ ) ,
z A 1 = z A 2 = Z 0 A = α n a + c ¯ 0 b k ( n + 1 ) α n c ¯ Z , = 2 a α b 2 k n 2 + 1 + 3 n 2 + 1 r + λ ( n 1 ) ( b ( n + 1 ) r + λ ) ( b ( 2 k ( n 1 ) ( n + 1 ) r ) + λ ) ,
The existence of steady states ξ A 1 and ξ A 2 depends on the following condition:
$$\frac{S_{max}}{2}\,\frac{b\delta(n+1)}{n\bar c_Z} - \frac{a + \bar c_0}{\bar c_Z} < \frac{\alpha n (a + \bar c_0)}{b k (n+1) - \alpha n \bar c_Z}. \tag{45}$$
  • If (45) is true, then the point ξ A 1 is unstable, whereas ξ A 2 is stable. In that case, there are two locally asymptotically stable steady states ( ξ S , ξ A 2 ) , and the equilibrium to which a game converges depends on its initial state ( S ( 0 ) , Z ( 0 ) ) .
  • If (45) is not true, then ξ S is the unique steady state which is asymptotically stable.
Proof. 
See Appendix B. □
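As a numerical illustration of Theorem 2, the sketch below evaluates the coefficient form of the steady states and checks that they are rest points. It assumes state dynamics of the form $\dot S = f(S) - nq$ and $\dot Z = \alpha n q - kZ$, with the tent-shaped growth $f(S) = \delta S$ for $S \le S_{max}/2$ and $f(S) = \delta(S_{max} - S)$ otherwise, and a linear strategy $q = (a + c_0 + c_S S + c_Z Z)/(b(n+1))$. These forms are inferred from the steady-state expressions rather than quoted from Equations (2), (3), and (11), and the coefficient values are hypothetical, not computed from (12)–(16).

```python
def steady_state_interior(a, b, n, alpha, delta, k, c0, cS, cZ):
    """Coefficient form of xi_S = (s_S, z_S) on the branch f(S) = delta * S."""
    denom = b * delta * k * (n + 1) - n * (alpha * delta * cZ + k * cS)
    s = k * n * (a + c0) / denom
    z = alpha * delta * n * (a + c0) / denom
    return s, z


def steady_states_flat(a, b, n, alpha, delta, k, c0_bar, cZ_bar, s_max):
    """Coefficient form of xi_A1 and xi_A2 when the strategy is independent of S."""
    denom = b * k * (n + 1) - n * alpha * cZ_bar
    z = alpha * n * (a + c0_bar) / denom
    s1 = k * n * (a + c0_bar) / (delta * denom)
    return (s1, z), (s_max - s1, z)


def residuals(s, z, q, n, alpha, delta, k, s_max):
    """Return (dS/dt, dZ/dt) under the assumed dynamics."""
    growth = delta * s if s <= s_max / 2 else delta * (s_max - s)
    return growth - n * q, alpha * n * q - k * z


# Hypothetical numbers for illustration.
P = dict(a=10.0, b=1.0, n=2, alpha=1.0, delta=0.5, k=0.3)
S_MAX = 40.0
s_S, z_S = steady_state_interior(**P, c0=-12.0, cS=4.0, cZ=-1.0)
q_S = (P["a"] - 12.0 + 4.0 * s_S - 1.0 * z_S) / (P["b"] * (P["n"] + 1))
xA1, xA2 = steady_states_flat(**P, c0_bar=-1.0, cZ_bar=-0.5, s_max=S_MAX)
q_A = (P["a"] - 1.0 - 0.5 * xA1[1]) / (P["b"] * (P["n"] + 1))
```

The sketch only checks that the formulas give rest points of the assumed dynamics. It does not check that each point lies in its region ($R_S$ or $R_A$) or that condition (45) holds, both of which Theorem 2 requires.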
Figure 3 illustrates these results for a case in which multiple steady states exist and also shows the positions of the loci $\dot S = 0$ and $\dot Z = 0$.
The cases of unique and multiple steady states are illustrated in the diagrams in Figure 4, which also show the equilibrium trajectories in the $(S, Z)$ plane. These trajectories are obtained by using the differential equation system resulting from substituting $q^*(S,Z)$ in (11) into (2) and (3). For a given initial state, the equilibrium strategy may shift from one form to another as the values of the state variables cross the thresholds between the regions.
The continuity of $V(S,Z)$ on the boundaries ensures a smooth transition between the regions with continuous $q^*(S,Z)$; consequently, the strategies $q_i(t) = q^*(S(t),Z(t))$, $\forall i$, converge to a steady state with $\lim_{t \to +\infty} q^*(t) > 0$ for all initial states and constitute a symmetric feedback–Nash equilibrium.
We now investigate how the equilibrium responses to marginal increases in asset and pollution stocks vary with the changes in the parameter values. In the following proposition, we analyze these comparative statics for the number of players (n), the marginal damage coefficient ( ϕ ), and the intrinsic growth rate of the asset ( δ ) .
Proposition 1.
The partial derivatives of coefficients of the equilibrium strategy given in (11) with respect to the selected parameters have the following signs:
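$$\operatorname{sign}\!\left(\frac{\partial c_S}{\partial n}\right) < 0; \quad \operatorname{sign}\!\left(\frac{\partial c_Z}{\partial n}\right) = \operatorname{sign}(k + r - \delta); \quad \operatorname{sign}\!\left(\frac{\partial \bar c_Z}{\partial n}\right) > 0;$$
$$\operatorname{sign}\!\left(\frac{\partial c_S}{\partial \phi}\right) > 0; \quad \operatorname{sign}\!\left(\frac{\partial c_Z}{\partial \phi}\right) = \operatorname{sign}(\delta - k - r); \quad \operatorname{sign}\!\left(\frac{\partial \bar c_Z}{\partial \phi}\right) < 0;$$
$$\operatorname{sign}\!\left(\frac{\partial c_S}{\partial \delta}\right) > 0; \quad \operatorname{sign}\!\left(\frac{\partial c_Z}{\partial \delta}\right) > 0; \quad \operatorname{sign}\!\left(\frac{\partial \bar c_Z}{\partial \delta}\right) = 0.$$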
Proof. 
See Appendix C. □
The results of Proposition 1 can be summarized as follows: the equilibrium response to an increase in asset stock is lower if the number of players is higher, and the response to an increase in pollution stock depends on $\operatorname{sign}(k + r - \delta)$. Recall that $\operatorname{sign}(c_Z) = \operatorname{sign}(\delta - k - r)$; then, with a higher number of players, the equilibrium response to higher pollution is to decrease exploitation less for $\delta < k + r$ and to increase exploitation less for $\delta > k + r$. The results are reversed for the marginal damage parameter. When the growth rate of the asset is higher, the response to an increase in asset stock is higher and the response to an increase in pollution stock is also higher, which is in contrast with the effects of the other parameters.
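The signs in Proposition 1 can be checked numerically by finite differences. The sketch below is an illustration under stated assumptions: the coefficient formulas are obtained here by solving the reduced system (A5)–(A6) of Appendix A.2 and selecting the roots that vanish as $\phi \to 0$, and they are assumed, not confirmed, to coincide with Equations (12)–(16). The number of players $n$ is treated as a continuous variable, and all parameter values and function names are hypothetical.

```python
import math


def coefficients(b, n, alpha, phi, k, r, delta):
    """Return (c_S, c_Z, c_Z_bar): roots of the reduced system that vanish as phi -> 0."""
    lam = math.sqrt((b * (n + 1) * (2 * k + r)) ** 2 + 8 * n**2 * alpha**2 * b * phi)
    cZ_bar = (n + 1) * (b * (n + 1) * (2 * k + r) - lam) / (4 * alpha * n**2)
    cZ = (k + r - delta) / (k + delta) * cZ_bar
    cS = (2 * delta - r) * (b * (n + 1) ** 2 / (2 * n**2) - alpha * cZ_bar / (k + delta))
    return cS, cZ, cZ_bar


def partials(base, name, h=1e-6):
    """Central finite differences of (c_S, c_Z, c_Z_bar) in one parameter."""
    up = coefficients(**{**base, name: base[name] + h})
    dn = coefficients(**{**base, name: base[name] - h})
    return tuple((u - d) / (2 * h) for u, d in zip(up, dn))


# Hypothetical calibration with delta < k + r (slow-growing asset).
base = dict(b=1.0, n=3.0, alpha=1.0, phi=0.1, k=0.3, r=0.05, delta=0.2)
d_n, d_phi, d_delta = (partials(base, p) for p in ("n", "phi", "delta"))
```

Each of `d_n`, `d_phi`, and `d_delta` holds the three partial derivatives in the order $(c_S, c_Z, \bar c_Z)$, so their signs can be compared with the proposition row by row. Changing `delta` to a value above `k + r` flips the signs of the $c_Z$ entries for $n$ and $\phi$, in line with the $\operatorname{sign}(k + r - \delta)$ terms.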
Another example in which the qualitative properties differ is the case of a monopoly. The following proposition presents the result in this case:
Proposition 2.
Under Assumptions 1 and 2, for a monopoly ( n = 1 ), the optimal solution includes only one stable steady state, which is in region R A .
Proof. 
See Appendix D. □
Proposition 2 shows that, in the monopoly case, the steady state $\xi_S$ lies on the boundary $Z = Z_{SA}(S)$ and coincides with the unstable steady state in $R_A$, i.e., $\xi_S = \xi_{A1}$. Hence, for a monopoly, this point is unstable but it is sustainable. The analysis regarding the stability of the other steady state in $R_A$ ($\xi_{A2}$) remains valid, and thus, $\xi_{A2}$ is the only steady state that is asymptotically stable. Therefore, for a monopoly, the optimal solution includes two steady states, one of them stable and the other one unstable but sustainable.

4. Concluding Remarks

We characterized the symmetric feedback–Nash equilibrium and showed its existence within a certain range of model parameters given in Assumptions 1 and 2. The equilibrium path always reaches a steady state that is sustainable. For a set of parameters outside this range, there may still exist local equilibria for some levels of asset–pollution stock pairs.
The framework we present includes various simplifications and abstractions, which made the characterization of the equilibrium more conveniently tractable. We introduced the pollution externalities in a simple way in order to guarantee that variations in the equilibrium results between the outcomes with and without pollution externalities could be studied through a single exogenous parameter.
The methodology used to characterize the equilibrium can be applied in problems involving similar features by considering different objective functions and state dynamics. Some examples are the issues relating to the open-access fisheries shared by multiple countries, analysis of cooperation and stability of coalitions, welfare analysis, spillover effects, and the interactions between other possible state variables and the asset stock.

Author Contributions

Conceptualization, N.B.V. and G.Z.; methodology, N.B.V. and G.Z.; formal analysis, N.B.V.; writing—original draft preparation, N.B.V.; writing—review and editing, N.B.V. and G.Z.; visualization, N.B.V.; supervision, G.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Social Sciences and Humanities Research Council, Canada, grant 435-2013-0532.

Acknowledgments

We wish to thank three anonymous reviewers for their very helpful comments and suggestions which improved the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 1

Appendix A.1. Some Preliminaries

A symmetric equilibrium exists if there exists a function $V_i(S,Z) = V(S,Z)$, $\forall i$, that is continuous and continuously differentiable in $S$ and $Z$, which satisfies the HJB equation in (5), the first-order condition in (7), and the terminal condition $\lim_{t \to +\infty} e^{-rt} V(S(t),Z(t)) = 0$ for every admissible trajectory (see [40,41]). Due to the nonnegativity restriction on $q_i$ in problem (4), we consider the two following cases:
$q^*(S,Z) > 0$ and $q^*(S,Z) = 0$.
Moreover, the constraint $q_i(t) \leq M S(t)$ in (4) guarantees that the asset stock $S$ cannot be negative.
In order to characterize an equilibrium strategy $q^*(S,Z)$ that is defined for the whole state space $(S,Z) \in \mathbb{R}_+^2$ and converges to a stable steady state with $\lim_{t \to +\infty} q^*(S(t),Z(t)) > 0$, we look for the parameter constellations under which the following two conditions are satisfied:
  • $q^*(0,Z) = 0$, $\forall Z \geq 0$, which is to ensure that asset exhaustion never occurs and there exist steady state(s) with positive asset level.
  • $\partial V(S,Z)/\partial S = 0$, $\forall S \geq S_y$, which is to ensure the continuity of $V(S,Z)$ on $S = S_y$, where the asset growth function $f(S_y)$ is not continuously differentiable.
More specifically,
  • if $q^*(0,Z) = 0$, $\forall Z \geq 0$, is not satisfied, then $\exists Z \geq 0 \mid q^*(0,Z) > 0$, and then there may exist an initial state $(S(0),Z(0))$ such that $\lim_{t \to +\infty} (S(t),Z(t)) = (0,0)$ and $\lim_{t \to +\infty} q^*(S(t),Z(t)) = 0$; then, $q^*(S,Z)$ would not be a feedback–Nash equilibrium that includes stable steady state(s) with positive asset levels.
  • if ($\partial V(S,Z)/\partial S = 0$, $\forall S \geq S_y$) is not satisfied, then the discontinuity point $f(S_y)$ given in (2) is included in the case $q^*(S,Z) > 0$ and $\partial V(S,Z)/\partial S > 0$. For this reason, we cannot find a function that is continuously differentiable in $S$, and $q^*(S,Z)$ does not satisfy the HJB equation for $S \geq S_y$; thus, in that case, $V(S,Z)$ together with $q^*(S,Z)$ do not fulfill any currently known sufficient condition.
In the following subsections, we use the methods developed in the literature to study the cases given in Definition 1. We focus on linear strategies, obtain the function V ( S , Z ) , and analyze a number of closed-form formulas to identify the restrictions on the model parameters under which the conditions discussed above are satisfied, which allows us to characterize the equilibrium.

Appendix A.2. Case with Positive Exploitation ($q^*(S,Z) > 0$)

Since the model is linear-quadratic, we make the informed guess that, within an interior solution ( q * ( S , Z ) > 0 ), the value function is a polynomial of degree 2 in S and Z. We consider the function W ( S , Z ) given in Equation (23).
The maximized HJB equation is obtained by replacing q * ( S , Z ) in (7) into (5). Using W ( S , Z ) results in an equation that is a polynomial of degree 2 in S and Z. We then apply the method of undetermined coefficients (see [41]) by identification, and after simplifications, the system of equations in ( A , B , C , D , E , F ) is written as follows:
$$A = \frac{(a + \alpha E - C)\left(a + n^2(\alpha E - C)\right)}{b(n+1)^2 r}, \qquad B = \frac{2n^2(\alpha F - B)^2}{b(n+1)^2(r - 2\delta)}, \tag{A1}$$
$$C = \frac{(\alpha F - B)\left(a(n^2+1) + 2n^2(\alpha E - C)\right)}{b(n+1)^2(r - \delta)}, \qquad D = \frac{2n^2(\alpha D - F)^2 - b(n+1)^2\phi}{b(n+1)^2(2k + r)}, \tag{A2}$$
$$E = \frac{(\alpha D - F)\left(a(n^2+1) + 2n^2(\alpha E - C)\right)}{b(n+1)^2(k + r)}, \qquad F = \frac{2n^2(\alpha D - F)(\alpha F - B)}{b(n+1)^2(k + r - \delta)}. \tag{A3}$$
Introduce the following changes of variables:
$$c_0 = \alpha E - C, \qquad c_S = \alpha F - B, \qquad c_Z = \alpha D - F. \tag{A4}$$
Replacing (A4) into the equations for C and E, the term c 0 is written as a function of c S and c Z , and is given in (14). Furthermore, using the equations for B , D , and F in (A1)–(A3) with (A4) enables us to write the system of equations for c S and c Z as follows:
$$c_S = \alpha\,\frac{2n^2 c_Z c_S}{b(n+1)^2(k + r - \delta)} - \frac{2n^2 c_S^2}{b(n+1)^2(r - 2\delta)}, \tag{A5}$$
$$c_Z = \alpha\,\frac{2n^2 c_Z^2 - b(n+1)^2\phi}{b(n+1)^2(2k + r)} - \frac{2n^2 c_Z c_S}{b(n+1)^2(k + r - \delta)}. \tag{A6}$$
Hence, we reduced the six-dimensional equation system in (A1)–(A3) to a system of two equations in $c_S$ and $c_Z$, which contain polynomials of degree 2. Equation (A5) has two solutions for $c_S$, i.e., $c_S = \frac{1}{2}(2\delta - r)\left(\frac{b(n+1)^2}{n^2} + \frac{2\alpha c_Z}{\delta - k - r}\right)$ and $c_S = 0$. Inserting these values into (A6) yields four solutions to the system in (A5) and (A6):
  • Solution A is written in Equations (12) and (13).
  • Solution B is written in Equations (15) and (16).
  • Solution A′ and Solution B′, which are denoted by $(c_S', c_Z')$ and $(\bar c_S', \bar c_Z')$, are written exactly as in (12), (13), (15), and (16) by inverting the sign of $\lambda$, i.e.,
    $\lambda_{A'} = \lambda_{B'} = -\sqrt{\left(b(n+1)(2k+r)\right)^2 + 8n^2\alpha^2 b\phi}$.
The solutions given above consist only of the exogenously given model parameters. Therefore, all solutions for W ( S , Z ) can be obtained by replacing (A4) in (A1)–(A3) and by inserting the solutions given in (12)–(16).
In solutions A and A′, $\partial W(S,Z)/\partial S \neq 0$, $\forall \delta \neq r/2$; thus, they are candidates for the case $q^* > 0$ and $\partial V(S,Z)/\partial S > 0$. In solutions B and B′, we have $\partial W(S,Z)/\partial S = 0$, and they are candidates for the case $q^* > 0$ and $\partial V(S,Z)/\partial S = 0$. Therefore, we consider solutions A and A′ for the function $W(S,Z)$ given in (23), and for solutions B and B′, where $\bar c_S = \bar c_S' = 0$, we define the function denoted by $\bar V(Z)$, which is a polynomial of degree 2 in $Z$, written in (27). It can be verified that both functions $W(S,Z)$ and $\bar V(Z)$ satisfy the HJB equation in (5) for all choices of solutions.
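The four roots of the reduced system can be checked numerically. The sketch below is a minimal illustration under stated assumptions: the closed-form roots are derived here by substituting the two solutions of (A5) into (A6), with the primed solutions obtained by flipping the sign of $\lambda$. They are assumed, not confirmed, to coincide with Equations (12)–(16), which are not reproduced in this appendix. Function names and parameter values are hypothetical.

```python
import math


def reduced_system_solutions(b, n, alpha, phi, k, r, delta):
    """Return the four (c_S, c_Z) roots of (A5)-(A6), keyed A, B, A', B'."""
    out = {}
    root = math.sqrt((b * (n + 1) * (2 * k + r)) ** 2 + 8 * n**2 * alpha**2 * b * phi)
    for tag, lam in (("", root), ("'", -root)):
        # Root of (A6) when c_S = 0 (solutions B and B').
        cZ_bar = (n + 1) * (b * (n + 1) * (2 * k + r) - lam) / (4 * alpha * n**2)
        # Roots with c_S != 0 (solutions A and A').
        cZ = (k + r - delta) / (k + delta) * cZ_bar
        cS = (2 * delta - r) * (b * (n + 1) ** 2 / (2 * n**2) - alpha * cZ_bar / (k + delta))
        out["A" + tag] = (cS, cZ)
        out["B" + tag] = (0.0, cZ_bar)
    return out


def residual(cS, cZ, b, n, alpha, phi, k, r, delta):
    """Residuals of (A5) and (A6), written as left side minus right side."""
    K2 = b * (n + 1) ** 2
    F = 2 * n**2 * cZ * cS / (K2 * (k + r - delta))
    B = 2 * n**2 * cS**2 / (K2 * (r - 2 * delta))
    D = (2 * n**2 * cZ**2 - K2 * phi) / (K2 * (2 * k + r))
    return cS - (alpha * F - B), cZ - (alpha * D - F)


params = dict(b=1.0, n=2, alpha=0.5, phi=0.05, k=0.3, r=0.05, delta=0.2)
solutions = reduced_system_solutions(**params)
```

Calling `residual(*solutions["A"], **params)` should return two values that are zero up to rounding, and likewise for the other three keys. The sketch requires $\delta \neq r/2$ and $\delta \neq k + r$, where the denominators in (A5) and (A6) vanish.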
Remark A1.
In the limit case where $\phi \to 0$, the problem reduces to a game with one state variable ($S$). Solution A reduces to $c_Z = 0$, $c_S = \frac{(2\delta - r)\,b(1+n)^2}{2n^2}$, and $c_0 = \frac{a(1+n^2)}{2n^2\delta}(r - 2\delta)$, which results in $D = E = F = 0$, and $B = -c_S = \frac{(r - 2\delta)\,b(1+n)^2}{2n^2}$, $C = -c_0 = \frac{a(1+n^2)}{2n^2\delta}(2\delta - r)$, $A = \frac{a^2\left(r(1+n^2) - 2\delta\right)\left(r(1+n^2) - 2n^2\delta\right)}{4brn^2(1+n)^2\delta^2}$. Solution B vanishes with $\bar c_S = \bar c_Z = \bar c_0 = 0$, which leads to the equilibrium outcome of the static Cournot oligopoly, i.e., $\bar V(Z) = \frac{a^2}{b(1+n)^2 r}$ and $q^* = \frac{a}{b(1+n)}$. These outcomes are identical to their corresponding terms in the solution provided in [11].
The equilibrium strategy is written by using (7) and (A4) with $V(S,Z) = W(S,Z)$:
$$q^*(S,Z) = \frac{a + c_0 + c_S S + c_Z Z}{b(n+1)}, \tag{A7}$$
which is linear in $S$ and $Z$. Note that, by using (A4), the RHS of (6) can be written as $MC(S,Z) = -(c_0 + c_S S + c_Z Z)$; hence, these terms correspond to the coefficients of the marginal cost function.
In the following sections, we use the functions W ( S , Z ) and V ¯ ( Z ) to study the two cases where V / S > 0 and V / S 0 .

Appendix A.2.1. Case with $q^*(S,Z) > 0$ and $\partial V(S,Z)/\partial S > 0$

We first obtain the analytical formulation of $R_S$. For $q^*(S,Z) > 0$, from (A7), we obtain $a > -c_0 - c_S S - c_Z Z$, and for $\partial W(S,Z)/\partial S > 0$, we use (A1) to (A4). Since $c_S > 0$ in all cases for $\delta > r/2$, writing both inequalities in $S$ enables us to obtain the following region and its boundaries:
(i) 
$q^*(S,Z) > 0$ and $\partial V(S,Z)/\partial S > 0$ for all $(S,Z) \in R_S$, where $R_S$ is given in (19); the boundary line associated with the first inequality, which is denoted by $Z = Z_{0S}(S)$, is written in (37); and the sign of its slope is given by $\operatorname{sign}\!\left(\frac{dZ_{0S}(S)}{dS}\right) = \operatorname{sign}\!\left(-\frac{c_S}{c_Z}\right)$.
  • if $r/2 < \delta < r + k$, then $\frac{dZ_{0S}(S)}{dS} > 0$;
  • if $\delta = r + k$, Equation (37) reduces to $S = -\frac{a + c_0}{c_S}$; and
  • if $\delta > r + k$, then $\frac{dZ_{0S}(S)}{dS} < 0$.
(ii) 
The set of points such that q * ( S , Z ) = 0 is defined as follows:
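$$q^*(S,Z) = 0 \quad \text{if} \quad (S,Z) \in \left\{ (S,Z) \;\middle|\; S \leq -\frac{a + c_0}{c_S} - \frac{c_Z}{c_S} Z \right\}, \tag{A8}$$
which is obtained by using $a \leq \frac{\partial W(S,Z)}{\partial S} - \alpha \frac{\partial W(S,Z)}{\partial Z}$.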
(iii) 
The set of points such that $\partial W(S,Z)/\partial S = 0$ is given by the linear function in $S$ written in Equation (38), and the sign of its slope is given by $\operatorname{sign}\!\left(\frac{dZ_{SA}(S)}{dS}\right) = \operatorname{sign}\!\left(-\frac{(\delta - r - k)}{(2\delta - r)}\frac{c_S}{c_Z}\right)$. Since $\operatorname{sign}(c_Z) = \operatorname{sign}(\delta - r - k)$ and $c_S > 0$, we have $\frac{dZ_{SA}(S)}{dS} < 0$ in all cases where $\delta > r/2$.
We now calculate the points at which the linear functions obtained for the boundaries intersect with the $Z = 0$ axis and with each other, i.e., $Z_{0S}(S) = 0$, $Z_{SA}(S) = 0$, and $Z = Z_{0S}(S) = Z_{SA}(S)$. These formulas are first written in terms of $(c_0, c_S, c_Z)$, and then, after inserting the solutions in (12)–(16), they are written in closed form, using $\lambda$ in (18) to study their signs.
(i) 
$(S_1, 0)$: $Z_{0S}(S_1) = 0$ and $q^*(S,0) = 0$, $\forall S \leq S_1$, where
$$S_1 = -\frac{a + c_0}{c_S} = \frac{2a(k+\delta)\left[2br\delta + (1+n^2)\,b\left(\delta(2k+r) - 2r(k+r)\right) - (n-1)\delta\lambda\right]}{\delta(2\delta - r)\left(\lambda + b(1+n)(2\delta - r)\right)\left(\lambda + b(1+n)r\right)}. \tag{A9}$$
(ii) 
$(S_2, 0)$: $Z_{SA}(S_2) = 0$, where
$$S_2 = -\frac{(2\delta - r)\left(a(1+n^2) + 2n^2 c_0\right)}{2n^2(\delta - r)\,c_S} = \frac{4ab(1+n^2)(k+r)(k+\delta)}{\delta\left(\lambda + b(1+n)r\right)\left(\lambda + b(1+n)(2\delta - r)\right)} > 0 \quad \text{if } \delta > r/2. \tag{A10}$$
(iii) 
$(S_3, Z_3)$: $Z_{0S}(S_3) = Z_{SA}(S_3) = Z_3$ is written in Equations (35) and (36); note that $Z_3 > 0$.
By using (A9) and (A10), the difference $S_2 - S_1$ is given by
$$S_2 - S_1 = \frac{2a\left[b\left(2k(1+n^2) + r(1+3n^2)\right) + (n-1)\lambda\right](k+\delta)}{(2\delta - r)\left(\lambda + b(1+n)r\right)\left(\lambda + b(1+n)(2\delta - r)\right)} > 0 \quad \text{if } \delta > r/2. \tag{A11}$$
Thus, for $\delta > r/2$, $Z = Z_{SA}(S)$ intersects the $Z = 0$ axis at $S_2 > 0$, and $S_2 > S_1$. By using (A9), we obtain the conditions under which $S_1 > 0$:
$$\delta > \frac{r(1+n^2)}{2}, \tag{A12}$$
$$\phi < \phi_1, \tag{A13}$$
where $\phi_1$ is given in (8).
For $\delta \leq r + k$, $S_3 > 0$ if (A12) and (A13) are satisfied, and for $\delta > r + k$, $S_3 > 0$ if
$$\delta < k\,\frac{n^2 + 1}{n^2 - 1}, \tag{A14}$$
$$\phi < \phi_2, \tag{A15}$$
where $\phi_2$ is given in (9); note that this condition is stricter than the one in (8) (i.e., $\phi_2 < \phi_1$).
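The closed forms (A9) and (A10) can be cross-checked against their coefficient forms. The sketch below is illustrative only. It assumes that $c_S$ and $c_Z$ are the roots of (A5)–(A6) that vanish as $\phi \to 0$, and that $c_0$ follows from combining the equations for $C$ and $E$ in (A2)–(A3) with (A4); these are derived here and assumed to agree with Equations (12)–(14). The function name and parameter values are hypothetical.

```python
import math


def thresholds(a, b, n, alpha, phi, k, r, delta):
    """Return ((S1, S2) from coefficients, (S1, S2) from the closed forms in lambda)."""
    lam = math.sqrt((b * (n + 1) * (2 * k + r)) ** 2 + 8 * n**2 * alpha**2 * b * phi)
    K2 = b * (n + 1) ** 2
    cZ_bar = (n + 1) * (b * (n + 1) * (2 * k + r) - lam) / (4 * alpha * n**2)
    cZ = (k + r - delta) / (k + delta) * cZ_bar
    cS = (2 * delta - r) * (K2 / (2 * n**2) - alpha * cZ_bar / (k + delta))
    # c_0 solves c_0 = alpha * E - C, with C and E taken from (A2) and (A3).
    H = alpha * cZ / (K2 * (k + r)) - cS / (K2 * (r - delta))
    c0 = a * (n**2 + 1) * H / (1 - 2 * n**2 * H)
    s1_coef = -(a + c0) / cS
    s2_coef = (-(2 * delta - r) * (a * (1 + n**2) + 2 * n**2 * c0)
               / (2 * n**2 * (delta - r) * cS))
    den = (lam + b * (1 + n) * r) * (lam + b * (1 + n) * (2 * delta - r))
    s1_closed = (2 * a * (k + delta)
                 * (2 * b * r * delta
                    + (1 + n**2) * b * (delta * (2 * k + r) - 2 * r * (k + r))
                    - (n - 1) * delta * lam)
                 / (delta * (2 * delta - r) * den))
    s2_closed = 4 * a * b * (1 + n**2) * (k + r) * (k + delta) / (delta * den)
    return (s1_coef, s2_coef), (s1_closed, s2_closed)


coef, closed = thresholds(a=10.0, b=1.0, n=2, alpha=0.5, phi=0.05,
                          k=0.3, r=0.05, delta=0.2)
```

With this hypothetical calibration, condition (A12) holds ($\delta = 0.2 > r(1+n^2)/2 = 0.125$), and both routes give the same pair with $0 < S_1 < S_2$. The sketch requires $\delta \neq r$, $\delta \neq r/2$, and $\delta \neq k + r$, where denominators vanish.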
Lemma A1.
Suppose that Assumptions 1 and 2 are satisfied. For all $(S,Z) \in R_S$, the function $V(S,Z) = W(S,Z)$ satisfies the HJB Equation (5), and the strategies $q_i = q^*(S,Z)$, $\forall i$, given in (11) satisfy (7) with $q^*(S,Z) > 0$ and $\partial V(S,Z)/\partial S > 0$.
Proof. 
The sign analysis conducted in (A9) to (A15) shows that, if the conditions given in Assumptions 1 and 2 are satisfied, then we have $0 < S_1 < S_2 < S_y$ and $S_3 > 0$, $Z_3 > 0$. We study the slopes of the two boundary lines of $R_S$ for each case. The slope of $Z = Z_{0S}(S)$ depends on $\operatorname{sign}(-c_S/c_Z)$, and the slope $dZ_{SA}(S)/dS < 0$ if $\delta > r/2$; then, the three cases are written as follows:
Case 1: 
$\delta < r + k$: In this case, $dZ_{0S}(S)/dS > 0$ and $dZ_{SA}(S)/dS < 0$. If $S_1 > 0$ ((A12) and (A13)), then $q^*(S,Z) = 0$, $\forall (S,Z) \in \{(S,Z) \mid S \leq S_1\}$. In addition, if $S_2 < S_y$, then $\{(S,Z) \in R_S\} \subset \{(S,Z) \mid S \leq S_2\}$ (see Figure 1 and Figure 2a). The function $W(S,Z)$ satisfies the HJB Equation (5) and condition (7) with $q^*(S,Z) > 0$, $\forall (S,Z) \in R_S$.
Case 2: 
$\delta = r + k$: In this case, $c_Z = 0$ and the boundary $q^*(S,Z) = 0$ reduces to $(S_1, Z)$, $\forall Z \leq Z_3$. We have the same result as in the previous case. If $S_1 > 0$ and $S_2 < S_y$, then the function $W(S,Z)$ satisfies the HJB Equation (5) and condition (7) with $q^*(S,Z) > 0$, $\forall (S,Z) \in R_S$.
Case 3: 
$\delta > r + k$: In this case, $dZ_{0S}(S)/dS < 0$ and $dZ_{SA}(S)/dS < 0$, and hence, both boundary lines decrease in $S$. The difference in their slopes is given by $\frac{dZ_{0S}(S)}{dS} - \frac{dZ_{SA}(S)}{dS} = -\frac{c_S}{c_Z}\frac{(\delta + k)}{(2\delta - r)}$. Since for $\delta > r + k$ we have $c_S > 0$ and $c_Z > 0$, the slopes compare as follows:
$$\frac{dZ_{0S}(S)}{dS} < \frac{dZ_{SA}(S)}{dS} < 0 \quad \text{if } \delta > r/2; \tag{A16}$$
thus, $Z_{0S}(S)$ is steeper than $Z_{SA}(S)$, and $Z_3 > 0$, which can also be seen in (36). If $S_3 > 0$ ((A14) and (A15)) and $S_2 < S_y$ are satisfied, then $q^*(S,Z) = 0$, $\forall S \leq S_3$ and $Z \geq 0$ (see Figure 2b), and the function $W(S,Z)$ satisfies the HJB Equation (5) and condition (7) with $q^*(S,Z) > 0$, $\forall (S,Z) \in R_S$.
 □
Note that, when using (19), we eliminate the possibility of choosing solution A for $\delta < r/2$ (where $c_S < 0$) and solution A′ for $r/2 < \delta < r + k$ (where $c_S' < 0$ and $c_Z' > 0$). In these cases, the inequalities in (19) and (A8) change their directions. In step (A16), we eliminate the possibility of choosing solution A′ for $\delta < r/2$ (where $c_S' > 0$ and $c_Z' > 0$), as it leads to $\frac{dZ_{SA}(S)}{dS} < \frac{dZ_{0S}(S)}{dS} < 0$, which does not allow us to characterize a feedback–Nash equilibrium.

Appendix A.2.2. Case with $q^*(S,Z) > 0$ and $\partial V(S,Z)/\partial S = 0$

We now consider the function $V(S,Z) = \bar V(Z)$ given in (27), and by using $a > -\alpha\, \partial \bar V(Z)/\partial Z$, which is given by $a > -\alpha(\bar D Z + \bar E)$, with (19), (28), and (38), we have
(i) 
$q^*(S,Z) > 0$ and $\partial V(S,Z)/\partial S = 0$ for all $(S,Z) \in R_A$, where $R_A$ is given in (20).
(ii) 
The set of points such that $q^*(S,Z) = 0$ and $\partial V(S,Z)/\partial S = 0$ is written in Equation (39), which is obtained by using $a = -\alpha\, \partial \bar V(Z)/\partial Z$ and (28). The explicit form of $Z_{0A}$ equals the constant $Z_3$ given in (36).
Lemma A2.
Suppose that Assumptions 1 and 2 are satisfied. For $(S,Z) \in R_S \cup R_A$, where
$$V(S,Z) = \begin{cases} W(S,Z) & \text{if } (S,Z) \in R_S, \\ \bar V(Z) & \text{if } (S,Z) \in R_A, \end{cases}$$
the strategies $q_i = q^*$, $\forall i$, given in (11) satisfy the HJB Equation (5) and condition (7) with $q^*(S,Z) > 0$, and the function $V(S,Z)$ is continuously differentiable $\forall (S,Z) \in R_S \cup R_A$.
Proof. 
The functions $W(S,Z)$ and $\bar V(Z)$ are polynomials of degree 2 and thus continuously differentiable in $S$ and $Z$, $\forall (S,Z) \in R_S$ and $\forall (S,Z) \in R_A$, respectively. For continuity of $V(S,Z)$ on the boundary, as $(S,Z) \to (\hat s, \hat z) \mid \hat z = Z_{SA}(\hat s)$:
$$\lim_{(S,Z) \to (\hat s, \hat z)} W(S,Z) = A - \frac{C^2}{2B} + \left(\frac{D}{2} - \frac{F^2}{2B}\right)\hat z^2 + \left(E - \frac{CF}{B}\right)\hat z, \tag{A17}$$
$$\lim_{(S,Z) \to (\hat s, \hat z)} \bar V(Z) = \bar A + \frac{\bar D}{2}\hat z^2 + \bar E \hat z, \tag{A18}$$
where (A17) is obtained by using $S = -C/B - ZF/B$ (where $\partial W(S,Z)/\partial S = 0$). By using (24) to (28), it can be verified that
$$\bar A = A - \frac{C^2}{2B}; \qquad \frac{\bar D}{2} = \frac{D}{2} - \frac{F^2}{2B}; \qquad \bar E = E - \frac{CF}{B}. \tag{A19}$$
Therefore, $\lim_{(S,Z) \to (\hat s, \hat z)} W(S,Z) = \lim_{(S,Z) \to (\hat s, \hat z)} \bar V(Z)$ for $\hat z = Z_{SA}(\hat s)$, and thus, $V(S,Z)$ is continuous on $\{(S,Z) \mid Z = Z_{SA}(S)\}$.
Note that the four solutions to the system in (A5) and (A6) are paired such that (A19) is true for solutions (A and B) and (A′ and B′), while (A19) is not true for (A′ and B) and (A and B′); therefore, in order to have a function that is continuously differentiable on the boundary where $\lim_{(S,Z) \to (\hat s, \hat z)} \partial W(S,Z)/\partial S = 0$, one of these two pairs of solutions must be selected.
By the definition of $Z = Z_{SA}(S)$ given in (38), $\lim_{(S,Z) \to (\hat s, \hat z)} \frac{\partial W(S,Z)}{\partial S} = 0$ and $\frac{\partial \bar V(Z)}{\partial S} = 0$; hence, $\partial V(S,Z)/\partial S$ is continuous on $\{(S,Z) \mid Z = Z_{SA}(S)\}$. Further, $\lim_{(S,Z) \to (\hat s, \hat z)} \frac{\partial W(S,Z)}{\partial Z} = \lim_{(S,Z) \to (\hat s, \hat z)} \frac{\partial \bar V(Z)}{\partial Z}$ since $D - F^2/B = \bar D$ by (A19), and thus, $\partial V(S,Z)/\partial Z$ is continuous on $Z = Z_{SA}(S)$ and $q^*(S,Z)$ is continuous on $Z = Z_{SA}(S)$, which can also be shown by using the solutions given in (12)–(16).
By using Lemma A1, $dZ_{SA}(S)/dS < 0$ in all cases with $S_2 > 0$. If $S_2 < S_y$, then $(S_y, Z) \in R_A$, $\forall Z \in [0, Z_{0A})$. For $(S,Z) \in R_A$, we have $V(S,Z) = \bar V(Z)$, which does not depend on $S$; hence, the point $S = S_y$, where $f(S)$ is not continuously differentiable, does not affect the continuity of $V(S,Z)$. Therefore, the function $V(S,Z)$ is continuously differentiable in $S$ and $Z$ and satisfies the HJB Equation (5) and condition (7) with $q^*(S,Z) > 0$, $\forall (S,Z) \in R_S \cup R_A$. □
To constitute a Nash equilibrium, the strategies are required to be defined for the whole state space; hence, we study the case q * ( S , Z ) = 0 in the next subsection.

Appendix A.3. Case with No Exploitation ($q^*(S,Z) = 0$)

By combining the results in (39) and (A8), we obtain the set of points such that $q^*(S,Z) = 0$ ($R_0$) given in (21). The value function for this region, denoted by $V_0(S,Z)$, must satisfy the HJB equation in (5) with $q_i = 0$ for $i \in \{1, \dots, n\}$, i.e.,
$$r V_0(S,Z) = -\frac{\phi}{2} Z^2 + \delta S \frac{\partial V_0(S,Z)}{\partial S} - k Z \frac{\partial V_0(S,Z)}{\partial Z}. \tag{A20}$$
The above equation is a first-order linear partial differential equation (PDE), and $V_0(S,Z)$ must be continuously differentiable in $R_0$ and on its boundaries. In order to obtain this function, we begin by deriving the boundary point associated with a given point in $R_0$.
Under Assumptions 1 and 2, for $(S(0),Z(0)) \in R_0$, $\exists\, \hat t \geq 0$ such that $Z(\hat t) = Z_0(S(\hat t))$, where $Z_0(S)$ denotes either the boundary between $R_S$ and $R_0$ (i.e., $Z_{0S}(S)$) or the boundary between $R_A$ and $R_0$ (i.e., $Z_{0A}$). This point is denoted by $(\hat s, \hat z) = (S(\hat t), Z(\hat t))$ and found by solving the following system of equations:
$$\hat z = Z(\hat t) = Z e^{-k \hat t}, \qquad \hat s = S(\hat t) = S e^{\delta \hat t}, \tag{A21}$$
$$\text{such that } Z(\hat t) = Z_0(S(\hat t)) \text{ and } \hat t \geq 0, \tag{A22}$$
where (A21) is found by using $q_i = 0$, $\forall i$, which implies $\dot S(t) = \delta S(t)$ and $\dot Z(t) = -k Z(t)$. By using (21), there are three cases in which Equation (A22) holds true:
(i) 
$\hat z = Z_{0S}(\hat s)$; then, the point $(\hat s, \hat z)$ is given by the system of equations written in Equation (31).
(ii) 
$\hat z = Z_{0A}$ with $S \geq S_3$; then, the point $(\hat s, \hat z)$ is given by
$$\hat S(S,Z) = S \left(\frac{Z}{Z_{0A}}\right)^{\delta/k} \geq S_3, \qquad \hat Z(S,Z) = Z_{0A}. \tag{A23}$$
(iii) 
$Z(0) = 0$; then, the point $(\hat s, \hat z)$ is given by
$$\hat S(S,0) = -\frac{a + c_0}{c_S}, \qquad \hat Z(S,0) = 0 \quad \text{for } S \leq -\frac{a + c_0}{c_S}. \tag{A24}$$
The system in (31) does not have an analytical solution, and the pair of equations is written as implicit functions denoted by $\hat S(S,Z)$ and $\hat Z(S,Z)$. For a fixed point on the boundary $z_0 = Z_{0S}(s_0)$, $\lim_{(S,Z) \to (s_0, z_0)} (\hat S(S,Z), \hat Z(S,Z)) = (s_0, z_0)$, which will be used later. The functions $\hat S(S,Z)$ and $\hat Z(S,Z)$ have to be computed numerically, except in the special cases given below:
Remark A2.
There are special cases where (31) has an analytical solution:
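$$\text{if } \delta = k \text{, then } \hat S(S,Z) = \frac{\sqrt{(a + c_0)^2 - 4 S Z c_Z c_S} - (a + c_0)}{2 c_S}, \quad \hat Z(S,Z) = -\frac{\sqrt{(a + c_0)^2 - 4 S Z c_Z c_S} + (a + c_0)}{2 c_Z}, \tag{A25}$$
$$\text{if } \delta = r + k \text{, then } \hat S(S,Z) = -\frac{a + c_0}{c_S}, \quad \hat Z(S,Z) = Z \left(-\frac{S c_S}{a + c_0}\right)^{k/\delta}. \tag{A26}$$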
Since $Z(t)$ decreases (by (A21)), for $(S(0),Z(0)) \in R_0$ where $Z(0) < Z_{0A}$, there exists $\hat{t} \ge 0$ such that $Z(\hat{t}) = \hat{Z}(S(0),Z(0)) = Z_{0S}\big(\hat{S}(S(0),Z(0))\big)$. However, for $Z(0) \ge Z_{0A}$, depending on the position of $(S(0),Z(0))$, $Z(\hat{t})$ may lie on $Z = Z_{0S}(S)$ or on $Z = Z_{0A}$. In order to precisely determine the boundary associated with every point in $R_0$, we first obtain the curve associated with the intersection of the boundary cases of $R_S$ (denoted by $(S,Z) = (S_3,Z_3)$, given in Equations (35) and (36); recall that $Z_3 = Z_{0A}$). The set of points $(S(0),Z(0)) \in R_0$ for which there exists $\hat{t} \ge 0$ with $(S(\hat{t}),Z(\hat{t})) = (S_3,Z_3)$ is given by the curve $Z = \Psi(S)$ written in Equation (34), which is found by solving (A21) with $(\hat{s},\hat{z}) = (S_3,Z_3)$. Note that
$$\frac{d\Psi(S)}{dS} = -\frac{k}{\delta}\,\frac{S_3 Z_3}{S^2}\left(\frac{S_3}{S}\right)^{\frac{k}{\delta}-1} < 0.$$
By using (34), we define the partitions of region $R_0$, denoted by $R_0^{W}$ and $R_0^{\bar{V}}$, given in Equations (32) and (33).
Consider a point $(S(0),Z(0)) = (s_0,z_0)$ such that $z_0 = \Psi(s_0)$, where $s_0 < S_3$ and $z_0 > Z_3$. It satisfies $\big(\hat{S}(s_0,z_0), \hat{Z}(s_0,z_0)\big) = (S_3,Z_3)$. Denote by $t = t_0$ the time such that $(S(t_0),Z(t_0)) = (S_3,Z_3)$. By using (A21), we obtain $t_0 = \frac{1}{k}\log\frac{z_0}{Z_3} = \frac{1}{\delta}\log\frac{S_3}{s_0} > 0$. Then:
  • for $(S(0),Z(0)) = (s,z_0) \in R_0^{W}$ where $s < s_0$, we have $(S(t_0),Z(t_0)) = (S(t_0),Z_3)$ with $S(t_0) < S_3$; hence $(S(t_0),Z_3) \in R_0$, $(S(t_0),Z_3) \notin R_S$, and $(S(t_0),Z_3) \notin R_A$. Then, $\hat{z} = Z_{0S}(\hat{s})$, and the point $(\hat{s},\hat{z})$ is given by (31).
  • for $(S(0),Z(0)) = (s,z_0) \in R_0^{\bar{V}}$ where $s > s_0$, we have $(S(t_0),Z(t_0)) = (S(t_0),Z_3)$ with $S(t_0) > S_3$; hence the point $(\hat{s},\hat{z})$ is given by (A23).
To summarize, if $(S,Z) \in R_0^{W}$, then $\hat{Z}(S,Z) = Z_{0S}\big(\hat{S}(S,Z)\big)$, and if $(S,Z) \in R_0^{\bar{V}}$, then $\hat{Z}(S,Z) = Z_{0A}$.
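The partition argument can be illustrated with a short sketch. It assumes the zero-harvest trajectories $S(t) = S(0)e^{\delta t}$ and $Z(t) = Z(0)e^{-kt}$, which is the form implied by the expression for $t_0$ above, and takes the corner point $(S_3, Z_3)$ as given. The function names and the numerical values are hypothetical, and the classifier presumes that the point already lies in $R_0$.

```python
import math


def psi(S, S3, Z3, delta, k):
    """Curve of initial points whose zero-harvest trajectory passes
    through the corner (S3, Z3): Z = Z3 * (S3 / S) ** (k / delta)."""
    return Z3 * (S3 / S) ** (k / delta)


def hitting_time(s0, z0, S3, Z3, delta, k):
    """Time to reach (S3, Z3) from a point on the curve, computed twice:
    once from the Z-dynamics and once from the S-dynamics."""
    t_from_z = math.log(z0 / Z3) / k
    t_from_s = math.log(S3 / s0) / delta
    return t_from_z, t_from_s


def partition(S, Z, S3, Z3, delta, k):
    """Label a point of R_0 (membership in R_0 is NOT checked here).
    Points below the curve reach the boundary with R_S; points on or
    above it reach the constant boundary shared with R_A."""
    return "R0_W" if Z < psi(S, S3, Z3, delta, k) else "R0_Vbar"


if __name__ == "__main__":
    S3, Z3, delta, k = 4.0, 2.0, 0.2, 0.1   # hypothetical values
    s0 = 1.0
    z0 = psi(s0, S3, Z3, delta, k)
    print(z0, hitting_time(s0, z0, S3, Z3, delta, k))
    print(partition(0.5, z0, S3, Z3, delta, k))  # s < s0
    print(partition(2.0, z0, S3, Z3, delta, k))  # s > s0
```

Because $\Psi$ is decreasing, a point $(s, z_0)$ with $s < s_0$ lies below the curve and a point with $s > s_0$ lies above it, which is how the two bullets map onto the comparison in `partition`.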
Since we obtained the boundary case associated with every point in R 0 , by applying the method of characteristics (see [43]), we obtain the solution to (A20) given in the following lemma:
Lemma A3.
The value function in $R_0$ is written in Equation (29). The function $V_0(S,Z)$ is continuously differentiable in $S$ and $Z$ $\forall (S,Z) \in R_0$ and on the boundary cases of $R_0$.
Proof. 
We first consider Equation (A20) such that $V(S,\hat{z}) = \Theta(S,\hat{z})$ with a constant $\hat{z}$, where $\Theta(S,Z)$ is an analytical function of $S$ and $Z$; this is a Cauchy problem. Suppose that a function $u$, which is parameterized by $\tau$, is a solution to Equation (A20) such that $u(\tau) = u(S(\tau),Z(\tau)) = V(S(\tau),Z(\tau))$. Introduce the system of ordinary differential equations (ODEs), which is called the characteristic system for the PDE in (A20):
$$\frac{du}{d\tau} = -ru - \frac{\phi}{2}Z^2; \quad \frac{dS}{d\tau} = -\delta S; \quad \frac{dZ}{d\tau} = kZ \quad \text{such that} \tag{A27}$$
$$u(0) = \Theta(\hat{s},\hat{z}); \quad S(0) = \hat{s}; \quad Z(0) = \hat{z}. \tag{A28}$$
Solving the last two equations in (A27) with their initial conditions results in
$$S(\tau) = \hat{s}\,e^{-\delta\tau}, \quad Z(\tau) = \hat{z}\,e^{k\tau}. \tag{A29}$$
By solving for $e^{\tau}$ and $\hat{s}$, we obtain $\tau(S,Z)$ and $\hat{S}(S,Z)$ that satisfy (A29):
$$e^{\tau} = \left(\frac{Z(\tau)}{\hat{z}}\right)^{\frac{1}{k}}; \quad \hat{s} = S(\tau)\,e^{\delta\tau} = S(\tau)\left(\frac{Z(\tau)}{\hat{z}}\right)^{\frac{\delta}{k}}. \tag{A30}$$
We insert $Z(\tau)$ given in (A29) into the first ODE in (A27) and obtain the following equation:
$$\frac{du}{d\tau} = -ru - \frac{\phi}{2}\hat{z}^2 e^{2k\tau}. \tag{A31}$$
Solving (A31) with its initial condition in (A28) yields
$$u(\tau) = e^{-r\tau}\,\Theta(\hat{s},\hat{z}) + e^{-r\tau}\,\frac{\phi\hat{z}^2}{2(2k+r)}\left(1 - e^{(2k+r)\tau}\right). \tag{A32}$$
Replacing (A30) into (A32) yields
$$u(S(\tau),Z(\tau)) = \left(\frac{Z(\tau)}{\hat{z}}\right)^{-\frac{r}{k}} \Theta\!\left(S(\tau)\left(\frac{Z(\tau)}{\hat{z}}\right)^{\frac{\delta}{k}}, \hat{z}\right) + \frac{\phi Z(\tau)^2}{2(2k+r)}\left[\left(\frac{Z(\tau)}{\hat{z}}\right)^{-\frac{2k+r}{k}} - 1\right]; \tag{A33}$$
Hence, we obtained the solution to (A20), which is verified for any analytical function $\Theta(S,Z)$. Further, in order to satisfy the boundary conditions in (A21) and (A22) $\forall (S,Z) \in R_0$, we replace $\hat{z} = \hat{Z}(S,Z)$ and obtain (29). We then write (A20) by using (29), which simplifies to the following equation:
$$\frac{2}{k\hat{Z}(\cdot)}\left(\frac{Z}{\hat{Z}(\cdot)}\right)^{-\frac{r}{k}}\left(kZ\frac{\partial\hat{Z}(\cdot)}{\partial Z} - \delta S\frac{\partial\hat{Z}(\cdot)}{\partial S}\right)\left(r\,\Theta(\hat{S}(\cdot),\hat{Z}(\cdot)) - \delta\hat{S}(\cdot)\frac{\partial\Theta(\hat{S}(\cdot),\hat{Z}(\cdot))}{\partial S} + k\hat{Z}(\cdot)\frac{\partial\Theta(\hat{S}(\cdot),\hat{Z}(\cdot))}{\partial Z} + \frac{\phi}{2}\hat{Z}(\cdot)^2\right) = 0. \tag{A34}$$
We showed that, for $(S,Z) \in R_0^{W}$ where $\Theta(S,Z) = W(S,Z)$, we have $\hat{Z}(S,Z) = Z_{0S}\big(\hat{S}(S,Z)\big)$. For $\hat{z} = Z_{0S}(\hat{s})$, $W(\hat{s},\hat{z})$ satisfies the HJB Equation (5) with $q^*(\hat{s},\hat{z}) = 0$, i.e.,
$$r\,W(\hat{s},\hat{z}) = \delta\hat{s}\,\frac{\partial W(\hat{s},\hat{z})}{\partial S} - k\hat{z}\,\frac{\partial W(\hat{s},\hat{z})}{\partial Z} - \frac{\phi}{2}\hat{z}^2, \tag{A35}$$
and for $(S,Z) \in R_0^{\bar{V}}$ where $\Theta(S,Z) = \bar{V}(Z)$, we have $\hat{Z}(S,Z) = Z_{0A}$. When $\hat{z} = Z_{0A}$, $\bar{V}(\hat{z})$ satisfies (5) with $q^*(\hat{s},\hat{z}) = 0$,
$$r\,\bar{V}(\hat{z}) = -k\hat{z}\,\frac{\partial\bar{V}(\hat{z})}{\partial Z} - \frac{\phi}{2}\hat{z}^2. \tag{A36}$$
For both cases, the third term in (A34) is zero and the equation holds true; hence, the function $V_0(S,Z)$ given in (29) satisfies the HJB equation in (5) with $q_i^* = 0$ $\forall i$ and $\forall (S,Z) \in R_0$.
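The characteristic solution for a fixed $\hat{z}$ can be checked numerically. The sketch below is an illustration under stated assumptions, not part of the proof: it takes the zero-harvest HJB equation to be $rV = \delta S\,V_S - kZ\,V_Z - \frac{\phi}{2}Z^2$, evaluates the closed-form expression obtained from the characteristic system for an arbitrary smooth boundary function $\Theta$ (here a hypothetical polynomial in $S$), and measures the residual with central finite differences. All parameter values are hypothetical.

```python
def value_r0(S, Z, z_hat, theta, r, k, delta, phi):
    """Characteristic solution with boundary data V(S, z_hat) = theta(S)."""
    ratio = Z / z_hat
    s_hat = S * ratio ** (delta / k)          # foot of the characteristic
    damage = phi * Z ** 2 / (2.0 * (2.0 * k + r)) * (
        ratio ** (-(2.0 * k + r) / k) - 1.0
    )
    return ratio ** (-r / k) * theta(s_hat) + damage


def hjb_residual(S, Z, z_hat, theta, r, k, delta, phi, h=1e-5):
    """delta*S*V_S - k*Z*V_Z - (phi/2)*Z^2 - r*V via central differences."""
    def V(s, z):
        return value_r0(s, z, z_hat, theta, r, k, delta, phi)

    V_S = (V(S + h, Z) - V(S - h, Z)) / (2.0 * h)
    V_Z = (V(S, Z + h) - V(S, Z - h)) / (2.0 * h)
    return delta * S * V_S - k * Z * V_Z - 0.5 * phi * Z ** 2 - r * V(S, Z)


def theta_example(s):
    """Hypothetical smooth boundary function, chosen only for the check."""
    return -3.0 * s - 0.5 * s * s


if __name__ == "__main__":
    params = dict(r=0.05, k=0.1, delta=0.2, phi=1.0)
    print(hjb_residual(1.5, 3.0, 2.0, theta_example, **params))
    print(value_r0(1.5, 2.0, 2.0, theta_example, **params), theta_example(1.5))
```

The residual should vanish up to discretization error for any smooth choice of `theta_example`, and the value at $Z = \hat{z}$ should reproduce the boundary data, which mirrors the statement that the solution holds for any analytical $\Theta$.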
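For the partial derivatives of $V_0(S,Z)$ with respect to $S$ and $Z$, by collecting the terms multiplied by $\partial\hat{Z}(S,Z)/\partial S$ and $\partial\hat{Z}(S,Z)/\partial Z$, using (A35) and (A36) allows us to simplify to the following:
$$\frac{\partial V_0(S,Z)}{\partial S} = \left(\frac{Z}{\hat{Z}(\cdot)}\right)^{\frac{\delta - r}{k}} \frac{\partial\Theta(\hat{S}(\cdot),\hat{Z}(\cdot))}{\partial S}, \tag{A37}$$
$$\frac{\partial V_0(S,Z)}{\partial Z} = \left(\frac{Z}{\hat{Z}(\cdot)}\right)^{-\frac{k+r}{k}} \frac{\partial\Theta(\hat{S}(\cdot),\hat{Z}(\cdot))}{\partial Z} - \frac{\phi Z}{2k+r}\left[1 - \left(\frac{Z}{\hat{Z}(\cdot)}\right)^{-\frac{2k+r}{k}}\right]. \tag{A38}$$
The function $V_0(S,Z)$ is continuously differentiable in $S$ and $Z$ $\forall (S,Z) \in R_0^{W}$, $\forall (S,Z) \in R_0^{\bar{V}}$, and $\forall (S,Z)\,|\,Z = \Psi(S)$, since on this curve, $\hat{Z}(S,Z) = Z_3$ and $W(S_3,Z_3) = \bar{V}(Z_3)$.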
  • Continuity of $V(S,Z)$ on the boundary cases of $R_0$:
There are two cases: (a) $(S,Z) = (S, Z_{0S}(S))$ and (b) $(S,Z) = (S, Z_{0A})$ with $S \ge S_3$.
(a) $(S,Z) = (S, Z_{0S}(S))$:
On this boundary case, $\hat{z} = Z_{0S}(\hat{s})$ and we have $\lim_{(S,Z)\to(\hat{s},\hat{z})} \big(\hat{S}(S,Z),\hat{Z}(S,Z)\big) = (\hat{s},\hat{z})$, so that
$$\lim_{(S,Z)\to(\hat{s},\hat{z})} V_0(S,Z) = W(\hat{s},\hat{z}). \tag{A39}$$
Thus, $V(S,Z)$ is continuous on $Z = Z_{0S}(S)$. For continuity of its partial derivatives, by using (A37) and (A38), we obtain
$$\lim_{(S,Z)\to(\hat{s},\hat{z})} \frac{\partial V_0(S,Z)}{\partial S} = \frac{\partial W(\hat{s},\hat{z})}{\partial S}, \quad \lim_{(S,Z)\to(\hat{s},\hat{z})} \frac{\partial V_0(S,Z)}{\partial Z} = \frac{\partial W(\hat{s},\hat{z})}{\partial Z}; \tag{A40}$$
Hence, $V(S,Z)$ is continuously differentiable in $S$ and $Z$ on $Z = Z_{0S}(S)$.
(b) $(S,Z) = (S, Z_{0A})$ with $S \ge S_3$:
On this boundary case, we have $\lim_{(S,Z)\to(\hat{s},\hat{z})} \big(\hat{S}(S,Z),\hat{Z}(S,Z)\big) = (\hat{s}, Z_{0A})$ with $\hat{s} \ge S_3$. For continuity, we have
$$\lim_{(S,Z)\to(\hat{s},\hat{z})} V_0(S,Z) = \bar{V}(\hat{z}). \tag{A41}$$
Thus, $V(S,Z)$ is continuous on $Z = Z_{0A}$. For continuity of its partial derivatives, by using (A38), we have
$$\lim_{(S,Z)\to(\hat{s},\hat{z})} \frac{\partial V_0(S,Z)}{\partial S} = \frac{\partial\bar{V}(\hat{z})}{\partial S} = 0, \quad \lim_{(S,Z)\to(\hat{s},\hat{z})} \frac{\partial V_0(S,Z)}{\partial Z} = \frac{\partial\bar{V}(\hat{z})}{\partial Z}. \tag{A42}$$
Thus, $V(S,Z)$ is continuously differentiable on $Z = Z_{0A}$ with $S \ge S_3$.
Therefore, the function $V_0(S,Z)$ is continuously differentiable in $S$ and $Z$ in $R_0$ and on both of its boundary cases.
 □
Consequently, by using Lemmas A1–A3, under Assumptions 1 and 2, the function $V(S,Z)$ given in (22) is continuously differentiable in $S$ and $Z$ and satisfies the HJB equation in (5), and the strategy profile $q_i = q^*(S,Z)$ $\forall i$ given in (11) satisfies condition (7) for all $(S,Z) \in \mathbb{R}_+^2$. The terminal condition is satisfied because $\dot{Z}(t) \le -kZ(t) + \alpha M n S_{\max}$ and the set of all $S$ is bounded, while the set of all $Z \ge 0$ can be divided into two parts: a bounded interval and a set in which every admissible trajectory $Z$ decreases, so every admissible trajectory is bounded. Thus, by continuity of $V(S,Z)$, for any given initial condition $(S(0),Z(0))$, the limit of $V(S(t),Z(t))\,e^{-rt}$ exists and equals 0. Therefore, $q_i = q^*$ $\forall i$ constitutes a symmetric feedback–Nash equilibrium, which proves Theorem 1.

Appendix B. Proof of Theorem 2

In the following, we derive the sets of points such that $\dot{S}(t) = 0$ and $\dot{Z}(t) = 0$, respectively, and then obtain the steady states and study their stability.
We begin with $(S,Z) \in R_S$. The locus $\dot{S}(t) = 0$ is given by $\delta S = n(a + c_0 + c_S S + c_Z Z)/(b(n+1))$. By solving for $Z$, we obtain the following linear function in $S$:
$$Z = S_0^{S}(S) = S\left(\frac{b\delta(n+1)}{n c_Z} - \frac{c_S}{c_Z}\right) - \frac{a + c_0}{c_Z}. \tag{A43}$$
The locus $\dot{Z}(t) = 0$ is given by $kZ = \alpha n(a + c_0 + c_S S + c_Z Z)/(b(n+1))$ and defined by the following linear function in $S$:
$$Z = Z_0^{S}(S) = \frac{\alpha n}{bk(1+n) - \alpha n c_Z}\,(a + c_0 + c_S S). \tag{A44}$$
The point at which the two loci intersect, i.e., $(S,Z) \in R_S$ such that $\dot{S}(t) = 0$ and $\dot{Z}(t) = 0$, is denoted by $\xi_S = \{(s_S, z_S)\,|\,z_S = S_0^{S}(s_S) = Z_0^{S}(s_S)\}$ and written in Equations (40) and (41).
We analyze the stability of the steady states by studying the signs of the determinants and traces of the Jacobian matrix associated with the differential equation system in the neighborhood of the critical points ([42]); see [44] for a similar analysis. The Jacobian matrix associated with the point ξ S is given by
$$J_S = \begin{pmatrix} \delta - \dfrac{n c_S}{b(n+1)} & -\dfrac{n c_Z}{b(n+1)} \\[2ex] \dfrac{n\alpha c_S}{b(n+1)} & \dfrac{n\alpha c_Z}{b(n+1)} - k \end{pmatrix}. \tag{A45}$$
Its determinant and trace are written as follows:
$$\det(J_S) = \frac{n(k c_S + \delta\alpha c_Z)}{b(1+n)} - k\delta = \frac{(\delta - r)(\lambda - b(n+1)r) - 2b\delta k(n-1)}{4bn}, \tag{A46}$$
$$\operatorname{tr}(J_S) = \delta - k - \frac{n(c_S - \alpha c_Z)}{b(1+n)} = -\frac{\lambda + b\big(4\delta + 2k(n-1) - 3(1+n)r\big)}{4bn}. \tag{A47}$$
It can be verified that $\det(J_S) > 0$ and $\operatorname{tr}(J_S) < 0$ if $\delta > r(1+n)/2$. This is a weaker condition than the first part of Assumption 1; thus, $\xi_S$ is a stable steady state if Assumption 1 is satisfied.
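The determinant and trace test used here is easy to reproduce. The sketch below builds the Jacobian of the system $\dot{S} = \delta S - nq$, $\dot{Z} = \alpha n q - kZ$ under the linear rule $q = (a + c_0 + c_S S + c_Z Z)/(b(n+1))$ and applies the usual criterion for a locally asymptotically stable node or focus of a planar system (positive determinant, negative trace). The coefficient values passed in are hypothetical placeholders, not the equilibrium coefficients of the paper, which depend on $\lambda$ and are defined in the main text.

```python
def jacobian_rs(delta, k, n, b, alpha, c_S, c_Z):
    """Jacobian of (S_dot, Z_dot) in the region with a linear harvest rule."""
    m = n / (b * (n + 1.0))          # d(n*q)/d(linear index)
    return [
        [delta - m * c_S, -m * c_Z],
        [alpha * m * c_S, alpha * m * c_Z - k],
    ]


def det_trace(J):
    """Determinant and trace of a 2x2 matrix given as nested lists."""
    det = J[0][0] * J[1][1] - J[0][1] * J[1][0]
    tr = J[0][0] + J[1][1]
    return det, tr


def is_stable(J):
    """Both eigenvalues have negative real part iff det > 0 and tr < 0."""
    det, tr = det_trace(J)
    return det > 0 and tr < 0


if __name__ == "__main__":
    # Hypothetical parameter and coefficient values for illustration only.
    J = jacobian_rs(delta=0.3, k=0.2, n=2, b=1.0, alpha=0.5, c_S=1.2, c_Z=-0.4)
    print(det_trace(J), is_stable(J))
```

The first expressions for the determinant and trace in the text follow directly from this matrix; the second expressions additionally require the closed forms of $c_S$ and $c_Z$.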
We now turn to $(S,Z) \in R_A$. The locus $\dot{S}(t) = 0$ is given by the following piecewise-linear function in $S$:
$$Z = S_0^{A_1}(S) = S\,\frac{b\delta(n+1)}{n\bar{c}_Z} - \frac{a + \bar{c}_0}{\bar{c}_Z}, \quad S \le S_y,$$
$$Z = S_0^{A_2}(S) = (S_{\max} - S)\,\frac{b\delta(n+1)}{n\bar{c}_Z} - \frac{a + \bar{c}_0}{\bar{c}_Z}, \quad S \ge S_y.$$
Since $\bar{c}_Z < 0$, we have $dS_0^{A_1}(S)/dS < 0$ and $dS_0^{A_2}(S)/dS > 0$. Thus, for $(S,Z) \in R_A$, the minimum level of $Z$ such that $\dot{S}(t) = 0$ is attained at $S = S_y$ and is given by $Z = S_0^{A_1}(S_y) = S_0^{A_2}(S_y)$. The locus $\dot{Z}(t) = 0$ is given by $kZ = \alpha n(a + \bar{c}_0 + \bar{c}_Z Z)/(b(n+1))$, which results in the constant denoted by $Z_0^{A}$ that is given in (44) (i.e., $Z_0^{A} = z_{A_1} = z_{A_2}$). Then, in order for points with $\dot{S}(t) = 0$ and $\dot{Z}(t) = 0$ to exist, $S_0^{A_1}(S_y) = S_0^{A_2}(S_y) < Z_0^{A}$ must be satisfied, which is written in (45). When condition (45) is true, the intersection points, denoted by $\xi_{A_1} = \{(s_{A_1}, z_{A_1})\,|\,z_{A_1} = S_0^{A_1}(s_{A_1}) = Z_0^{A}\}$ and $\xi_{A_2} = \{(s_{A_2}, z_{A_2})\,|\,z_{A_2} = S_0^{A_2}(s_{A_2}) = Z_0^{A}\}$, are given in Equations (42)–(44).
To check stability, we obtain the Jacobian matrices that are associated with these critical points, written as follows:
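$$J_{A_1} = \begin{pmatrix} \delta & -\dfrac{n\bar{c}_Z}{b(n+1)} \\[2ex] 0 & \dfrac{n\alpha\bar{c}_Z}{b(n+1)} - k \end{pmatrix}; \quad J_{A_2} = \begin{pmatrix} -\delta & -\dfrac{n\bar{c}_Z}{b(n+1)} \\[2ex] 0 & \dfrac{n\alpha\bar{c}_Z}{b(n+1)} - k \end{pmatrix}.$$
For the first matrix, we have
$$\det(J_{A_1}) = \frac{\alpha\delta n\bar{c}_Z}{b(n+1)} - \delta k = -\frac{\delta\big(2bk(n-1) + \lambda - b(n+1)r\big)}{4bn} < 0,$$
$$\operatorname{tr}(J_{A_1}) = \frac{\alpha n\bar{c}_Z}{b(n+1)} + \delta - k = \delta - \frac{k(n-1)}{2n} - \frac{\lambda - b(n+1)r}{4bn}.$$
Since $\det(J_{A_1}) < 0$, $\xi_{A_1}$ is not a stable steady state. For the second one,
$$\det(J_{A_2}) = \delta k - \frac{n\alpha\delta\bar{c}_Z}{b(n+1)} = \frac{\delta\big(2bk(n-1) + \lambda - b(n+1)r\big)}{4bn} > 0,$$
$$\operatorname{tr}(J_{A_2}) = \frac{n\alpha\bar{c}_Z}{b(n+1)} - \delta - k = -\left(\delta + \frac{k(n-1)}{2n} + \frac{\lambda - b(n+1)r}{4bn}\right) < 0;$$
therefore, $\xi_{A_2}$ is a stable steady state.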
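The two candidate steady states in $R_A$ can be computed directly from the loci. The sketch below is illustrative only: it solves $kZ = \alpha n q$ and $\delta S = nq$ (or $\delta(S_{\max} - S) = nq$) with $q = (a + \bar{c}_0 + \bar{c}_Z Z)/(b(n+1))$, and it takes the kink at $S_y = S_{\max}/2$, which is what equality of the two branches of the $\dot{S} = 0$ locus at $S_y$ implies. The expressions are derived from the loci rather than copied from Equations (42)–(44), and the coefficient values are hypothetical placeholders.

```python
def steady_states_ra(a, b, n, alpha, k, delta, S_max, c0_bar, cZ_bar):
    """Candidate steady states where harvest depends on Z only.
    Returns (Z0A, s_A1, s_A2, exists)."""
    # Z_dot = 0: k*Z = alpha*n*(a + c0_bar + cZ_bar*Z) / (b*(n+1))
    Z0A = alpha * n * (a + c0_bar) / (b * k * (n + 1.0) - alpha * n * cZ_bar)
    total_harvest = n * (a + c0_bar + cZ_bar * Z0A) / (b * (n + 1.0))
    s_a1 = total_harvest / delta      # rising branch: growth = delta * S
    s_a2 = S_max - s_a1               # falling branch: growth = delta * (S_max - S)
    exists = 0.0 < s_a1 < 0.5 * S_max  # intersection lies left of the kink
    return Z0A, s_a1, s_a2, exists


def det_trace_ra(delta, k, n, b, alpha, cZ_bar, branch):
    """Determinant and trace of the triangular Jacobian on branch 1 or 2."""
    growth_slope = delta if branch == 1 else -delta
    z_entry = n * alpha * cZ_bar / (b * (n + 1.0)) - k
    return growth_slope * z_entry, growth_slope + z_entry


if __name__ == "__main__":
    # Hypothetical values; cZ_bar < 0 as stated in the text.
    p = dict(a=10.0, b=1.0, n=2, alpha=0.5, k=0.2, delta=0.3,
             S_max=100.0, c0_bar=-2.0, cZ_bar=-0.5)
    print(steady_states_ra(**p))
    print(det_trace_ra(0.3, 0.2, 2, 1.0, 0.5, -0.5, branch=1))
    print(det_trace_ra(0.3, 0.2, 2, 1.0, 0.5, -0.5, branch=2))
```

Because the Jacobian is triangular in this region, its eigenvalues are the diagonal entries, so the sign of the growth slope alone decides which of the two points is a saddle.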
We can conclude that, under Assumptions 1 and 2, the stable steady states are either unique ($\xi_S$) or multiple ($\xi_S$, $\xi_{A_2}$), depending on condition (45). In both cases, $q^*(t)$ converges to a steady state with $q^*(S(t),Z(t)) > 0$, $\forall (S(0),Z(0)) \in \mathbb{R}_+^2$.
In the analysis of stability, in step (A46), we eliminate the possibility of using solution A for $\delta > r$, since replacing $\lambda$ with $-\lambda$ in (A46) leads to $\det(J_S) < 0$, which means that the point $\xi_S$ is not stable. Thus, combining with the previous results in steps (19), (A16), and (A19), the strategy profile in (11) given by solutions A and B is the only solution that satisfies the sufficient conditions for a symmetric feedback–Nash equilibrium.

Appendix C. Proof of Proposition 1

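The partial derivatives of the coefficients of equilibrium strategies with respect to the parameters $n$, $\phi$, and $\delta$ are written as follows:
$$\frac{\partial c_S}{\partial n} = -\frac{2\delta - r}{4n^3(\delta + k)}\left(\lambda + \frac{b^2(n+1)^2(2k+r)^2}{\lambda} + 2b(n+1)(2\delta - r)\right) < 0 \ \text{ if } \delta > r/2,$$
$$\frac{\partial c_Z}{\partial n} = \frac{(k + r - \delta)\big(\lambda - b(n+1)(2k+r)\big)^2}{4\alpha\lambda n^3(\delta + k)} \ \Rightarrow\ \operatorname{sign}\!\left(\frac{\partial c_Z}{\partial n}\right) = \operatorname{sign}(k + r - \delta),$$
$$\frac{\partial\bar{c}_Z}{\partial n} = \frac{\big(\lambda - b(n+1)(2k+r)\big)^2}{4\alpha\lambda n^3} > 0.$$
$$\frac{\partial c_S}{\partial\phi} = \frac{\alpha^2 b(n+1)(2\delta - r)}{(\delta + k)\lambda} > 0 \ \text{ if } \delta > r/2,$$
$$\frac{\partial c_Z}{\partial\phi} = \frac{\alpha b(n+1)(\delta - k - r)}{(\delta + k)\lambda} \ \Rightarrow\ \operatorname{sign}\!\left(\frac{\partial c_Z}{\partial\phi}\right) = \operatorname{sign}(\delta - k - r),$$
$$\frac{\partial\bar{c}_Z}{\partial\phi} = -\frac{\alpha b(n+1)}{\lambda} < 0.$$
$$\frac{\partial c_S}{\partial\delta} = \frac{(n+1)\big(\lambda(2k+r) + b(n+1)(2\delta - r)(2\delta + 4k + r)\big)}{4n^2(\delta + k)^2} > 0 \ \text{ if } \delta > r/2,$$
$$\frac{\partial c_Z}{\partial\delta} = \frac{(n+1)(2k+r)\big(\lambda - b(n+1)(2k+r)\big)}{4\alpha n^2(\delta + k)^2} > 0.$$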

Appendix D. Proof of Proposition 2

For $n = 1$, the steady state in $R_S$, i.e., $\xi_S = (s_S, z_S)$ given in (40) and (41), simplifies to
$$s_S = \frac{8abk(k+r)}{\delta(\lambda^2 - 4b^2r^2)}, \quad z_S = \frac{8\alpha ab(k+r)}{\lambda^2 - 4b^2r^2}.$$
The point $(s_S, z_S)$ lies on the boundary $Z_{SA}(S)$ given in (38), i.e., $z_S = Z_{SA}(s_S)$. Furthermore, it coincides with $z_S = z_{A_1} = Z_0^{A}$ (given in (44)) and $s_S = s_{A_1}$ (given in (42)); then, the point $\xi_S$ coincides with the unstable point in $R_A$, i.e., for $n = 1$, $\xi_S = \xi_{A_1}$, and thus, it is an unstable steady state; nevertheless, it is sustainable. The analysis of the stability of $\xi_{A_2}$ remains valid for $n = 1$; therefore, $\xi_{A_2} = (s_{A_2}, z_{A_2})$ given in (43) and (44) is the only stable steady state.

References

  1. Eriksen, M.; Lebreton, L.C.; Carson, H.S.; Thiel, M.; Moore, C.J.; Borerro, J.C.; Reisser, J. Plastic pollution in the world’s oceans: More than 5 trillion plastic pieces weighing over 250,000 tons afloat at sea. PLoS ONE 2014, 9, e111913.
  2. Newman, S.; Watkins, E.; Farmer, A.; Ten Brink, P.; Schweitzer, J.P. The economics of marine litter. In Marine Anthropogenic Litter; Bergmann, M., Gutow, L., Klages, M., Eds.; Springer: Cham, Switzerland, 2015; pp. 367–394.
  3. Richardson, K.; Hardesty, B.D.; Wilcox, C. Estimates of fishing gear loss rates at a global scale: A literature review and meta-analysis. Fish Fish. 2019, 20, 1218–1231.
  4. Gilman, E. Status of international monitoring and management of abandoned, lost and discarded fishing gear and ghost fishing. Mar. Policy 2015, 60, 225–239.
  5. Macfadyen, G.; Huntington, T.; Cappell, R. Abandoned, Lost or Otherwise Discarded Fishing Gear; UNEP Regional Seas Reports and Studies No. 185; FAO Fisheries and Aquaculture Technical Paper No. 523; UNEP/FAO: Rome, Italy, 2009.
  6. McIlgorm, A.; Campbell, H.F.; Rule, M.J. The economic cost and control of marine debris damage in the Asia-Pacific region. Ocean Coast. Manag. 2011, 54, 643–651.
  7. Scheld, A.M.; Bilkovic, D.M.; Havens, K.J. The dilemma of derelict gear. Sci. Rep. 2016, 6, 19671.
  8. Ryan, R.W.; Holl, D.S.; Herrera, G.E. Ecosystem externalities in fisheries. Mar. Res. Econ. 2014, 29, 39–53.
  9. Clark, C.W. Mathematical Bioeconomics: The Optimal Management of Renewable Resources; Wiley: New York, NY, USA, 1990.
  10. Benchekroun, H. Unilateral production restrictions in a dynamic duopoly. J. Econ. Theory 2003, 111, 214–239.
  11. Benchekroun, H. Comparative dynamics in a productive asset oligopoly. J. Econ. Theory 2008, 138, 237–261.
  12. Vardar, B.; Zaccour, G. Exploitation of a Productive Asset in the Presence of Strategic Behavior and Pollution Externalities; Les Cahiers du GERAD G-2018-43; HEC Montréal: Montréal, QC, Canada, 2018.
  13. Benhabib, J.; Radner, R. The joint exploitation of a productive asset: A game-theoretic approach. Econ. Theory 1992, 2, 155–190.
  14. Dockner, E.J.; Sorger, G. Existence and properties of equilibria for a dynamic game on productive assets. J. Econ. Theory 1996, 71, 209–227.
  15. Kossioris, G.; Plexousakis, M.; Xepapadeas, A.; de Zeeuw, A. On the optimal taxation of common-pool resources. J. Econ. Dyn. Control 2011, 35, 1868–1879.
  16. Fujiwara, K. Losses from competition in a dynamic game model of a renewable resource oligopoly. Res. Energy Econ. 2011, 33, 1–11.
  17. Colombo, L.; Labrecciosa, P. On the convergence to the Cournot equilibrium in a productive asset oligopoly. J. Math. Econ. 2013, 49, 441–445.
  18. Colombo, L.; Labrecciosa, P. Oligopoly exploitation of a private property productive asset. J. Econ. Dyn. Control 2013, 37, 838–853.
  19. Lambertini, L.; Mantovani, A. Feedback equilibria in a dynamic renewable resource oligopoly: Pre-emption, voracity and exhaustion. J. Econ. Dyn. Control 2014, 47, 115–122.
  20. Lambertini, L.; Mantovani, A. On the (in) stability of nonlinear feedback solutions in a dynamic duopoly with renewable resource exploitation. Econ. Lett. 2016, 143, 9–12.
  21. Benchekroun, H.; Gaudet, G. On the effects of mergers on equilibrium outcomes in a common property renewable asset oligopoly. J. Econ. Dyn. Control 2015, 52, 209–223.
  22. Benchekroun, H.; Long, V.N. Status concern and the exploitation of common pool renewable resources. Ecol. Econ. 2016, 125, 70–82.
  23. Grilli, L.; Bisceglia, M. A duopoly with common renewable resource and incentives. Int. Game Theory Rev. 2017, 19, 1750018.
  24. Jørgensen, S.; Martín-Herrán, G.; Zaccour, G. Dynamic games in the economics and management of pollution. Environ. Model. Assess. 2010, 15, 433–467.
  25. Long, V.N. Dynamic games in the economics of natural resources: A survey. Dyn. Games Appl. 2011, 1, 115–148.
  26. Van der Ploeg, F.; de Zeeuw, A.J. International aspects of pollution control. Environ. Res. Econ. 1992, 2, 117–139.
  27. Dockner, E.J.; Long, V.N. International pollution control: Cooperative versus noncooperative strategies. J. Environ. Econ. Manag. 1993, 25, 13–29.
  28. Benchekroun, H.; Long, V.N. Efficiency inducing taxation for polluting oligopolists. J. Public Econ. 1998, 70, 325–342.
  29. Rubio, S.J.; Escriche, L. Strategic Pigouvian taxation, stock externalities and polluting non-renewable resources. J. Public Econ. 2001, 79, 297–313.
  30. Wirl, F. Pigouvian taxation of energy for flow and stock externalities and strategic, noncompetitive energy pricing. J. Environ. Econ. Manag. 1994, 26, 1–18.
  31. Wirl, F. Tragedy of the commons in a stochastic game of a stock externality. J. Public Econ. Theory 2008, 10, 99–124.
  32. Germain, M.; Toint, P.; Tulkens, H.; de Zeeuw, A. Transfers to sustain dynamic core-theoretic cooperation in international stock pollutant control. J. Econ. Dyn. Control 2003, 28, 79–99.
  33. Petrosjan, L.; Zaccour, G. Time-consistent Shapley value allocation of pollution cost reduction. J. Econ. Dyn. Control 2003, 27, 381–398.
  34. Xepapadeas, A. Induced technical change and international agreements under greenhouse warming. Res. Energy Econ. 1995, 17, 1–23.
  35. Tahvonen, O. On the dynamics of renewable resource harvesting and pollution control. Environ. Res. Econ. 1991, 1, 97–117.
  36. Xepapadeas, A. Managing the international commons: Resource use and pollution control. Environ. Res. Econ. 1995, 5, 375–391.
  37. Wirl, F. Sustainable growth, renewable resources and pollution: Thresholds and cycles. J. Econ. Dyn. Control 2004, 28, 1149–1157.
  38. Dahmouni, I.; Vardar, B.; Zaccour, G. A fair and time-consistent sharing of the joint exploitation payoff of a fishery. Nat. Res. Model. 2019, 32, e12216.
  39. Singh, R.; Dwivedi, A.D.; Srivastava, G.; Wiszniewska-Matyszkiel, A.; Cheng, X. A game theoretic analysis of resource mining in blockchain. Clust. Comput. 2020, 23, 2035–2046.
  40. Dockner, E.J.; Jørgensen, S.; Long, V.N.; Sorger, G. Differential Games in Economics and Management Science; Cambridge University Press: Cambridge, UK, 2000.
  41. Haurie, A.; Krawczyk, J.B.; Zaccour, G. Games and Dynamic Games; World Scientific Books: Singapore, 2012.
  42. Takayama, A. Analytical Methods in Economics; University of Michigan Press: Ann Arbor, MI, USA, 1993.
  43. Melikyan, A. Generalized Characteristics of First Order PDEs: Applications in Optimal Control and Differential Games; Springer Science and Business Media: New York, NY, USA, 2012.
  44. Jun, B.; Vives, X. Strategic incentives in dynamic duopoly. J. Econ. Theory 2004, 116, 249–281.
Figure 1. Illustration of the regions and their boundaries in $(S,Z)$.
Figure 2. Two cases of region positioning under Assumptions 1 and 2.
Figure 3. Sample of the case with multiple steady states.
Figure 4. Sample equilibria with unique and multiple steady states.

Vardar, N.B.; Zaccour, G. Exploitation of a Productive Asset in the Presence of Strategic Behavior and Pollution Externalities. Mathematics 2020, 8, 1682. https://doi.org/10.3390/math8101682
