Cooperative Strategies in Transboundary Water Pollution Control: A Differential Game Approach

Tu, Guoping; Yu, Chengyue; Yu, Feilong

doi:10.3390/w16223239

Open AccessArticle

Cooperative Strategies in Transboundary Water Pollution Control: A Differential Game Approach

by

Guoping Tu

¹

,

Chengyue Yu

^1,*

and

Feilong Yu

²

¹

School of Public Policy and Administration, Nanchang University, Nanchang 330031, China

²

Science and Technology Affairs Center of Jiangxi Province, Nanchang 330046, China

^*

Author to whom correspondence should be addressed.

Water 2024, 16(22), 3239; https://doi.org/10.3390/w16223239

Submission received: 16 October 2024 / Revised: 5 November 2024 / Accepted: 9 November 2024 / Published: 11 November 2024

Download

Browse Figures

Versions Notes

Abstract

:

This paper, based on differential game theory, examines governance models and cooperative strategies for managing cross-border water pollution in regions with uneven economic development. To address cross-regional water pollution, three differential game models are constructed under different scenarios: the Nash noncooperative mechanism, the pollution control cost compensation mechanism, and the collaborative cooperation mechanism. This study analyzes the dynamic changes in pollution emissions, governance investments, and economic returns within each model. The results indicate that the collaborative cooperation mechanism is the most effective, as it significantly reduces pollution emissions, maximizes overall regional benefits, and achieves Pareto optimality. In comparison, the pollution control cost compensation mechanism is suboptimal under certain conditions, while the Nash noncooperative mechanism is the least efficient, resulting in the highest pollution emissions. Furthermore, the research explores the influence of cooperation costs on the selection of governance models. It finds that high cooperation costs reduce local governments’ willingness to engage in collaborative cooperation. However, an appropriate compensation mechanism can effectively encourage less-developed regions to participate. Numerical analysis confirms the dynamic evolution of pollution stocks and economic returns under different models, and provides corresponding policy recommendations. This paper offers theoretical insights and practical guidance for cross-regional water pollution management, highlighting the importance of regional cooperation and cost-sharing in environmental governance.

Keywords:

transboundary water pollution; differential game theory; cooperative governance; pollution control compensation; sustainable development

1. Introduction

Watershed water pollution exhibits complex characteristics, including unidirectional, bidirectional, and cross-flow patterns, affecting multiple administrative regions. It presents transboundary, negative externalities, and spatial spillover effects. Rivers, as carriers of water resources, not only facilitate daily living for residents but also support industrial production. Consequently, both household and industrial wastewater often enter water bodies. With economic development and population growth, excessive water consumption and the cross-border transmission of pollutants in various regions have increasingly harmed the livelihoods and production of residents along riverbanks. The challenge of maintaining water quality standards at cross-border sections has severely hindered ecological restoration and balanced regional development [1]. The United Nations Environment Programme (UNEP) has warned that “many countries around the world are facing the threat of cross-regional water pollution, and the problem is worsening” [2]. In recent years, the accumulation of pollutants, the reduced self-purification capacity of rivers, and damage to vegetation—mainly due to industrial pollution and human activities—have become prominent issues. Given the challenges in resolving transboundary water pollution disputes, it is critical to implement robust water resource management systems and advance research on transboundary pollution compensation and cooperation within watersheds.

In this context, this paper addresses the issue of water pollution control between two regions with unequal economic development. To explore this, three regional pollution control scenarios are established: the Nash noncooperative mechanism, the pollution control cost compensation mechanism, and the collaborative cooperation mechanism. Differential game models are developed to conduct comparative analyses. Finally, numerical simulations are performed to validate the models, and management strategies are proposed based on the results, providing valuable insights for selecting regional water pollution control strategies.

In research on cooperative governance of water pollution, part of the literature focuses on the causes and nature of water pollution. Some scholars argue that the root cause of transboundary water pollution lies in conflicts of interest. Huisman et al. [3], through their analysis of water pollution control in the Rhine River, concluded that conflicting interests between countries are the primary reason for the failure of environmental governance agreements. List and Mason [4] contend that the key to solving transboundary water pollution issues lies in the “cost–benefit” dilemma. Governments often lack the awareness and incentives to manage transferable pollution. Only by finding viable solutions that consider cost–benefit factors can transboundary pollution problems be effectively resolved. Another stream of the literature focuses on pollution control, with research hotspots centered on pollution permit trading among enterprises and government–enterprise cooperation in managing water pollution. For example, Montgomery [5] and Eheart et al. [6] developed deterministic models of river pollution permit trading, analyzing the role of market access permits in controlling pollution. Building on this, Hung et al. [7] introduced a trading ratio system (TRS) for unidirectional river flow, demonstrating that TRS could account for emission location effects and achieve environmental quality standards at the lowest total abatement cost. Extending Hung’s work, Mesbah et al. [8] proposed an enhanced TRS as a cost-effective decision-making tool for managing river water quality in real time, while also accounting for risks. Although the aforementioned studies provide valuable insights into the causes of water pollution, research on cooperative pollution governance tends to focus on collaboration between enterprises or between governments and enterprises. Few studies address pollution compensation or collaborative control across administrative regions. Moreover, most of the literature on cooperative governance examines the effects or conditions of a single cooperation model, lacking in-depth analysis of different forms of cooperative governance.

Game theory, as a vital tool for analyzing conflicts and cooperation in human society, has been widely applied to the study of cross-regional water pollution governance. Common game models include evolutionary games, which are used to examine ecological compensation in water bodies [9] and address time consistency issues in downstream water pollution control [10]; cooperative games, such as those utilizing Monte Carlo numerical simulations for experimental research [11]; and dynamic games, such as river pollution models based on network externalities [12]. These studies encompass both traditional game methods and emerging theories like evolutionary games.

Differential games, a type of game in continuous-time systems, involve multiple participants who optimize their independent and conflicting objectives through ongoing interactions, eventually arriving at strategies that evolve over time and reach Nash equilibrium. This theory originated from U.S. Air Force research on military conflicts in the 1950s, combining optimal control and game theory, and has since been widely applied to areas such as joint emissions reduction [13], cooperative disaster relief [14], inventory management [15], product pricing [16], and cooperative governance [17]. In the context of dynamic pollution control, Huang et al. [18] proposed a differential game model for cross-border pollution between two regions; Xu et al. [19] used differential game models to examine the impact of the “River Chief System” on water pollution control; Wei et al. [20] analyzed how to balance sustainable development with water resource protection through ecological compensation; and Li et al. [21] investigated cross-border pollution using differential game models.

Early explorations into the application of game theory for pollution control laid a crucial theoretical groundwork in this area. Dockner and Long [22], along with Long [23], investigated international pollution control through the Cournot–Nash model, employing differential game techniques. Their findings underscored a key flaw in noncooperative approaches: when nations act independently, prioritizing economic benefits over environmental preservation, the result is often an increase in global emissions. This issue catalyzed further research aimed at coordinating international pollution control efforts to enhance emission reduction results.

Following this foundation, Wirl [24] examined the use of tax and permit systems within the Cournot–Nash framework, shedding additional light on the limitations of noncooperative methods. His study highlighted the challenges these strategies face in achieving effective pollution control.

In later work, Insley and Forsyth [25,26] utilized the Stackelberg model to analyze pollution control dynamics, introducing a leader–follower hierarchy as a means of achieving partial coordination in noncooperative settings. Compared with the Cournot–Nash approach, the Stackelberg model offers some reduction in the tension between individual and collective interests. Nevertheless, it still falls short of realizing globally optimal emission targets, especially in economically diverse regions.

Recent studies have further developed these theories by examining game structures under varying environmental policies. For instance, Nkuiya and Plantinga [27] investigated strategic pollution control within free trade conditions, concluding that noncooperative countries face challenges in maximizing global welfare through emissions reductions. Conversely, the Stackelberg strategy’s hierarchical leader–follower approach helps to mitigate some negative outcomes, though its effectiveness remains limited by the noncooperative nature of such frameworks.

In the context of cross-regional pollution management, balancing cooperative and noncooperative strategies presents significant difficulties. While the Stackelberg model provides a pathway for partial coordination, realizing optimal outcomes in cross-regional pollution management requires overcoming the inherent constraints of noncooperative models like Cournot–Nash. This is particularly challenging in economically disparate regions, where less-wealthy nations may “free-ride”, contributing minimally to pollution control efforts and exacerbating environmental burdens on neighboring areas.

To address these issues, cooperative approaches, such as cost-sharing mechanisms and collaborative governance structures, are critical. Such strategies enable economically diverse regions to jointly bear the costs and responsibilities of pollution control, thereby enhancing overall welfare.

To address the limitations of previous research, this paper analyzes the effects of production capacity, pollution control costs, and environmental pollution across regions, considering economic disparities. It explores three models: the Nash noncooperative mechanism, the pollution control cost-sharing mechanism, and collaborative cooperation. To conduct an in-depth analysis of these issues, this study employs “differential game” methods as a mathematical tool. Finally, by analyzing the dynamic trajectories of pollution levels and regional instantaneous returns, this paper examines the sensitivity of optimal strategies to factors such as short-sighted behavior and the natural degradation rate of the environment. Through numerical analysis, this paper discusses the dynamic changes in total pollutant levels and overall returns, along with the effects of discount rates and natural pollutant degradation rates on returns. This paper offers valuable theoretical references for regional cooperative strategies in water pollution governance.

Additionally, part of the inspiration for this paper stems from the work of Carbonaro and Menale (2024) [28], who investigated the application of Markov chains and kinetic theory in socioeconomic contexts. Their research on dynamic systems offers crucial insights for modeling decision-making processes in differential games. In particular, their focus on the evolution and optimization of strategies within changing environments closely aligns with the objectives of this study. By incorporating concepts from their research, this paper seeks to deepen the understanding of participant behavior in game psychology, especially in terms of how strategies adapt over time in response to various factors and uncertainties.

2. Problem Description and Model Construction

Assume that two adjacent regions, which are not administratively connected, are dealing with a cross-regional water pollution issue. These regions will be referred to as Region A and Region B. Assume that Region A has a higher economic level than Region B. The pollutants emitted by each region not only cause damage within their own region but also affect neighboring regions. These damages are represented by coefficients

ε_{A A} g_{A} (t), ε_{B B} g_{B} (t), ε_{A B} g_{A} (t),

and

ε_{B A} g_{B} (t)

, where

ε_{A A} > 0

and

ε_{B B} > 0

indicate the coefficients of environmental damage caused by emissions within each region. Similarly,

ε_{A B} > 0

and

ε_{B A} > 0

represent the coefficients of damage caused to neighboring regions by emissions from each region.

Assume that each region produces a specific product, and each unit of this product generates a certain amount of atmospheric pollutants. According to Breton’s research [29], the revenue of each region can be described by the following concave function:

R_{i} (t) = α_{i} g_{i} (t) - \frac{1}{2} g_{i}^{2} (t), i = A, B

(1)

Here, the parameter

α

is a positive constant that measures the disparity in production efficiency between the two regions. It is assumed that Region A is more developed than Region B, meaning

α_{A} > α_{B}

. Additionally,

0 < α < 1

.

The pollutants emitted by both regions can be treated using end-of-pipe technology, with the pollution control investment denoted as

h

. The pollution control costs for Regions A and B are represented as:

C_{i} (h_{i} (t)) = \frac{1}{2} c_{i} h_{i}^{2} (t), i = A, B

(2)

where

c_{i}

is the pollution control cost coefficient, which measures the differences in pollution control technology between the two regions. Specifically,

1 < c_{A} < c_{B}

, indicating that the pollution treatment cost in Region B is higher than in Region A.

To protect and improve the natural environment, it is assumed that the higher-level government (the environmental regulatory authority) imposes an environmental tax on pollutants for local governments. This environmental tax is denoted as

τ

.

Due to the mobility and accumulation characteristics of regional water pollution, each region not only bears the negative externalities of its own emissions but is also affected by the emissions of neighboring regions, resulting in a transregional pollution interaction problem. Additionally, pollutants continuously accumulate as emissions persist in each region. The pollutant stock

x (t)

is assumed to be represented by the following dynamic differential equation:

\overset{\cdot}{x} (t) = \sum_{i = A, B} g_{i} (t) - \sum_{i = A, B} ω_{i} h_{i} (t) - ζ x (t)

(3)

And

x (0) = x_{0}, x (t) \geq 0

. Here,

ζ > 0

represents the natural decomposition rate of pollutants, or the environment’s self-purification capacity, while

ω > 0

represents the amount of pollution reduced through local pollution control investments.

Assume that pollutant emissions cause pollution in both regions, leading to social losses. As a result, both regions must account for the negative effects of environmental damage in their decision making. Let the negative effect be represented by a linear function

d_{i} (x), i = A 、 B

, where

d_{A} = d, d_{B} = β d, (0 < β < 1)

, indicating that Region B experiences less environmental damage than Region A. While Region B has weaker economic development capabilities than Region A, its natural environment is in a better condition. For instance, residents in developed regions are more sensitive to environmental pollution, requiring the government to invest more resources to mitigate the negative impact of pollution.

In cross-regional water pollution governance, cooperation costs arise mainly from two sources: first, addressing the environmental damage caused by cross-border pollution; and second, the expenses related to pollution control investments and resource coordination.

First, consider that a portion of the cooperation cost is allocated to mitigating the environmental damage caused by cross-border pollution emissions. This cost is directly related to the pollution levels in each region. The higher the emissions, the more severe the environmental damage, leading to a significant increase in the measures required to reduce these emissions, thereby raising cooperation costs. Second, assume that both regions need to coordinate technologies and share resources during the governance process. Another part of the cooperation cost involves the total expenditure of both regions on governance investments, technical coordination, and facility construction.

By combining these two factors, the cooperation cost

c_{C}

can be expressed as the total expenses incurred by the two regions for addressing cross-border pollution emissions and governance investments. The specific formula is

c_{C} = k_{1} g_{A} + k_{2} g_{B} + k_{3} h_{A} + k_{4} h_{B}

(4)

where the coefficients

k_{1}

and

k_{2}

represent the cost weights for each region in reducing pollution emissions, and

k_{3}

,

k_{4}

reflect the cost weights related to governance investment coordination and technical sharing. By linking cooperation costs with pollution emission levels and governance investments, a more accurate assessment of the actual economic burden of cooperation and its impact on overall utility can be achieved.

3. Materials and Methods

3.1. Differential Game Method

This study utilizes differential game theory, a mathematical framework designed to model and analyze decision making in dynamic systems involving multiple agents with conflicting objectives. Specifically, the method was applied to formulate and solve problems where each agent’s strategy evolves over time. The Nash equilibrium was employed to identify optimal strategies, ensuring that no agent has an incentive to deviate from their chosen approach. This method facilitates the analysis of complex dynamic interactions and provides insights into optimal control within competitive scenarios.

3.2. AI Tools Disclosure

The author acknowledges the use of AI tools in preparing this manuscript. In particular, ChatGPT (GPT-4, https://www.openai.com, accessed on 15 October 2024) was used to enhance language clarity and accuracy by correcting grammar, refining sentence structures, and improving word choice. It is important to note that all academic content, research data, and analyses were conducted solely by the authors, with the AI tool serving only to improve the linguistic quality of the text.

4. Differential Game Model

4.1. Nash Noncooperative Game

In this scenario, Region A and Region B make decisions independently. Under the constraint of dynamic changes in pollution capacity, each region selects its optimal emission and pollution control paths to maximize its instantaneous net benefits. Therefore, in the noncooperative case, the utility functions and constraints for both regions are as follows:

Each local government seeks to maximize its own utility. The utility functions for the local governments in Region A and Region B are expressed as follows:

U_{A} = \max_{g_{A} (t)} \int_{0}^{\infty} e^{- p t} \{\begin{array}{l} α_{A} g_{A} (t) - \frac{1}{2} g_{A}^{2} (t) - \frac{1}{2} c_{A} h_{A}^{2} (t) - τ [g_{A} (t) - ω_{A} h_{A} (t)] \\ - ε_{A A} g_{A} (t) - ε_{B A} g_{B} (t) - d x (t) \end{array}\} d t

(5)

U_{B} = \max_{g_{B} (t)} \int_{0}^{\infty} e^{- p t} \{\begin{array}{l} α_{B} g_{B} (t) - \frac{1}{2} g_{B}^{2} (t) - \frac{1}{2} c_{B} h_{B}^{2} (t) - τ [g_{B} (t) - ω_{B} h_{B} (t)] \\ - ε_{B B} g_{B} (t) - ε_{A B} g_{A} (t) - β d x (t) \end{array}\} d t

(6)

s . t . \overset{\cdot}{x} (t) = \sum_{i = A, B} g_{i} (t) - \sum_{i = A, B} ω_{i} h_{i} (t) - ζ x (t)

(7)

Here,

p > 0

represents the discount rate or discount factor.

Since the equilibrium of open-loop strategies cannot ensure time consistency and fails to reflect the dynamic changes in decision structures caused by pollution emissions, the optimal strategies derived from the feedback Nash equilibrium provide consistency in both state and time. Therefore, this paper addresses the Markov feedback Nash equilibrium of the differential game using the following Hamilton–Jacobi–Bellman (HJB) equation. For simplicity, the time variable

t

is omitted in the subsequent analysis.

ρ V_{A}^{N} (x) = \max_{g_{A} {, h}_{A}} \{\begin{array}{l} α_{A} g_{A} - \frac{1}{2} g_{A}^{2} - \frac{1}{2} c_{A} h_{A}^{2} - τ (g_{A} - ω_{A} h_{A}) - ε_{A A} g_{A} \\ - ε_{B A} g_{B} - d x + \frac{\partial V_{A}^{N}}{\partial x} (\sum_{i = A, B} g_{i} - \sum_{i = A, B} ω_{i} h_{i} - ζ x) \end{array}\}

(8)

ρ V_{B}^{N} (x) = \max_{g_{B} {, h}_{B}} \{\begin{array}{l} α_{B} g_{B} - \frac{1}{2} g_{B}^{2} - \frac{1}{2} c_{B} h_{B}^{2} - τ (g_{B} - ω_{B} h_{B}) - ε_{B B} g_{B} \\ - ε_{A B} g_{A} - β d x + \frac{\partial V_{B}^{N}}{\partial x} (\sum_{i = A, B} g_{i} - \sum_{i = A, B} ω_{i} h_{i} - ζ x) \end{array}\}

(9)

The right side of the HJB equation above represents the maximization function. To determine its specific form, take the first-order partial derivatives with respect to the variables

g_{A}^{N}, g_{B}^{N}, h_{A}^{N}, h_{B}^{N}

, and set these derivatives to zero. This yields the following maximization conditions:

\begin{array}{l} g_{A}^{N} = α_{A} - τ - ε_{A A} + \frac{\partial V_{A}^{N}}{\partial x} g_{B}^{N} = α_{B} - τ - ε_{B B} + \frac{\partial V_{B}^{N}}{\partial x} \\ h_{A}^{N} = \frac{ω_{A} (τ - \frac{\partial V_{A}^{N}}{\partial x})}{c_{A}} h_{B}^{N} = \frac{ω_{B} (τ - \frac{\partial V_{B}^{N}}{\partial x})}{c_{B}} \end{array}

(10)

Substitute Equation (10) into the HJB equation and simplify to obtain the following:

ρ V_{A}^{N} (x) = - (d + ζ \frac{\partial V_{A}^{N}}{\partial x}) x + \{\begin{array}{l} \frac{(c_{A} + {ω_{A}}^{2}) {(\frac{\partial V_{A}^{N}}{\partial x})}^{2}}{2 c_{A}} \\ + \frac{1}{c_{A} c_{B}} \frac{\partial V_{A}^{N}}{\partial x} \{\begin{array}{l} c_{B} (α_{A} c_{A} + α_{B} c_{A} - c_{A} (ε_{A A} + ε_{B B} + 2 τ) - τ {ω_{A}}^{2}) \\ - c_{A} τ ω_{B} {}^{2}+ c_{A} (c_{B} + {ω_{B}}^{2}) {(\frac{\partial V_{B}^{N}}{\partial x})}^{2} \end{array}\} \\ + \frac{1}{{2 c}_{A}} \{\begin{array}{l} c_{A} (\begin{array}{l} α_{A} {}^{2}+ ε_{A A} {}^{2}- 2 α_{B} ε_{B A} + 2 ε_{B A} ε_{B B} + τ^{2} \\ + 2 τ (ε_{A A} + ε_{B A}) - 2 α_{A} (ε_{A A} + τ) \end{array}) \\ + τ^{2} ω_{A} {}^{2}- 2 c_{A} ε_{B A} \frac{\partial V_{B}^{N}}{\partial x} \end{array}\} \end{array}\}

(11a)

ρ V_{B}^{N} (x) = - (β d + ζ \frac{\partial V_{B}^{N}}{\partial x}) x + \{\begin{array}{l} \frac{(c_{B} + {ω_{B}}^{2}) {(\frac{\partial V_{B}^{N}}{\partial x})}^{2}}{2 c_{B}} \\ + \frac{1}{2} \{\begin{array}{l} α_{B} {}^{2}- 2 (α_{A} - ε_{A A}) ε_{A B} + 2 τ ε_{A B} + τ^{2} {ω_{B}}^{2} \\ - 2 α_{B} (ε_{B B} + τ) + {(ε_{B B} + τ)}^{2} - 2 c_{B} ε_{A B} \frac{\partial V_{A}^{N}}{\partial x} \end{array}\} \\ + \frac{\frac{\partial V_{B}^{N}}{\partial x}}{c_{A} c_{B}} \{\begin{array}{l} c_{B} (α_{A} c_{A} + α_{B} c_{A} - c_{A} (ε_{A A} + ε_{B B} + 2 τ) - τ {ω_{A}}^{2}) \\ - c_{A} τ ω_{B} {}^{2}+ c_{B} (c_{A} + {ω_{A}}^{2}) \frac{\partial V_{A}^{N}}{\partial x} \end{array}\} \end{array}\}

(11b)

Based on the structure of Equation (11a,b), it can be concluded that the linear optimal value function of

X

is the solution to the HJB equation. Let

V_{A}^{N} (x) = k_{A}^{N} x + b_{A}^{N}, V_{B}^{N} (x) = k_{B}^{N} x + b_{B}^{N}

(12)

Here,

k_{A}^{N}

,

b_{A}^{N}

,

k_{B}^{N}

, and

b_{B}^{N}

are constants to be determined. By substituting Equation (12) and its derivatives into Equation (11a,b), we obtain the following:

ρ (k_{A}^{N} x + b_{A}^{N}) = - (d + ζ k_{A}^{N}) x + \{\begin{array}{l} \frac{(c_{A} + {ω_{A}}^{2}) {(k_{A}^{N})}^{2}}{2 c_{A}} \\ + \frac{k_{A}^{N}}{c_{A} c_{B}} \{\begin{array}{l} c_{B} (α_{A} c_{A} + α_{B} c_{A} - c_{A} (ε_{A A} + ε_{B B} + 2 τ) - τ {ω_{A}}^{2}) \\ - c_{A} τ ω_{B} {}^{2}+ c_{A} (c_{B} + {ω_{B}}^{2}) {(k_{B}^{N})}^{2} \end{array}\} \\ + \frac{1}{2} \{\begin{array}{l} α_{A} {}^{2}+ ε_{A A} {}^{2}- 2 α_{B} ε_{B A} + 2 ε_{B A} ε_{B B} \\ + 2 τ (ε_{A A} + ε_{B A}) + τ^{2} - 2 α_{A} (ε_{A A} + τ) \end{array}\} \\ + \frac{τ^{2} ω_{A} {}^{2}- 2 c_{A} ε_{B A} k_{B}^{N}}{2 c_{A}} \end{array}\}

(13a)

ρ (k_{B}^{N} x + b_{B}^{N}) = - (β d + ζ k_{B}^{N}) x + \{\begin{array}{l} \frac{(c_{B} + {ω_{B}}^{2}) {(k_{B}^{N})}^{2}}{2 c_{B}} \\ + \frac{1}{2} \{\begin{array}{l} α_{B} {}^{2}- 2 (α_{A} - ε_{A A}) ε_{A B} + 2 τ ε_{A B} + τ^{2} {ω_{B}}^{2} \\ - 2 α_{B} (ε_{B B} + τ) + {(ε_{B B} + τ)}^{2} - 2 c_{B} ε_{A B} k_{A}^{N} \end{array}\} \\ + \frac{k_{B}^{N}}{c_{A} c_{B}} \{\begin{array}{l} c_{B} (α_{A} c_{A} + α_{B} c_{A} - c_{A} (ε_{A A} + ε_{B B} + 2 τ) - τ {ω_{A}}^{2}) \\ - c_{A} τ ω_{B} {}^{2}+ c_{B} (c_{A} + {ω_{A}}^{2}) k_{A}^{N} \end{array}\} \end{array}\}

(13b)

Based on the previous assumptions, Equations

V_{A}^{N} (x)

and

V_{B}^{N} (x)

must hold for all

x \geq 0

. Consequently, the values of

k_{A}^{N}

,

b_{A}^{N}

and

k_{B}^{N}

,

b_{B}^{N}

can be determined as follows:

k_{A}^{N} = - \frac{d}{ρ + ζ}

(14a)

k_{B}^{N} = - \frac{β d}{ρ + ζ}

(14b)

b_{A}^{N} = \{\begin{array}{l} \frac{c_{B} {(d + (ρ + ζ) τ)}^{2} ω_{A} {}^{2}+ 2 c_{A} d (d β + (ρ + ζ) τ) {ω_{B}}^{2}}{2 ρ c_{A} c_{B} {(ρ + ζ)}^{2}} \\ + \frac{d^{2} (1 + 2 β) - 2 d (ρ + ζ) (α_{A} + α_{B} - ε_{A A} - β ε_{B A} - ε_{B B} - 2 τ)}{2 ρ {(ρ + ζ)}^{2}} \\ + \frac{{(α_{A} - ε_{A A})}^{2} + 2 ε_{B A} (ε_{B B} - α_{B}) + 2 τ (- α_{A} + ε_{A A} + ε_{B A}) + τ^{2}}{2 ρ} \end{array}\}

(15a)

b_{B}^{N} = \{\begin{array}{l} \frac{d β (d + (ρ + ζ) τ) {ω_{A}}^{2}}{ρ c_{A} {(ρ + ζ)}^{2}} + \frac{+ {(d β + (ρ + ζ) τ)}^{2} {ω_{B}}^{2}}{2 ρ c_{B} {(ρ + ζ)}^{2}} \\ + \frac{d^{2} β (2 + β) + 2 d (ρ + ζ) (ε_{A B} + β (- α_{A} - α_{B} + ε_{A A} + ε_{B B} + 2 τ))}{2 ρ {(ρ + ζ)}^{2}} \\ + \frac{α_{B} {}^{2}+ 2 (- α_{A} + ε_{A A}) ε_{A B} + 2 τ ε_{A B} - 2 α_{B} {(ε_{B B} + τ)}^{2}}{2 ρ} \end{array}\}

(15b)

By substituting

k_{A}^{N}

,

b_{A}^{N}

,

k_{B}^{N}

, and

b_{B}^{N}

into Equations (10) and (12), we derive Proposition 1:

Proposition 1.

Under the Nash equilibrium between the two local governments, the pollution emissions

g_{A}^{N^{*}}

and

g_{B}^{N^{*}}

, the pollution control investments

h_{A}^{N^{*}}

and

h_{B}^{N^{*}}

, and the total benefits

V_{A}^{N} {}^{*}(x)

and

V_{B}^{N} {}^{*}(x)

are as follows:

g_{A}^{N *} = \frac{(α_{A} - τ - ε_{A A}) (ρ + ζ) - d}{ρ + ζ}

(16a)

g_{B}^{N *} = \frac{(α_{B} - τ - ε_{B B}) (ρ + ζ) - β d}{ρ + ζ}

(16b)

h_{A}^{N *} = \frac{ω_{A} [τ (ρ + ζ) + d]}{c_{A} (ρ + ζ)}

(17a)

h_{B}^{N *} = \frac{ω_{B} [τ (ρ + ζ) + β d]}{c_{B} (ρ + ζ)}

(17b)

V_{A}^{N} {}^{*}(x) = - \frac{d x}{ρ + ζ} + \{\begin{array}{l} \frac{c_{B} {(d + (ρ + ζ) τ)}^{2} ω_{A} {}^{2}+ 2 c_{A} d (d β + (ρ + ζ) τ) {ω_{B}}^{2}}{2 ρ c_{A} c_{B} {(ρ + ζ)}^{2}} \\ + \frac{d^{2} (1 + 2 β) - 2 d (ρ + ζ) (α_{A} + α_{B} - ε_{A A} - β ε_{B A} - ε_{B B} - 2 τ)}{2 ρ {(ρ + ζ)}^{2}} \\ + \frac{{(α_{A} - ε_{A A})}^{2} + 2 ε_{B A} (ε_{B B} - α_{B}) + 2 τ (- α_{A} + ε_{A A} + ε_{B A}) + τ^{2}}{2 ρ} \end{array}\}

(18a)

V_{B}^{N} {}^{*}(x) = - \frac{β dx}{ρ + ζ} + \{\begin{array}{l} \frac{d β (d + (ρ + ζ) τ) {ω_{A}}^{2}}{ρ c_{A} {(ρ + ζ)}^{2}} + \frac{+ {(d β + (ρ + ζ) τ)}^{2} {ω_{B}}^{2}}{2 ρ c_{B} {(ρ + ζ)}^{2}} \\ + \frac{d^{2} β (2 + β) + 2 d (ρ + ζ) (ε_{A B} + β (- α_{A} - α_{B} + ε_{A A} + ε_{B B} + 2 τ))}{2 ρ {(ρ + ζ)}^{2}} \\ + \frac{α_{B} {}^{2}+ 2 (- α_{A} + ε_{A A}) ε_{A B} + 2 τ ε_{A B} - 2 α_{B} {(ε_{B B} + τ)}^{2}}{2 ρ} \end{array}\}

(18b)

To ensure that emissions remain positive, conditions

(α_{A} - τ - ε_{A A}) (ρ + ζ) - d > 0

and

(α_{B} - τ - ε_{B B}) (ρ + ζ) - β d > 0

must be satisfied.

From Equation (18a,b), the optimal benefit functions for local governments’ collaborative environmental governance can be derived.

V^{N} {}^{*}(x) = V_{A}^{N} {}^{*}(x) + V_{B}^{N} {}^{*}(x) = (18 a) + (18 b)

In this model, the dynamic changes in pollutant stock over time

t

for the two regions are represented by

x^{N} (t)

:

\{\begin{cases} \overset{\cdot}{x} (t) = A_{N} - ζ x (t) \\ x^{N} (0) = x_{0} \\ A_{N} = \{\begin{array}{l} \frac{(α_{A} + α_{B} - ζ - 2 τ - ε_{A A} - ε_{B B}) (ρ + ζ) - d (1 + β)}{ρ + ζ} \\ - \frac{ω_{A} {}^{2}{[τ (ρ + ζ) + d]}}{c_{A} (ρ + ζ)} - \frac{ω_{B} {}^{2}{[τ (ρ + ζ) + β d]}}{c_{B} (ρ + ζ)} \end{array}\} \\ x^{N} (t) = \frac{A_{N}}{ζ} + (x_{0} - \frac{A_{N}}{ζ}) e^{- ζ t} \end{cases}

(19)

4.2. Stackelberg Game

In this scenario, since Region A possesses more advanced pollution control technology and experiences more severe pollution, a pollution control cost compensation mechanism is implemented to reduce total pollution capacity. Under this mechanism, the more developed Region A will take on part of the pollution control costs. As a result, the two independent governments make decisions following a Stackelberg leader–follower game structure: Region A first determines the pollution emission level, pollution control investment, and compensation rate for pollution control costs that maximize its own value function. Based on this, Region B selects the pollution emission level and control investment that maximize its total benefits. The Stackelberg feedback Nash equilibrium is solved through backward induction, starting with the decision-making process of the less-developed Region B. Assuming that Region A’s compensation rate for pollution control costs is

θ

.

The optimal value function

V_{B}^{S}

for Region B satisfies the following HJB equation:

ρ V_{B}^{S} (x) = \max_{g_{B} {, h}_{B}} \{\begin{array}{l} α_{B} g_{B} - \frac{1}{2} g_{B}^{2} - \frac{1}{2} (1 - θ) c_{B} h_{B}^{2} - τ (g_{B} - ω_{B} h_{B}) - ε_{B B} g_{B} \\ - ε_{A B} g_{A} - β d x + \frac{\partial V_{B}^{S}}{\partial x} (\sum_{i = A, B} g_{i} - \sum_{i = A, B} ω_{i} h_{i} - ζ x) \end{array}\}

(20)

By taking the derivatives of Equation (20) with respect to pollution emissions and pollution control investment, and maximizing the first-order optimal conditions, we obtain the following:

g_{B} = α_{B} - τ - ε_{B B} + {\frac{\partial V_{B}^{S}}{\partial x}}_{,} h_{B} = \frac{ω_{B} (τ - \frac{\partial V_{B}^{S}}{\partial x})}{(1 - θ) c_{B}}

(21)

Since Region A is more developed, it will rationally predict Region B’s choices and select its optimal pollution emissions and pollution control investment based on Region B’s reaction function. Substituting this reaction function into Region A’s HJB equation and simplifying, we obtain the following:

ρ V_{A}^{S} (x) = \max_{g_{A} {, h}_{A}, θ} \{\begin{array}{l} α_{A} g_{A} - \frac{1}{2} g_{A}^{2} - \frac{1}{2} c_{A} h_{A}^{2} - \frac{1}{2} θ c_{B} {[\frac{ω_{B} (τ - \frac{\partial V_{B}^{S}}{\partial x})}{(1 - θ) c_{B}}]}^{2} \\ - τ (g_{A} - ω_{A} h_{A}) - ε_{A A} g_{A} - ε_{B A} (α_{B} - τ - ε_{B B} + \frac{\partial V_{B}^{S}}{\partial x}) - d x \\ + \frac{\partial V_{A}^{S}}{\partial x} [\begin{array}{l} g_{A} + (α_{B} - τ - ε_{B B} + \frac{\partial V_{B}^{S}}{\partial x}) - ω_{A} h_{A} \\ - ω_{B} \frac{ω_{B} (τ - \frac{\partial V_{B}^{S}}{\partial x})}{(1 - θ) c_{B}} - ζ x \end{array}] \end{array}\}

(22)

From this, the first-order optimal conditions for maximizing Region A’s pollution emissions, pollution control investments

g_{A}

and

h_{A}

, and the pollution control cost compensation rate

θ

are derived as follows:

g_{A} = α_{A} - τ - ε_{A A} + {\frac{\partial V_{A}^{S}}{\partial x}}_{,} h_{A} = {\frac{ω_{A} (τ - \frac{\partial V_{A}^{S}}{\partial x})}{c_{A}}}_{,} θ = \frac{2 \frac{\partial V_{A}^{S}}{\partial x} - \frac{\partial V_{B}^{S}}{\partial x} + τ}{2 \frac{\partial V_{A}^{S}}{\partial x} + \frac{\partial V_{B}^{S}}{\partial x} - τ}

(23)

Substituting the optimal decision conditions for both Region A and Region B into their respective HJB equations and simplifying, we obtain

ρ V_{A}^{S} (x) = - (d + ζ \frac{\partial V_{A}^{S}}{\partial x}) x + \{\begin{array}{l} \frac{c_{B} τ^{2} {ω_{A}}^{2}}{2 c_{A} c_{B}} + \frac{(- 2 c_{B} ε_{B A} + (2 c_{B} + {ω_{B}}^{2}) \frac{\partial V_{A}^{N}}{\partial x}) \frac{\partial V_{B}^{N}}{\partial x}}{2 c_{B}} \\ + \frac{\frac{\partial V_{A}^{N}}{\partial x}}{2 c_{A} c_{B}} \{\begin{array}{l} 2 c_{B} (\begin{array}{l} α_{A} c_{A} + α_{B} c_{A} - τ {ω_{A}}^{2} \\ - c_{A} (ε_{A A} + ε_{B B} + 2 τ) \end{array}) \\ - c_{A} τ ω_{B} {}^{2}+ (c_{B} (c_{A} + {ω_{A}}^{2}) + 2 c_{A} {ω_{B}}^{2}) \frac{\partial V_{A}^{N}}{\partial x} \end{array}\} \\ + \frac{1}{2} (α_{A} {}^{2}+ ε_{A A} {}^{2}+ τ^{2}) - α_{B} ε_{B A} + ε_{B A} ε_{B B} \\ + (ε_{A A} + ε_{B A}) τ - α_{A} (ε_{A A} + τ) \end{array}\}

(24a)

ρ V_{B}^{S} (x) = - (β d + ζ \frac{\partial V_{B}^{S}}{\partial x}) x + \{\begin{array}{l} \frac{1}{2} \{α_{B} {}^{2}+ {(ε_{B B} + τ)}^{2}\} - ((α_{A} - ε_{A A}) ε_{A B}) + τ ε_{A B} - α_{B} (ε_{B B} + τ) \\ + \frac{1}{8 c_{B}} \{\begin{array}{l} 3 τ^{2} ω_{B} {}^{2}+ (4 c_{B} + 3 {ω_{B}}^{2}) {(\frac{\partial V_{B}^{N}}{\partial x})}^{2} \\ - 4 \frac{\partial V_{A}^{N}}{\partial x} (2 c_{B} ε_{A B} + τ ω_{B} {}^{2}+ {ω_{B}}^{2} \frac{\partial V_{A}^{N}}{\partial x}) \end{array}\} \\ + \frac{\frac{\partial V_{B}^{N}}{\partial x}}{{4 c}_{A} c_{B}} \{(\begin{array}{l} 4 c_{B} (α_{A} c_{A} + α_{B} c_{A} - c_{A} (ε_{A A} + ε_{B B} + 2 τ) - τ {ω_{A}}^{2}) \\ - 3 c_{A} τ ω_{B} {}^{2}+ 2 (2 c_{B} (c_{A} + {ω_{A}}^{2}) + c_{A} {ω_{B}}^{2}) \frac{\partial V_{A}^{N}}{\partial x} \end{array})\} \end{array}\}

(24b)

Similarly, based on the structure of Equation (24a,b), it can be deduced that the linear structure of

X

is the solution to the value function, i.e., let

V_{A}^{S} (x) = k_{A}^{S} x + b_{A}^{S}, V_{B}^{S} (x) = k_{B}^{S} x + b_{B}^{S}

(25)

Here,

k_{A}^{N}

,

b_{A}^{N}

,

k_{B}^{N}

, and

b_{B}^{N}

are constants to be determined. By substituting Equation (25) and its first-order derivative with respect to

X

into Equation (24a,b), we obtain the following:

ρ (k_{A}^{S} x + b_{A}^{S}) = - (d + ζ k_{A}^{S}) x + \{\begin{array}{l} \frac{c_{B} τ^{2} {ω_{A}}^{2}}{2 c_{A} c_{B}} + \frac{(- 2 c_{B} ε_{B A} + (2 c_{B} + {ω_{B}}^{2}) k_{A}^{S}) k_{B}^{S}}{2 c_{B}} - α_{B} ε_{B A} + ε_{B A} ε_{B B} \\ + \frac{k_{A}^{S}}{2 c_{A} c_{B}} \{\begin{array}{l} 2 c_{B} (α_{A} c_{A} + α_{B} c_{A} - c_{A} (ε_{A A} + ε_{B B} + 2 τ) - τ {ω_{A}}^{2}) \\ - c_{A} τ ω_{B} {}^{2}+ (c_{B} (c_{A} + {ω_{A}}^{2}) + 2 c_{A} {ω_{B}}^{2}) k_{A}^{S} \end{array}\} \\ + \frac{1}{2} (α_{A} {}^{2}+ ε_{A A} {}^{2}+ τ^{2}) + (ε_{A A} + ε_{B A}) τ - α_{A} (ε_{A A} + τ) \end{array}\}

(26a)

ρ (k_{B}^{S} x + b_{B}^{S}) = - (β d + ζ k_{B}^{S}) x + \{\begin{array}{l} \frac{1}{2} (α_{B} {}^{2}+ {(ε_{B B} + τ)}^{2}) - (α_{A} - ε_{A A}) ε_{A B} + τ ε_{A B} - α_{B} (ε_{B B} + τ) \\ \frac{1}{8 c_{B}} \{3 τ^{2} ω_{B} {}^{2}- 4 k_{A}^{S} (2 c_{B} ε_{A B} + τ ω_{B} {}^{2}+ ω_{B} {}^{2}k_{A}^{S}) + (4 c_{B} + 3 {ω_{B}}^{2}) {(k_{B}^{S})}^{2}\} \\ + \frac{k_{B}^{S}}{{4 c}_{A} c_{B}} \{(\begin{array}{l} 4 c_{B} (α_{A} c_{A} + α_{B} c_{A} - c_{A} (ε_{A A} + ε_{B B} + 2 τ) - τ {ω_{A}}^{2}) \\ - 3 c_{A} τ ω_{B} {}^{2}+ 2 (2 c_{B} (c_{A} + {ω_{A}}^{2}) + c_{A} {ω_{B}}^{2}) k_{A}^{S} \end{array})\} \end{array}\}

(26b)

Based on the previous assumptions, Equations

V_{A}^{S} (x)

and

V_{B}^{S} (x)

must hold for all

x \geq 0

; thus, the values of

k_{A}^{S}

,

b_{A}^{S}

,

k_{B}^{S}

, and

b_{B}^{S}

can be determined as follows:

k_{A}^{S} = - \frac{d}{ρ + ζ}

(27a)

k_{B}^{S} = - \frac{β d}{ρ + ζ}

(27b)

b_{A}^{S} = \{\begin{array}{l} \frac{2 (d^{2} (1 + 2 β) - 2 d (ρ + ζ) (α_{A} + α_{B} - ε_{A A} - β ε_{B A} - ε_{B B} - 2 τ))}{ρ {(ρ + ζ)}^{2}} \\ + \frac{2 ({(α_{A} - ε_{A A})}^{2} + 2 ε_{B A} (ε_{B B} - α_{B}) + 2 τ (- α_{A} + ε_{A A} + ε_{B A}) + τ^{2})}{ρ} \\ + \frac{4 c_{B} {(d + (ρ + ζ) τ)}^{2} ω_{A} {}^{2}+ c_{A} {(d (2 + β) + (ρ + ζ) τ)}^{2} {ω_{B}}^{2}}{2 c_{A} c_{B} ρ {(ρ + ζ)}^{2}} \end{array}\}

(28a)

b_{B}^{S} = \{\begin{array}{l} \frac{d β (d + (ρ + ζ) τ) {ω_{A}}^{2}}{c_{A} ρ {(ρ + ζ)}^{2}} + \frac{τ ω_{B} {}^{2}{(d β + (ρ + ζ) τ)} (d (2 + β) + (ρ + ζ))}{4 c_{B} ρ {(ρ + ζ)}^{2}} \\ + \frac{ω_{B} {}^{2}{(d^{2} β (2 + β) + 2 d (ρ + ζ) (ε_{A B} + β (- α_{A} - α_{B} + ε_{A A} + ε_{B B} + 2 τ)))}}{2 ρ {(ρ + ζ)}^{2}} \\ + \frac{{ω_{B}}^{2} (α_{B} {}^{2}+ 2 (- α_{A} + ε_{A A}) ε_{A B} + 2 ε_{A B} τ - 2 α_{B} (ε_{B B} + τ) + {(ε_{B B} + τ)}^{2})}{2 ρ} \end{array}\}

(28b)

By substituting

k_{A}^{S}

,

b_{A}^{S}

,

k_{B}^{S}

, and

b_{B}^{S}

into Equations (21) and (23), we derive Proposition 2:

Proposition 2.

Under the Nash equilibrium between the two local governments, the pollution emissions

g_{A}^{S^{*}}

and

g_{B}^{S^{*}}

, pollution control investments

h_{A}^{S^{*}}

and

h_{B}^{S^{*}}

, pollution control cost compensation rate

θ^{*}

, and total benefits

V_{A}^{S} {}^{*}(x)

and

V_{B}^{S} {}^{*}(x)

are as follows:

\{\begin{cases} g_{A}^{S *} = \frac{(α_{A} - τ - ε_{A A}) (ρ + ζ) - d}{ρ + ζ} \\ g_{B}^{S *} = \frac{(α_{B} - τ - ε_{B B}) (ρ + ζ) - β d}{ρ + ζ} \end{cases}

(29)

\{\begin{cases} h_{A}^{S *} = \frac{ω_{A} [τ (ρ + ζ) + d]}{c_{A} (ρ + ζ)} \\ h_{B}^{S *} = \frac{ω_{B} [d (2 + β) + τ (ρ + ζ)]}{{2 c}_{B} (ρ + ζ)} \end{cases}

(30)

θ^{*} = \frac{d (2 - β) - τ (ρ + ζ)}{d (2 + β) + τ (ρ + ζ)}, d > \frac{τ (ρ + ζ)}{2 c_{B} - β}

(31)

V_{A}^{S} {}^{*}(x) = - \frac{d x}{ρ + ζ} + \{\begin{array}{l} \frac{2 (d^{2} (1 + 2 β) - 2 d (ρ + ζ) (α_{A} + α_{B} - ε_{A A} - β ε_{B A} - ε_{B B} - 2 τ))}{ρ {(ρ + ζ)}^{2}} \\ + \frac{2 ({(α_{A} - ε_{A A})}^{2} + 2 ε_{B A} (ε_{B B} - α_{B}) + 2 τ (- α_{A} + ε_{A A} + ε_{B A}) + τ^{2})}{ρ} \\ + \frac{4 c_{B} {(d + (ρ + ζ) τ)}^{2} ω_{A} {}^{2}+ c_{A} {(d (2 + β) + (ρ + ζ) τ)}^{2} {ω_{B}}^{2}}{2 c_{A} c_{B} ρ {(ρ + ζ)}^{2}} \end{array}\}

(32a)

V_{B}^{S} {}^{*}(x) = - \frac{β dx}{ρ + ζ} + \{\begin{array}{l} \frac{d β (d + (ρ + ζ) τ) {ω_{A}}^{2}}{c_{A} ρ {(ρ + ζ)}^{2}} + \frac{τ ω_{B} {}^{2}{(d β + (ρ + ζ) τ)} (d (2 + β) + (ρ + ζ))}{4 c_{B} ρ {(ρ + ζ)}^{2}} \\ + \frac{ω_{B} {}^{2}{(d^{2} β (2 + β) + 2 d (ρ + ζ) (ε_{A B} + β (- α_{A} - α_{B} + ε_{A A} + ε_{B B} + 2 τ)))}}{2 ρ {(ρ + ζ)}^{2}} \\ + \frac{{ω_{B}}^{2} (α_{B} {}^{2}+ 2 (- α_{A} + ε_{A A}) ε_{A B} + 2 ε_{A B} τ - 2 α_{B} (ε_{B B} + τ) + {(ε_{B B} + τ)}^{2})}{2 ρ} \end{array}\}

(32b)

To ensure that emissions remain positive, conditions

(α_{A} - τ - ε_{A A}) (ρ + ζ) - d > 0

and

(α_{B} - τ - ε_{B B}) (ρ + ζ) - β d > 0

must be satisfied.

From Equation (32a,b), the optimal benefit functions for local governments’ collaborative environmental governance can be derived.

V^{S} {}^{*}(x) = V_{A}^{S} {}^{*}(x) + V_{B}^{S} {}^{*}(x) = (32 a) + (32 b)

In this model, the dynamic changes in pollutant stock over time

t

for the two regions are represented by

x^{S} (t)

:

\{\begin{cases} \overset{\cdot}{x} (t) = A_{S} - ζ x (t) \\ x^{S} (0) = x_{0} \\ A_{S} = \{\begin{array}{l} \frac{(α_{A} + α_{B} - ζ - 2 τ - ε_{A A} - ε_{B B}) (ρ + ζ) - d (1 + β)}{(ρ + ζ)} \\ - \frac{ω_{A} {}^{2}{[τ (ρ + ζ) + d]}}{c_{A} (ρ + ζ)} - \frac{ω_{B} {}^{2}{[τ (ρ + ζ) + (2 + β) d]}}{{2 c}_{B} (ρ + ζ)} \end{array}\} \\ x^{S} (t) = \frac{A_{S}}{ζ} + (x_{0} - \frac{A_{S}}{ζ}) e^{- ζ t} \end{cases}

(33)

4.3. Cooperative Game

In this scenario, the two regions collaborate to form a unified ecological environment system, aiming to maximize the overall system benefit. By determining the optimal pollution emissions and pollution control investments, the entire system reaches its optimal state. Therefore, the total value function for both regions satisfies the following HJB equation:

ρ V_{A B}^{C} (x) = \max_{g_{A} {, h}_{A} {, g}_{B} {, h}_{B}} \{\begin{array}{l} - (1 + β) d x + α_{A} g_{A} + α_{B} g_{B} - \frac{1}{2} g_{A}^{2} - \frac{1}{2} g_{B}^{2} - \frac{1}{2} c_{A} h_{A}^{2} - \frac{1}{2} c_{B} h_{B}^{2} \\ - τ (g_{A} + g_{B} - ω_{A} h_{A} - ω_{B} h_{B}) - ε_{A A} g_{A} - ε_{B A} g_{B} - ε_{B B} g_{B} - ε_{A B} g_{A} \\ + \frac{\partial V_{A B}^{C}}{\partial x} (\sum_{i = A, B} g_{i} - \sum_{i = A, B} ω_{i} h_{i} - ζ x) - (k_{1} g_{A} + k_{2} g_{B} + k_{3} h_{A} + k_{4} h_{B}) \end{array}\}

(34)

The right-hand side of this HJB equation represents the maximization function. To determine its specific form, take the first-order partial derivatives with respect to the variables

g_{A}^{C}

,

g_{B}^{C}

,

h_{A}^{C}

, and

h_{B}^{C}

, and set them to zero. This results in the following maximization conditions:

\begin{array}{l} g_{A}^{C} = α_{A} - τ - ε_{A A} + \frac{\partial V_{A B}^{C}}{\partial x} - k_{1} g_{B}^{C} = α_{B} - τ - ε_{B B} + \frac{\partial V_{A B}^{C}}{\partial x} - k_{2} \\ h_{A}^{C} = \frac{ω_{A} (τ - \frac{\partial V_{A B}^{C}}{\partial x}) - k_{3}}{c_{A}} h_{B}^{C} = \frac{ω_{B} (τ - \frac{\partial V_{A B}^{C}}{\partial x}) - k_{4}}{c_{B}} \end{array}

(35)

Substituting Equation (35) into the HJB Equation (34) and simplifying, we obtain the following:

ρ V_{A B}^{C} (x) = - (1 + β) d x + \{\begin{array}{l} \frac{{(α_{A} - k_{1})}^{2} + {(α_{B} - k_{2})}^{2} + ε_{A A} {}^{2}+ {ε_{B B}}^{2}}{2} + \frac{{k_{3}}^{2}}{2 c_{A}} + \frac{{k_{4}}^{2}}{2 c_{B}} \\ + ε_{A A} ε_{A B} + ε_{B A} ε_{B B} + τ^{2} \\ + τ (k_{1} + k_{2} - 2 \frac{\partial V_{A B}^{C}}{\partial x} + ε_{A A} + ε_{A B} + ε_{B A} + ε_{B B}) \\ - \frac{\partial V_{A B}^{C}}{\partial x} (ε_{A A} + ε_{A B} + ε_{B A} + ε_{B B}) + {(\frac{\partial V_{A B}^{C}}{\partial x})}^{2} \\ + k_{1} (ε_{A A} + ε_{A B} - \frac{\partial V_{A B}^{C}}{\partial x}) + k_{2} (ε_{B B} + ε_{B A} - \frac{\partial V_{A B}^{C}}{\partial x}) \\ + α_{A} (\frac{\partial V_{A B}^{C}}{\partial x} - ε_{A A} - ε_{A B} + τ) + α_{B} (\frac{\partial V_{A B}^{C}}{\partial x} - ε_{B A} - ε_{B B} - τ) \\ + \frac{1}{2 c_{A}} \{(\frac{\partial V_{A B}^{C}}{\partial x} - τ) ω_{A} (2 k_{3} + (\frac{\partial V_{A B}^{C}}{\partial x} - τ) ω_{A})\} \\ + \frac{1}{2 c_{B}} \{(\frac{\partial V_{A B}^{C}}{\partial x} - τ) ω_{B} (2 k_{4} + (\frac{\partial V_{A B}^{C}}{\partial x} - τ) ω_{B})\} \end{array}\}

(36)

Based on the structure of Equation (36), it can be concluded that the linear optimal value function of

X

is the solution to the HJB equation. Let

V_{A B}^{C} (x) = k_{A B}^{C} x + b_{A B}^{C}

(37)

Substitute Equation (37) and its first-order derivative with respect to

\frac{\partial V_{A B}^{C}}{\partial x}

into Equation (36), and simplify to obtain the following:

k_{A B}^{C} = - \frac{(1 + β) d}{ρ + ζ}

(38)

b_{A B}^{C} = \{\begin{array}{l} \frac{α_{A} {}^{2}+ α_{B} {}^{2}+ (k_{1} + ε_{A A}) (k_{1} + 2 ε_{A B} + ε_{A A}) + (k_{2} + ε_{B B}) (k_{2} + 2 ε_{B A} + ε_{B B})}{2 ρ} \\ + \frac{{(d (1 + β) ω_{A} - (ρ + ζ) (k_{3} - τ ω_{A}))}^{2}}{2 c_{A} ρ {(ρ + ζ)}^{2}} + \frac{{(d (1 + β) ω_{B} - (ρ + ζ) (k_{4} - τ ω_{B}))}^{2}}{2 c_{B} ρ {(ρ + ζ)}^{2}} \\ + \frac{1}{ρ} \{\begin{array}{l} τ (k_{1} + k_{2} + ε_{A A} + ε_{A B} + ε_{B A} + ε_{B B} + τ) \\ - α_{A} {}^{2}{(k_{1} + ε_{A A} + ε_{A B} + τ)} - α_{B} {}^{2}{(k_{2} + ε_{B A} + ε_{B B} + τ)} \end{array}\} \\ + \frac{d^{2} {(1 + β)}^{2} - d (1 + β) (ρ + ζ) (α_{A} + α_{B} - k_{1} - k_{2} - ε_{A A} - ε_{A B} - ε_{B A} - ε_{B B} - 2 τ)}{ρ {(ρ + ζ)}^{2}} \end{array}\}

(39)

By substituting

k_{A B}^{C}

and

b_{A B}^{C}

into Equations (35) and (37), we derive Proposition 3:

Proposition 3.

Under the Nash equilibrium between the two local governments, the pollution emissions

g_{A}^{C^{*}}

and

g_{B}^{C^{*}}

, pollution control investments

h_{A}^{C^{*}}

and

h_{B}^{C^{*}}

, and total benefits

V_{A B}^{C} {}^{*}(x)

for the two regions are as follows:

\{\begin{cases} g_{A}^{C^{*}} = \frac{(α_{A} - τ - ε_{A A} - k_{1}) (ρ + ζ) - (1 + β) d}{ρ + ζ} \\ g_{B}^{C^{*}} = \frac{(α_{B} - τ - ε_{B B} - k_{2}) (ρ + ζ) - (1 + β) d}{ρ + ζ} \end{cases}

(40)

\{\begin{cases} h_{A}^{C *} = \frac{ω_{A} [τ (ρ + ζ) + (1 + β) d] - k_{3} (ρ + ζ)}{c_{A} (ρ + ζ)} \\ h_{B}^{C *} = \frac{ω_{B} [τ (ρ + ζ) + (1 + β) d] - k_{4} (ρ + ζ)}{c_{B} (ρ + ζ)} \end{cases}

(41)

V_{A B}^{C} {}^{*}(x) = - \frac{(1 + β) d x}{ρ + ζ} + \{\begin{array}{l} \frac{d^{2} {(1 + β)}^{2}}{ρ {(ρ + ζ)}^{2}} + \frac{{(d (1 + β) ω_{A} - (ρ + ζ) (k_{3} - τ ω_{A}))}^{2}}{2 c_{A} ρ {(ρ + ζ)}^{2}} \\ + \frac{α_{A} {}^{2}+ α_{B} {}^{2}+ (k_{1} + ε_{A A}) (k_{1} + 2 ε_{A B} + ε_{A A}) + (k_{2} + ε_{B B}) (k_{2} + 2 ε_{B A} + ε_{B B})}{2 ρ} \\ + \frac{1}{ρ} \{\begin{array}{l} τ (k_{1} + k_{2} + ε_{A A} + ε_{A B} + ε_{B A} + ε_{B B} + τ) \\ - α_{A} {}^{2}{(k_{1} + ε_{A A} + ε_{A B} + τ)} - α_{B} {}^{2}{(k_{2} + ε_{B A} + ε_{B B} + τ)} \end{array}\} \\ + \frac{{(d (1 + β) ω_{B} - (ρ + ζ) (k_{4} - τ ω_{B}))}^{2}}{2 c_{B} ρ {(ρ + ζ)}^{2}} \\ + \frac{d^{2} {(1 + β)}^{2} - d (1 + β) (ρ + ζ) (α_{A} + α_{B} - k_{1} - k_{2} - ε_{A A} - ε_{A B} - ε_{B A} - ε_{B B} - 2 τ)}{ρ {(ρ + ζ)}^{2}} \end{array}\}

(42)

In this model, the dynamic changes in pollutant stock over time

t

for the two regions are represented by

x^{C} (t)

:

\{\begin{cases} \overset{\cdot}{x} (t) = A_{C} - ζ x (t) \\ x^{C} (0) = x_{0} \\ A_{C} = \{\begin{array}{l} \frac{(α_{A} + α_{B} - ζ - 2 τ - ε_{A A} - ε_{B B} - k_{1} - - k_{2}) (ρ + ζ) - 2 d (1 + β)}{(ρ + ζ)} \\ - \frac{(c_{A} ω_{B} {}^{2}+ c_{B} {ω_{A}}^{2}) [τ (ρ + ζ) + (1 + β) d] - c_{A} k_{4} (ρ + ζ) - c_{B} k_{3} (ρ + ζ)}{c_{A} c_{B} (ρ + ζ)} \end{array}\} \\ x^{C} (t) = \frac{A_{C}}{ζ} + (x_{0} - \frac{A_{C}}{ζ}) e^{- ζ t} \end{cases}

(43)

5. Comparison of Equilibrium Results

The following is a comparison of the pollution emissions between the two regions under three different scenarios. Based on the analysis of Propositions 1 through 3:

g_{A}^{N *} = g_{A}^{S *} > g_{A}^{C *}, g_{B}^{N *} = g_{B}^{S *} > g_{B}^{C *}, h_{A}^{N *} = h_{A}^{S *} < h_{A}^{C *}, h_{B}^{C *} > h_{B}^{S *} > h_{B}^{N *}

The results indicate that when the developed region (Region A) compensates the less-developed region (Region B) for pollution control costs, the total pollution emissions of both regions remain unchanged. The pollution control investment of the developed region (Region A) also remains the same. However, the pollution control investment of the less-developed region (Region B) increases.

Based on Equations (19), (33), and (43), the differences in pollution stock between the two regions under noncooperative and cooperative games are

x^{N} (t) - x^{C} (t) = \frac{d [c_{A} c_{B} (β + 1) + c_{B} ω_{A} {}^{2}β + c_{A} {ω_{B}}^{2}]}{c_{A} c_{B} (ρ + ζ)} > 0

(44)

The difference in pollution stock between the two regions under the pollution control cost compensation game and the cooperative game is

x^{S} (t) - x^{C} (t) = \frac{{2 c}_{A} c_{B} d (β + 1) + {2 c}_{B} ω_{A} {}^{2}β d + c_{A} ω_{B} {}^{2}{[τ (ρ + ζ) + d β]}}{{2 c}_{A} c_{B} (ρ + ζ)} > 0

(45)

The difference in pollution stock between the two regions under the noncooperative game and the pollution control cost compensation game is

x^{N} (t) - x^{S} (t) = \frac{ω_{B} {}^{2}{[2 d - d β - τ (ρ + ζ)]}}{{2 c}_{B} (ρ + ζ)}

(46)

From the above analysis, it is clear that when the two regions engage in collaborative cooperation, the pollution capacity reaches its lowest level. When

d > \frac{τ (ρ + ζ)}{2 - β}

, indicating that environmental damage exceeds a certain threshold, Region A compensates Region B for pollution control costs. As a result, the pollution capacity is lower than when the two regions act independently. Therefore, from an environmental perspective, joint pollution control represents the Pareto optimal state, while pollution control cost compensation is the second-best Pareto state. In contrast, the Nash noncooperative game produces the least efficient outcome.

Based on Equations (18a,b), (32a,b), and (42), we obtain the following:

(1): The difference in Region A’s benefits between the pollution control cost compensation game and the noncooperative game is

V_{A}^{S} {}^{*}(x) - V_{A}^{N} {}^{*}(x) = \frac{{(d (β - 2) + τ (ρ + ζ))}^{2} {ω_{B}}^{2}}{8 c_{B} ρ {(ρ + ζ)}^{2}} > 0

(47)

(2): The difference in Region B’s benefits between the pollution control cost compensation game and the noncooperative game is

V_{B}^{S} {}^{*}(x) - V_{B}^{N} {}^{*}(x) = \frac{(2 d - d β - τ (ρ + ζ)) (d β + τ (ρ + ζ)) {ω_{B}}^{2}}{4 c_{B} ρ {(ρ + ζ)}^{2}} > 0

(48)

(3): The difference in the benefits of both regions between the cooperative game and the noncooperative game is

V_{A B}^{C} {}^{*}(x) - V_{A}^{N} {}^{*}(x) - V_{B}^{N} {}^{*}(x) = \{\begin{array}{l} \frac{d^{2} (1 + β^{2}) + 2 d (k_{1} + k_{2} + k_{1} β + k_{2} β + β ε_{A B} + ε_{B A}) (ρ + ζ)}{2 ρ {(ρ + ζ)}^{2}} \\ + \frac{{k_{3}}^{2}}{2 c_{A} ρ} + \frac{{k_{4}}^{2}}{2 c_{B} ρ} + \frac{d^{2} β^{2} {ω_{A}}^{2}}{2 c_{A} ρ {(ρ + ζ)}^{2}} - \frac{d^{2} {ω_{B}}^{2}}{2 c_{B} ρ {(ρ + ζ)}^{2}} - \frac{α_{A} {k_{1}}^{2}}{ρ} - \frac{α_{B} k_{2}}{ρ} \\ + \frac{k_{1} {}^{2}+ k_{2} {}^{2}+ 2 k_{1} (ε_{A A} + ε_{A B} + τ) + 2 k_{2} (ε_{B A} + ε_{B B} + τ)}{2 ρ} \\ - \frac{k_{3} (d (1 + β) + τ (ρ + ζ)) ω_{A}}{c_{A} ρ (ρ + ζ)} - \frac{k_{4} (d (1 + β) + (ρ + ζ) τ) ω_{B}}{c_{B} ρ (ρ + ζ)} \end{array}\}

(49)

(4): The difference in the benefits of both regions between the cooperative game and the pollution control cost compensation game is

V_{A B}^{C} {}^{*}(x) - V_{A}^{S} {}^{*}(x) - V_{B}^{S} {}^{*}(x) = \{\begin{array}{l} \frac{{k_{3}}^{2}}{2 c_{A} ρ} + \frac{{k_{4}}^{2}}{{2 c}_{B} ρ} + \frac{d^{2} β^{2} {ω_{A}}^{2}}{2 c_{A} ρ {(ρ + ζ)}^{2}} + \frac{{(d β + (ρ + ζ) τ)}^{2} {ω_{B}}^{2}}{8 c_{B} ρ {(ρ + ζ)}^{2}} \\ + \frac{d^{2} (1 + β^{2})}{2 ρ {(ρ + ζ)}^{2}} - \frac{2 α_{A} k_{1}}{2 ρ} - \frac{k_{3} (d (1 + β) + (ρ + ζ) τ) ω_{A}}{c_{A} ρ (ρ + ζ)} \\ + \frac{k_{1} {}^{2}+ 2 k_{1} (ε_{A A} + ε_{A B} + τ) + k_{2} (k_{2} + 2 (ε_{B A} + ε_{B B} + τ) - 2 α_{B})}{2 ρ} \\ + \frac{d (k_{1} + k_{2} + k_{1} β + k_{2} β + β ε_{A B} + ε_{B A})}{ρ (ρ + ζ)} \\ - \frac{k_{4} (d (1 + β) + (ρ + ζ) τ) ω_{B}}{c_{B} ρ (ρ + ζ)} \end{array}\}

(50)

The above analysis leads to Proposition 4:

Proposition 4:

When the negative effects of pollution

d

exceed a certain threshold, i.e.,

d > \frac{τ (ρ + ζ)}{2 - β}

, the environmental pollution capacity under the pollution control cost compensation mechanism is lower than that under the noncooperative mechanism, i.e.,

x^{N} (t) - x^{S} (t) > 0

. At the same time, the instantaneous returns of both regions are greater than under the noncooperative mechanism, i.e.,

V_{A}^{S} {}^{*}(x) - V_{A}^{N} {}^{*}(x) > 0

and

V_{B}^{S} {}^{*}(x) - V_{B}^{N} {}^{*}(x) > 0

. In the collaborative cooperation scenario, the environmental pollution capacity is better than in both the noncooperative mechanism and the pollution control cost compensation mechanism, i.e.,

x^{N} (t) - x^{C} (t) > 0

and

x^{S} (t) - x^{C} (t) > 0

. However, due to the existence of cooperation costs

c_{C}

, the governments of both regions will only choose collaborative cooperation if

V_{A B}^{C} {}^{*}(x) - V_{A}^{N} {}^{*}(x) - V_{B}^{N} {}^{*}(x) > 0

. Otherwise, due to cost constraints, local governments will still opt for the noncooperative approach based on their self-interests. Similarly, the governments will only adopt cooperative play if

V_{A B}^{C} {}^{*}(x) - V_{A}^{S} {}^{*}(x) - V_{B}^{S} {}^{*}(x) > 0

is satisfied. Otherwise, they will revert to the cost compensation game mode. If both conditions

V_{A B}^{C} {}^{*}(x) - V_{A}^{N} {}^{*}(x) - V_{B}^{N} {}^{*}(x) < 0

and

V_{A B}^{C} {}^{*}(x) - V_{A}^{S} {}^{*}(x) - V_{B}^{S} {}^{*}(x) < 0

are met, the choice of game mode depends on the value of

d

. When

d > \frac{τ (ρ + ζ)}{2 - β}

, the pollution control cost compensation mechanism is superior to the noncooperative mechanism.

From both environmental protection and economic benefit perspectives, regional cooperative governance of pollution represents the ideal Pareto optimal state. On one hand, regional cooperation in pollution control allows both parties to share the environmental burden, resulting in individual costs that are lower than social costs, thereby reducing pollution levels. On the other hand, collaborative cooperation prevents economic losses caused by “free-rider” behavior, enhances overall returns, and achieves Pareto improvements. However, cooperation costs have a significant influence on the feasibility of cooperative governance. These costs mainly include expenses related to addressing cross-border pollution, technical coordination, and resource integration. High cooperation costs may reduce the willingness of governments to engage in collaborative governance, especially when the costs are excessively high, leading them to prefer noncooperative governance models. Therefore, reducing cooperation costs—particularly by optimizing pollution control cost structures through technological innovation and resource sharing—is essential to achieving long-term sustainable cooperation. Compared to full collaboration, when pollution exceeds a certain threshold due to its mobility and accumulation, the developed region can reduce the pollution control costs of the less-developed region by providing ecological subsidies for pollution control technologies. This can result in a Pareto improvement in both environmental pollution levels and the instantaneous returns for both regions.

6. Numerical Analysis

In this section, the optimal decisions outlined above are analyzed numerically using Mathematica. Referring to the studies by Li [30] and Shoude [31], the benchmark parameters are set as

α_{A} = 140, α_{B} = 120, d = 5, β = 0.85, c_{A} = 1, c_{B} = 1.2, ρ = 0.1, ζ = 0.2, τ = 4,

x_{0} = 350, ε_{A A} = 0.1, ε_{B B} = 0.1, ε_{A B} = 0.1, ε_{B A} = 0.1, ω_{A} = 1, ω_{B} = 2, k_{1} = 1.8, k_{2} = 1.8, k_{3} = 2.2,

k_{4} = 2.2

. Based on this, a numerical analysis is conducted.

6.1. Optimal Pollution Capacity and Instantaneous Benefit Trajectory

Figure 1 illustrates the time evolution trajectories of pollutant capacity and benefits for the two regions under the feedback Nash equilibrium. Figure 1a,c show that the pollution capacity in both regions during collaborative cooperation is significantly lower than in the other two scenarios and gradually converges to a stable state over time. Additionally, the total benefits for both regions in collaborative cooperation are clearly superior to those in the other two mechanisms. For the noncooperative mechanism and the pollution control cost compensation mechanism, when the damage coefficient borne by Region A exceeds a certain threshold,

d > \frac{τ (ρ + ζ)}{2 - β}

, Region A chooses to share pollution control costs with Region B. Figure 1b,c show that when Region A provides pollution control cost compensation, the returns for both regions are higher than those under the noncooperative mechanism. However, when Region A cooperates with Region B in pollution control, the initial returns are lower compared to the cost compensation mechanism. Over time, though, the returns from cooperative pollution control gradually exceed those of the cost compensation mechanism.

6.2. Numerical Analysis in Steady State

In this section, we analyze the impact of factors such as the discount rate and the natural decomposition rate of pollutants on the steady-state equilibrium results.

Figure 2a–c illustrate the effect of the discount factor on the pollution capacity and benefits of the two regions. It can be observed that the discount rate is positively correlated with pollution capacity and negatively correlated with benefits. As the discount rate

ρ

increases, it indicates that both regions exhibit “short-sighted” behavior, with the slopes of the pollution capacity and benefits curves decreasing. This suggests that both regions focus more on maximizing short-term benefits and reduce their investment in pollution control technologies. Additionally, due to this “short-sighted” behavior, Region A reduces its pollution control cost compensation to Region B, causing the pollution capacity under the “cost compensation mechanism” to gradually approach that under the “noncooperative mechanism”.

Figure 3a–c illustrate the relationship between the natural decomposition rate of pollutants

ζ

, environmental pollution capacity, and the benefits of the two regions. When adopting the “noncooperative mechanism” and the “pollution control cost compensation mechanism”, an increase in the natural decomposition rate leads to a significant reduction in pollution capacity, thereby reducing environmental damage in both regions and increasing their benefits.

Furthermore, as shown in Figure 3b, with the rise in the natural decomposition rate, the slope of Region A’s benefit curve is steeper compared to Region B, and the benefit curves for the noncooperative mechanism and the cost compensation mechanism tend to converge. This suggests that Region A is more sensitive to environmental damage than Region B. As the natural decomposition rate

ζ

increases, pollution capacity initially grows at a decreasing rate (concave growth) and then declines; correspondingly, the benefits of both regions first decrease and then increase, eventually stabilizing. Additionally, this study found that if the natural degradation rate of pollutants continues to rise, the total returns from cooperative pollution control will gradually decline and eventually fall below the returns of the other two models. This occurs because, as the ecological environment improves and pollutants can self-degrade at a higher rate, the need for cooperation diminishes.

When the natural decomposition rate

ζ

is low, it implies that the ecological environment of the region is fragile and has poor recovery capability, making it more sensitive to environmental damage, and vice versa. Figure 3b also shows that the slope of Region A’s benefit curve is greater than that of Region B, indicating that Region A is more sensitive to environmental damage and, thus, requires higher pollution control costs, while Region B exhibits “free-rider” behavior. As the natural decomposition rate increases, the impact of environmental damage on Region A gradually diminishes, and the cooperation between the two regions becomes more stable.

7. Conclusions and Recommendations

7.1. Conclusions

This paper, based on a differential game model, investigated three governance models for controlling cross-border water pollution between two regions with asymmetric economic development: the Nash noncooperative mechanism, the pollution control cost compensation mechanism, and the collaborative cooperation mechanism. Comparative analysis of these models reveals significant differences in terms of pollution emissions, pollution control investments, economic returns, and cooperation costs.

The key research findings are as follows:

Nash noncooperative mechanism: Each region makes independent decisions, resulting in higher pollution emissions, poor governance outcomes, and lower total returns. Due to the absence of coordinated action, each region bears the full cost of governance, leading to reduced efficiency.

Pollution control cost compensation mechanism: The developed region takes on a larger share of the pollution control costs, which reduces overall pollution levels, while the less-developed region increases its pollution control investment. However, the overall impact remains limited. Despite the developed region’s greater investment, the less-developed region’s pollution control capacity does not reach optimal levels.

Collaborative cooperation mechanism: This is the optimal model. Cooperation not only reduces pollution emissions but also maximizes the total returns for both regions. Through resource sharing and technical coordination, cooperation costs are distributed equitably, leading to Pareto improvement. Collaborative cooperation effectively prevents the “free-rider” problem and significantly enhances long-term returns for both regions.

This study also highlights that cooperation costs significantly affect the feasibility of the cooperative mechanism. Local governments will only opt for collaborative cooperation if cooperation costs remain below a certain threshold. If these costs are too high, governments are more likely to choose the pollution control cost compensation mechanism or revert to the noncooperative model. Therefore, controlling cooperation costs is crucial for sustaining long-term cooperation.

7.2. Recommendations

Strengthen regional cooperation mechanisms and equitably distribute costs: Governments should establish stable cooperation frameworks to ensure that costs are fairly shared. Developed regions can provide technical and financial support to compensate for the pollution control investments of less-developed regions, thereby preventing a decline in cooperation due to high costs.

Optimize cooperation cost structures: Reducing cooperation costs through technological innovation and resource integration can minimize expenses related to technical coordination and infrastructure. Market mechanisms such as emissions trading systems can further optimize the allocation of pollution control costs.

Develop flexible compensation mechanisms: Developed regions should bear a larger share of cooperation costs, particularly in heavily polluted areas. Flexible compensation mechanisms can encourage less-developed regions to participate actively in governance and ensure effective control of cross-border water pollution. The compensation mechanism should be dynamically adjusted to prevent cooperation breakdowns caused by excessive costs.

Establish a long-term evaluation system: A system should be implemented to regularly assess cooperation costs and governance outcomes. Periodic evaluation of costs and environmental benefits will help adjust strategies to ensure the long-term success of the cooperation model.

Improve pollution control technologies and reduce cooperation costs: Promoting technological innovation in pollution control can enhance governance efficiency and reduce the costs of technical coordination. By fostering government–enterprise partnerships to develop new technologies, pollution control investments can be lowered, creating a win–win situation for both the environment and the economy.

By controlling and optimizing cooperation costs, regional cooperation mechanisms for cross-border water pollution governance can be stabilized, leading to effective pollution control and balanced economic development between regions.

Author Contributions

Conceptualization, G.T. and C.Y.; Methodology, G.T. and C.Y.; Software, C.Y.; Validation, C.Y.; Formal analysis, G.T. and F.Y.; Investigation, F.Y.; Data curation, C.Y. and F.Y.; Writing—original draft, C.Y.; Writing—review & editing, G.T.; Visualization, C.Y.; Project administration, G.T.; Funding acquisition, F.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the [National Natural Science Foundation of China], grant number [72264023].

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors disclose the use of AI tools in preparing this manuscript. Specifically, ChatGPT was utilized to enhance the clarity and accuracy of the language by correcting grammar, refining sentence structures, and improving word choices. It should be emphasized that all academic content, research data, and analyses were solely conducted by the authors, with the AI tool serving exclusively to improve the linguistic quality of the text. The authors also sincerely thank the editorial team and all reviewers for their valuable time and expert feedback during the peer-review process. Your insights and suggestions have significantly contributed to the refinement of this research.

Conflicts of Interest

The authors declare no conflict of interest.

References

Lu, Z.; Chen, C.; Zhang, X.; Li, Y. A Differential Game Analysis of Multi-Regional Coalition for Transboundary Pollution Problems. Ecol. Indic. 2022, 145, 112925. [Google Scholar] [CrossRef]
Mooney, C. Air and Water Problems Are Worsening on a Global Scale, U.N. Says. The Washington Post, 23 May 2016. [Google Scholar]
Huisman, P.; de Jong, J.; Wieriks, K. Transboundary Cooperation in Shared River Basins: Experiences from the Rhine, Meuse, and North Sea. Water Policy 2000, 2, 83–97. [Google Scholar] [CrossRef]
List, J.A.; Mason, C.F. Optimal Institutional Arrangements for Transboundary Pollutants in a Second-Best World: Evidence from a Differential Game with Asymmetric Players. J. Environ. Econ. Manag. 2001, 42, 277–296. [Google Scholar] [CrossRef]
Montgomery, W.D. Markets in Licenses and Efficient Pollution Control Programs. J. Econ. Theory 1972, 5, 395–418. [Google Scholar] [CrossRef]
Eheart, J.W.; Brill, E.D.; Lence, B.J.; Kilgore, J.D.; Uber, J.G. Cost Efficiency of Time-Varying Discharge Permit Programs for Water Quality Management. Water Resour. Res. 1987, 23, 245–251. [Google Scholar] [CrossRef]
Hung, M.F.; Shaw, D. A Trading-Ratio System for Trading Water Pollution Discharge Permits. J. Environ. Econ. Manag. 2005, 49, 83–102. [Google Scholar] [CrossRef]
Mesbah, S.M.; Kerachian, R.; Nikoo, M.R. Developing Real-Time Operating Rules for Trading Discharge Permits in Rivers: Application of Bayesian Networks. Environ. Model. Softw. 2009, 24, 238–246. [Google Scholar] [CrossRef]
Shen, J.; Gao, X.; He, W.; Sun, F.; Zhang, Z.; Kong, Y.; Wan, Z.; Zhang, X.; Li, Z.; Wang, J.; et al. Prospect Theory in an Evolutionary Game: Construction of Watershed Ecological Compensation System in Taihu Lake Basin. J. Clean. Prod. 2021, 291, 125929. [Google Scholar] [CrossRef]
Jorgensen, S.; Zaccour, G. Time Consistent Side Payments in a Dynamic Game of Downstream Pollution. J. Econ. Dyn. Control 2001, 25, 1973–1987. [Google Scholar] [CrossRef]
Bai, M.; Chen, M.; Zhang, M.; Duan, Y.; Zhou, S. Research on Evolutionary Game Analysis of Spatial Cooperation for Social Governance of Basin Water Pollution. Water 2022, 14, 2564. [Google Scholar] [CrossRef]
Asab, C.; Han, Q.D.; Sw, D. A Model of River Pollution as a Dynamic Game with Network Externalities. Eur. J. Oper. Res. 2020, 290, 1136–1153. [Google Scholar]
Yang, T.; Liao, H.; Du, Y. A Dynamic Game Modeling on Air Pollution Mitigation with Regional Cooperation and Noncooperation. Integr. Environ. Assess. Manag. 2023, 19, 1555–1569. [Google Scholar] [CrossRef] [PubMed]
Nagurney, A.; Salarpour, M.; Dong, J.; Nagurney, L.S. A Stochastic Disaster Relief Game Theory Network Model. SN Oper. Res. Forum 2020, 1, 231–244. [Google Scholar] [CrossRef]
Anna, T.; Ekaterina, G.; Dmitry, G. On the Estimation of the Initial Stock in the Problem of Resource Extraction. Mathematics 2021, 9, 3099. [Google Scholar] [CrossRef]
Ge, L.; Li, S. Price and Product Innovation Competition with Network Effects and Consumers’ Adaptive Learning: A Differential Game Approach. Comput. Ind. Eng. 2024, 193, 110298. [Google Scholar] [CrossRef]
Yuan, L.; Qi, Y.; He, W.; Wu, X.; Kong, Y.; Ramsey, T.S.; Degefu, D.M. A Differential Game of Water Pollution Management in the Trans-Jurisdictional River Basin. J. Clean. Prod. 2024, 438, 140823. [Google Scholar] [CrossRef]
Huang, X.; He, P.; Zhang, W. A Cooperative Differential Game of Transboundary Industrial Pollution between Two Regions. J. Clean. Prod. 2016, 120, 43–52. [Google Scholar] [CrossRef]
Xu, X.; Wu, F.; Zhang, L.; Gao, X. Assessing the Effect of the Chinese River Chief Policy for Water Pollution Control under Uncertainty: Using Chaohu Lake as a Case. Int. J. Environ. Res. Public Health 2020, 17, 3103. [Google Scholar] [CrossRef]
Wei, C.; Luo, C.C. A Differential Game Design of Watershed Pollution Management under Ecological Compensation Criterion. J. Clean. Prod. 2020, 274, 122320. [Google Scholar] [CrossRef]
Li, H.; Guo, G. A Differential Game Analysis of Multipollutant Transboundary Pollution in River Basin. Phys. A Stat. Mech. Its Appl. 2019, 535, 122484. [Google Scholar] [CrossRef]
Dockner, E.J.; Van Long, N. International Pollution Control: Cooperative versus Noncooperative Strategies. J. Environ. Econ. Manag. 1993, 25, 13–29. [Google Scholar] [CrossRef]
Long, N.V. Pollution Control: A Differential Game Approach. Ann. Oper. Res. 1992, 37, 283–296. [Google Scholar] [CrossRef]
Wirl, F. Pollution Control in a Cournot Duopoly via Taxes or Permits. J. Econ. 2007, 91, 1–29. [Google Scholar]
Insley, M.; Forsyth, P.A. Climate Games: Who’s on First? What’s on Second? L’Actualité Économique 2019, 95, 287–322. [Google Scholar] [CrossRef]
Insley, M.C.; Snoddon, T.; Forsyth, P. Strategic Interactions and Uncertainty in Decisions to Curb Greenhouse Gas Emissions. SSRN Electron. J. 2019. Available online: https://ssrn.com/abstract=3189506 (accessed on 2 November 2024).
Nkuiya, B.; Plantinga, A.J. Strategic Pollution Control under Free Trade. Resour. Energy Econ. 2021, 64, 101218. [Google Scholar] [CrossRef]
Carbonaro, B.; Menale, M. Markov Chains and Kinetic Theory: A Possible Application to Socio-Economic Problems. Mathematics 2024, 12, 1571. [Google Scholar] [CrossRef]
Breton, M.; Sbragia, L.; Zaccour, G. A Dynamic Model for International Environmental Agreements. Environ. Resour. Econ. 2010, 45, 25–48. [Google Scholar] [CrossRef]
Liu, J.; Xiao, L.; Wang, J.; Wang, C. Payments for Environmental Services Strategy for Transboundary Air Pollution: A Stochastic Differential Game Perspective. Sci. Total Environ. 2022, 852, 158286. [Google Scholar] [CrossRef]
Li, S. A Differential Game of Transboundary Industrial Pollution with Emission Permits Trading. J. Optim. Theory Appl. 2014, 163, 642–659. [Google Scholar] [CrossRef]

Figure 1. (a) Trajectories of total pollutant stock in Regions A and B under three game models. (b) Individual benefit trajectories of Regions A and B under noncooperative and Stackelberg game models (with compensation mechanism). (c) Total benefit trajectories of Regions A and B under three game models.

Figure 2. (a) Impact trajectories of discount rate

ρ

on total pollutant stock in Regions A and B under three game models. (b) Impact trajectories of discount rate

ρ

on the individual benefits of Regions A and B under noncooperative and Stackelberg game models (with compensation mechanism). (c) Impact of discount rate

ρ

on the total benefits of Regions A and B under three game models.

Figure 2. (a) Impact trajectories of discount rate

ρ

on total pollutant stock in Regions A and B under three game models. (b) Impact trajectories of discount rate

ρ

on the individual benefits of Regions A and B under noncooperative and Stackelberg game models (with compensation mechanism). (c) Impact of discount rate

ρ

on the total benefits of Regions A and B under three game models.

Figure 3. (a) Impact trajectory of natural decomposition rate of pollutants

ζ

on total pollutant stock in Regions A and B under three game models. (b) Impact trajectory of natural decomposition rate of pollutants

ζ

on individual benefits of Regions A and B under noncooperative and Stackelberg game models (with compensation mechanism). (c) Impact trajectory of natural decomposition rate of pollutants

ζ

on total benefits of Regions A and B under three game models.

Figure 3. (a) Impact trajectory of natural decomposition rate of pollutants

ζ

on total pollutant stock in Regions A and B under three game models. (b) Impact trajectory of natural decomposition rate of pollutants

ζ

on individual benefits of Regions A and B under noncooperative and Stackelberg game models (with compensation mechanism). (c) Impact trajectory of natural decomposition rate of pollutants

ζ

on total benefits of Regions A and B under three game models.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tu, G.; Yu, C.; Yu, F. Cooperative Strategies in Transboundary Water Pollution Control: A Differential Game Approach. Water 2024, 16, 3239. https://doi.org/10.3390/w16223239

AMA Style

Tu G, Yu C, Yu F. Cooperative Strategies in Transboundary Water Pollution Control: A Differential Game Approach. Water. 2024; 16(22):3239. https://doi.org/10.3390/w16223239

Chicago/Turabian Style

Tu, Guoping, Chengyue Yu, and Feilong Yu. 2024. "Cooperative Strategies in Transboundary Water Pollution Control: A Differential Game Approach" Water 16, no. 22: 3239. https://doi.org/10.3390/w16223239

APA Style

Tu, G., Yu, C., & Yu, F. (2024). Cooperative Strategies in Transboundary Water Pollution Control: A Differential Game Approach. Water, 16(22), 3239. https://doi.org/10.3390/w16223239

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Cooperative Strategies in Transboundary Water Pollution Control: A Differential Game Approach

Abstract

1. Introduction

2. Problem Description and Model Construction

3. Materials and Methods

3.1. Differential Game Method

3.2. AI Tools Disclosure

4. Differential Game Model

4.1. Nash Noncooperative Game

4.2. Stackelberg Game

4.3. Cooperative Game

5. Comparison of Equilibrium Results

6. Numerical Analysis

6.1. Optimal Pollution Capacity and Instantaneous Benefit Trajectory

6.2. Numerical Analysis in Steady State

7. Conclusions and Recommendations

7.1. Conclusions

7.2. Recommendations

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI