1. Introduction
The two-dimensional Moran model is a simple discrete process used in many fields to describe the evolution of two discrete random walks in each unit of time.
Moran’s random walk model can be applied in the field of renewable energy. Many renewable energy situations can be modelled as Moran’s random walk. This modelling has the advantage of minimizing expenses to guarantee the proper functioning of such a system by avoiding surprise breakdowns. In certain ecosystems, and more particularly in certain tropical forests, different species with the same ecological requirements coexist in the same environment. For example, some forests have more than a hundred different tree species on one hectare. To explain this astonishing diversity, scientists have constructed models in which community composition is solely based on the stochastic dispersal of individuals. The mathematical model studied in our paper is in line with this. It was suggested by M. Kalyuzhni [
1] in an article where he justifies its relevance. It is known as Moran’s model in a random environment. It is therefore a question of studying a process of birth and death which takes into account environmental hazards (climates, diseases, etc.) that randomly favour or disadvantage certain species.
In this work, our goal is to study the statistical properties of some discrete statistics such as the limiting distributions, the mean and the variance of two discrete walks in our model, and , and the maximum of two walks (called final altitude) using very elegant tools called probability generating functions. Also, we analyse the return time, , of the random walk and find its mean and variance.
In the literature, these properties of discrete random walks are studied in one dimension and in higher dimensions via the kernel method and singularity analysis (see [
2,
3]). For example, for one dimension, if one focuses on articles which play a role in our analysis, Banderier and Flajolet have proven in [
2] that the limiting distribution of the final altitude of a random meander of length
n converges to a Rayleigh distribution (drift
) and normal distribution (
). Furthermore, the height of discrete bridges/meanders/excursions for bounded discrete walks has been analysed by Banderier and Nicodème [
4]. Also, Aguech, Althagafi, and Banderier in [
5] have studied the height of walks with resets and the Moran Model. Similar extremal parameters were studied for trees in [
6,
7], and by Gafni [
8] for the asymptotic distribution of the length of the longest run of consecutive equal parts. Finally, Banderier and Wallner treated the number of catastrophes of a random excursion of size
n, which converges to a Gaussian, Rayleigh, or discrete distribution depending on the drift (see Theorem 4.12 in [
9]).
For higher dimensions, still in connection with our model, we can mention the two-dimensional Moran model, investigated by Abdelkader and Althagafi in [
10], where they showed that the age of each component converges to a shifted geometric distribution in law. Furthermore, the limiting distribution for the lifetime of an individual converges to a (shifted) geometric distribution in law, proven by Itoh and Mahmoud [
11]. Itoh, Mahmoud, and Takahashi in [
12] proved that the wavelength converges, in distribution, to a convolution of geometric random variables. Other papers are related to the Moran process (in biology and population genetics); see, e.g., [
13,
14,
15]. The models in the papers [
16,
17] can be modelled as a Moran process.
This paper is organized as follows. In
Section 2, we present our model in detail and define some statistics. In
Section 3, in order to obtain the probability generating functions of the random walks,
and
, we give some recursive equations for the sequence of multivariate polynomials in our model. We show that the two Moran random walks
and
converge to shifted geometric distributions in law asymptotically. Also, we calculate their means and variances using the probability generating function of the random walks
and
. In
Section 4, we study the statistical properties of the maximum age,
, between two random walks
and
. In
Section 5, we analyse the number of returns up to time
n,
. We start with a simulation of the random walk
with different lengths: 100, 1000, 10,000, and 100,000 according to the initial probability
q using R software. Also, we obtain the distribution and the probability generating function. In
Section 6, we determine the general probability generating function of the two-dimensional random walk, which can be useful to extract the distribution of the height
. In
Section 7, we present some conclusions concerning our results and some perspectives. In
Appendix A, we give some technical lemmas useful for studying the final altitude.
2. Definitions and Presentation of the Model
In this section, we introduce our model: the two-dimensional symmetric Moran model. We define some statistics such as the final altitude, the height, and the return time. We present an elegant tool called the probability generating function, which plays an important role in finding the statistical properties of discrete random walks.
2.1. Presentation of the Model
Our model is presented as follows: At time 0, the random walk starts from the origin. After one unit of time, (a) the first random walk shifts by one positive unit, but the second random walk returns to 1 with probability
; (b) the second walk shifts by one positive unit, but the first random walk returns to 1 with same probability
; (c) the two random walks shift by one positive unit with probability
, where
. Mathematically, our model is given by the following system: for all
where
. The process
is considered a stochastic process with dimension two defined on the state space
, and started from the initial state
.
2.2. Definitions
In this subsection, we present some definitions concerning some discrete random walks.
where , and .
Our goal is to study the statistical properties of the following discrete random walks: , , , , . Precisely, we want to find their limiting distributions, their means, and their variances. As mentioned before, as a tool, we use the probability generating function.
Definition 1. Let U be a discrete random variable with distribution , . The probability generating function, denoted by G, of the variable U is defined byfor all such that . Due to their numerous uses, probability generating functions constitute an elegant tool to study the characteristic of a distribution. Mainly, the probability density functions associated with discrete stochastic processes and their moments can be obtained from the derivatives of the probability generating function. In fact, the mean and the variance of the process (the first and second centred moments of the distribution of U) are related to the derivatives of the probability generating function at . More precisely, the next folklore lemma explains this link.
Lemma 1 ([
3]).
Let be the probability generating function of a the discrete random process U. For all , the factorial moment of U is given by In addition, if the limits of
and
exist at
, then we have the following two important equations, which are related to the mean and variance of
U and
:
3. Distributions of and
In this section, firstly, we derive a conditional probability of the position of the process defined in (
1) at time
given that we know its position at time
n. Secondly, we determine the sequence of multivariate polynomials, denoted by
, and find the recursive equation related to this sequence between two consecutive times
n and
. Finally, we show that the two symmetric random Moran walks
and
converge to the shifted geometric distribution, and we compute their means and variances. Using the definitions in
Section 2, we define the joint probability mass function of
. Denote, for all
r,
,
this is the probability that the process is in the position
at time
.
We start this section with a technical lemma. It involves a recursive equation between the probability of our model for two consecutive times, n and , to be used in Theorem 1. It is based on the following conditional probability:
Proof. This proof is based on the utility of the conditional probability that the Moran walks X and Y are aged r and s at time , given that they are aged l and k at time n, and then
For
and
, we have
For
and
, we have:
For
and
, we have by symmetry:
□
Remark 1. Consider two consecutive times n and , r and s days, starting from 1 to , and the ages of two components X and Y are equal at time , respectively. We give some comments on the different cases of the age of two components X and Y at time :
If and , then the probability that X and Y are aged r and s at time is equal to the probability that is aged days at the preceding time n multiplied by .
If and , the probability that is aged days equals multiplied by the sum of all probabilities of X and Y that are aged l and days at time n, where l starts from 1 to n, respectively.
If and , the probability that is aged days equals multiplied by the sum of all probabilities of X and Y that are aged and k days at time n, where k starts from 1 to n, respectively.
Next, we define the sequence of multivariate polynomials
(for
) associated with the two-dimensional process
, by
The coefficient of
in
represents the probability that the position of the two-dimensional process
is at level
at time
n.
When
, we have the special case
By Equation (
4) and Lemma 2, we deduce a recursive equation related to
,
,
, and
. It is presented in the next proposition.
Proposition 1. For all , the explicit expression of the sequence of multivariate polynomials holds the following recurrence: Proof. Using Equation (
4) and for all
, the function
can be developed as
Due to Lemma (2), we can compute
, and
C as follows:
finally, via symmetry, we deduce
We obtain Equation (
6) by combining Equations (
7) and (
10). □
In this part, we study some statistical characteristics such that the probability generating function, the asymptotic distribution, the mean, and the variance of the final altitude of each component and at time n can be obtained. Precisely, we start by finding the probability generating function of each component and . Next, we show that the final altitude of the two random walks and converge to a shifted geometric distribution asymptotically. Finally, we finish this section by computing the mean and the variance of the two random walks. The following theorem introduces the probability generating function and the asymptotic limit distributions of and .
Theorem 1. and converge to a shifted geometric distribution with parameter in law asymptotically, with the same probability generating function given by the following: for all for all , such that . Proof. Using Equations (
5) and (
6) with
, we obtain
We iterate
n times and we obtain
Hence, passing to the limit of
, we the have
it is exactly the generating function of a shifted geometric distribution with parameter
.
□
Theorem 1 leads us to find the explicit expressions of the means and variances of and , which depend on the first and the second derivatives of the probability generating function .
Corollary 1. The means and the variances of and are given byand Proof. Calculating the first derivative of
defined in Equation (
11) with respect to
u,
evaluating with
,
Using Equation (
2), we obtain
To derive the variance of
and
, we need to define the following sequences of functions:
Observe that
using Equation (
16) and computing the second derivative of
with respect
u, one has
The first derivatives of the functions
,
, and
are given by
Let
and multiply by
,
, and
, respectively, we can obtain
Replacing, in Equation (
17), the first derivatives of
,
, and
, with respect the variable
u, with 1, we obtain
Using the following equalities,
and combining Equations (
18)–(
20), the second derivative of
evaluated at
is given by
Applying Equation (
2), and using Equations (
13) and (
18), we obtain
□
5. Return Time of the Random Walk
In this section, we analyse the number of return times at time n of the process to position 1 at time n. Precisely, we start with a simulation of the process and determine the explicit form of , i.e., the probability generating function of .
5.1. Simulations of
In this subsection, we give some simulations with R-program using with different lengths: 100, 1000, 10,000, and 100,000 for different values of the given probabilities
Figure 1 shows that the return time,
, of the random walk,
X, with length 100 is increasing from 0 to 40 from time 0 to time 100, when the random walk
X alternates between 0 and 9 with initial probability
q equal to 0.6. Also, we observe that the return time of random walk
with lengths 1000, 10,000, and 100,000 is increasing from 0 to 400, 4000, and 40,000, when the evolution of the random walk,
X, is about 0 (very small variation in
X), respectively.
Figure 2 shows that the return time,
, of the random walk,
X, with length 100 is increasing from 0 to 40 from time 0 to time 100, when the evolution of the random walk
X alternates between 0 and at most 12 with initial probability
q equal to 0.75. Also, we observe that the return time of random walk
with length 1000 is increasing from 0 to 300, when the evolution of the random walk,
X, alternates between 0 and 25. Furthermore,
Figure 2 shows that the return time of random walk
with lengths 10,000 and 100,000 is increasing from 0 to 2500 and 25,000, when the evolution of the random walk
X is about 0, respectively.
Figure 3 shows that the return time,
, of the random walk,
X, with length 100 is increasing slowly from 0 to 10 from time 0 to time 100, when the evolution of the random walk
X alternates between 0 and 17 with initial probability
q equal to
. Also, it shows that the return time of random walk
with length 1000 is increasing from 0 to 100, when the evolution of the random walk
X alternates between 0 and 50. Furthermore, we observe that the return time of random walk
with lengths 10,000 and 100,000 is increasing from 0 to 1000 and from 0 to 10,000, when the evolution of the random walk
X alternates between 0 and 80, and about 0, respectively.
5.2. Probability Distribution of
In this section, we give the probability distribution of .
Theorem 3. The exact distribution of is given by Remark 3. Through an easy computation, we prove that this probability can be given bywhere for a k differentiable function g, the notation denotes the derivative of g. Proof. For the proof of Theorem 3, we start by computing the joint distribution of the discrete return time and the discrete random walk X. To this end, for all and for all , we compute, as a first step, the probability of intersection between the return time equal to k, and the random walk equal to s. As a second step, we deduce the marginal distribution of .
Consider
the number of visits of the process
to the state 1 up to time
n.
We start by giving the joint distribution of
. □
Lemma 3. The joint distribution of satisfies the following relation:and is given aswhere is a binomial distribution with parameters and . Proof. Using Equation (
39), we have
By Theorem 1,
follows a shifted geometric distribution with parameter
We conclude the proof by using the fact that
□
By summing with respect to s and using the known distribution of , we deduce the result of Theorem 3.
Remark 4. In the particular cases,
If the return time of X equals 0 (), then the probability that the random walk X is strictly increasing from time 0 to time n equals for any age of the random walk Y at time n.
If the return time of X equals (), given that the age of X increases from 0 at time 0 to at time , the probability that the age of X equals 1 at time comes from the probability of X at time multiplied by .
Remark 5. The probability generating function of can be expressed, and we prove that it is given bywhere the function is the well-known hyper-geometric function given byFrom , we can compute the mean and the variance of , but the expressions are very complicated. 7. Conclusions and Perspectives
In this current paper, we use very useful tools called probability generating functions to find the statistical properties, i.e., the mean, the variance, and the limiting distribution, of the random walks , , , and . Firstly, we prove that both symmetric random Moran walks and converge to a shifted geometric distribution with parameter using the probability generating functions asymptotically. Also, the means and the variances of and are calculable explicitly using the same tools. Secondly, we use the symmetry of two random walks and to find the statistical properties of the maximum age between two components, such as the mean and the variance, derived from the probability generating function. Finally, we analyse the return time, , of the random Moran walk . From the simulation of , we observe that the return time is affected according to the initial probability q and the length of the random walk. Precisely, we distinguish two cases:
When the initial probability
q approaches 1 (
q = 0.9), the return time with a small length
is increasing slowly and remains lower than the final altitude of
at time
n (see
Figure 3). In this case, the Moran random walk increases often and returns to 1 few times. That means the number of increases in
is greater than the number of times that
returns to 1.
When the length of the random walk
is very large,
or 10,000 or 100,000, the return time
is not affected by the initial probability
q and increases quickly (see
Figure 1,
Figure 2 and
Figure 3). In this case, the Moran random walk often returns to 1 but
alternates between 1 and at most 50. That means the number of times that
returns to 1 is greater than the increase in
.
Here, the initial probability
q represents the probability that the random walk
increases. This increase in
happens intwo ways: in the first way,
increases but
stops at 1 with probability
; in the second, both walks
and
increase in the same time with probability
(see Equation (
1)).
In the next work, we will use the probability generating function to study the statistical properties of the height statistics,
. Precisely, we will find the distribution of
and compute its mean and variance based on the return time
. Firstly, we will start with the following conditional probability:
where the random walk
is bounded by an integer
given that the random walk
equals
. Secondly, we will try to obtain the joint distribution of
. Finally, we can extract the distribution of the bounded random walk
and determine its statistical properties.