1. Introduction
The uncertainty in statistical information theory is an aspect of randomness which is measured by entropy, originally introduced by Shannon [
1] in their seminal work. The entropy has been firstly proposed to quantify the uncertainty in a discrete random variable (RV). There are many generalizations of Shannon entropy recognized in the literature, defined by appending additional parameters. These generalizations make the entropies sensitive to different probability distributions (see Renyi [
2] and Tsallis [
3], among others). One such generalization is the fractional entropy (FE), which is based on fractional calculus. Moreover, the FE is a concave positive function with non-additivity property. From a physical point of view, there are also some descriptions associated with the Lesche and thermodynamic stability.
The Shannon entropy can be developed from the discrete to continuous case as below:
where
is the probability density function (PDF) of a nonnegative RV
X with an absolutely continuous cumulative distribution function (CDF)
. Despite the many advantages of the differential entropy on the one hand, it has some disadvantages on the other hand. For example, it is very complicated to estimate the differential entropy of a continuous RV in terms of the empirical distribution arisen from samples. However, since the PDF exists, we can use this possibility. Furthermore, it can take negative values in calculations (cf. Rao et al. [
4] and Rao [
5]). For some recent work surveyed in the literature, we refer the reader to Kharazmi and Balakrishnan [
6], Mohamed et al. [
7], and the references therein.
An alternative and more direct quantity proposed by Rao et al. [
4], using the survival function (SF)
in place of the PDF
is given by:
Properties of (
2) and its dynamic version and some other generalization of this measure together with their properties are discussed in detail in Asadi and Zohrevand [
8], Navarro et al. [
9], Toomaj et al. [
10], Psarrakos and Navarro [
11], Psarrakos and Toomaj [
12], Tahmasebi and Mohammadi [
13], and Mohamed et al. [
14], among others. By analogy with (
2), Di Crescenzo and Longobardi [
15] introduced cumulative entropy (CE) by substituting
in place of PDF
as:
where:
is the cumulative reversed hazard rate (CRHR) function and
is the reversed hazard rate (RHR) function (see, e.g., Hanagal and Pandey [
16]). Note that the integral in (
4) is convergent for all
; however,
. It is also evident that
. It is obvious that
, since the argument of the logarithm is a probability. Moreover, the CE becomes zero if, and only if
X is a degenerate random variable. Properties of the cumulative entropy in the two-dimensional case have been argued in Ahmadi et al. [
17]. For further properties of cumulative entropies, we refer the reader to Di Crescenzo and Longobardi [
15,
18], Kundu et al. [
19], and references therein. The CE has been developed to more general cases, see, for instance, Di Crescenzo and Toomaj [
20], Kayal [
21], Kayal and Moharana [
22], and references therein.
Recently, Xiong et al. [
23] introduced the fractional cumulative residual entropy (FCRE) following the properties of the CRE in (
2) and the FE as:
for all
Xiong et al. [
23] investigated several properties of this measure, including the effect of linear transformations of RVs on it. To be more specific, they showed that
, where
and
. They also obtained several bounds and used the concept of stochastic orders to establish some insightful comparison of distributions following their corresponding FCREs.
This article concentrates on a newly proposed measure of uncertainty that complements the FCRE from (
5). This measure has been derived by transplantation of the cumulative distribution function with the survival function in (
5) (see Di Crescenzo et al. [
24]). This measure extends the cumulative entropy (
3) to a fractional order. In fact, our definition is a special case, but equivalent in the conception, of that given in Di Crescenzo et al. [
24] given by:
We give an overview of the paper: the fractional cumulative entropy is first described in
Section 2. For generic and linear transformations of RVs, the measure is developed and some stochastic orderings are considered. We utilize some well-known stochastic orderings. Some instructive example are given to illustrate the concepts. In
Section 3, we present several bounds and establish some inequalities involving the FCE. We provide several characterization results of symmetric distributions in
Section 4. In
Section 5, we finally conclude the paper with some remarks and illustrations about our future plan.
2. Fractional Cumulative Entropy
Throughout the paper, we assume that
X is a nonnegative RV with an absolutely continuous CDF
. In analogy with (
5), we define the fractional cumulative entropy (FCE) of
X as follows:
for all
It is clear that
which may be divergent.
We remark that our definition is a special case of the definition given by Di Crescenzo et al. [
24] in (
6). Therefore, in this case, the fractional cumulative entropy is given by:
Moreover, FCE becomes the CE when the parameter
q takes the value one, i.e.,
, as given in (
3). On the other hand, the FCE is nonnegative and a concave function of the distribution, but it is a convex function of the parameter
From (
7), it is clear that
takes values in
In particular,
for all
if, and only if
X is a constant.
Remark 1. Let us assume an RV X with finite mean The FCE is equal with the fractional cumulative residual entropy (FCRE) if the distribution of X is symmetric around μ, i.e., if for all
The next example makes the comparison of
for
with the standard deviation
for a number of reputable distributions tabulated by Di Crescenzo et al. [
24].
Example 1. (a) Assume that X is distributed uniformly in for which has the standard deviation:The straightforward computation yields:In this case, due to Remark 1. Moreover, we have:Thus, we have for all and for all (b) Let X have the Fréchet distribution with the CDF with with the standard deviation:for Recalling (7), we obtain:for all Thus, we have:where: Comparing the functions to 1 as shown in Figure 1, and considering (9), we have the following results given in Table 1. One can see for that there exists a number for which for for all . Note that does not imply that X is equal in distribution with Y; however, the converse holds. Let us now discuss the effect of an increasing transformation on the FCE. We omit the proof, as it is straightforward.
Lemma 1. If where is a strictly increasing and differentiable function, then: Making use of Lemma 1, the following theorem is immediately obtained.
Theorem 1. Let the condition in Lemma 1 hold. If then for all
If
with
and
we have the following result:
for all
, which is given in Proposition 2.2 of Di Crescenzo et al. [
24].
The r.h.s of (
11) is not affected by
b in the sense that the FCE is shift-independent.
The FCE of
X can also be obtained in terms of the cumulative RHR function of
X given in (
4). The proof can be obtained from Proposition 2.1 of Di Crescenzo et al. [
24].
Corollary 1. Let X have a finite FCE for all . Then:where: We note that
in (
13) is a decreasing convex function of
This immediately allows us to obtain the following theorem.
Theorem 2. Let X have a finite mean . Then:for all Proof. Noting that
is a convex function of
x, the Jensen’s inequality is applicable in (
12) and immediately provides the proof. □
For the definition of the decreasing convex order denoted by
and also the definition of the dispersive order denoted by
, we refer the reader to Shaked and Shanthikumar [
25]. In the following result, we demonstrate that the dcx order is a sufficient condition for ordering distributions according to their FCEs.
Theorem 3. If , then .
Proof. We first prove that if
, then:
where the function
is as given in (
13). Since
is a decreasing convex function for all
, it follows (see Section 4.A.1 of Shaked and Shanthikumar [
25]) that
. In particular, since the decreasing convex order implies the expectation ordering, thus
. □
The following corollary follows from Theorem 3.1 of Di Crescenzo et al. [
24], for which another proof is given here.
Corollary 2. If , then for all
Proof. We remark that the dilation property implies
Using (
10), the proof is obtained. □
Di Crescenzo and Longobardi [
18] showed that the ordering distributions according to their associated cumulative entropies is not a result of ordering by the usual stochastic ordering of the underlying distributions. Similar results hold for the FCE.
3. Bounds and Inequalities
In what follows, we derive some upper and lower bounds for the FCE of nonnegative RVs. Let us first consider the following theorem.
Theorem 4. For all where Proof. It is known that
for all
and for every
From (
7), we achieve the result. □
Remark 2. We remark that is known as the risk-adjusted premium introduced by Wang [26] lies in the framework of the proportional hazards model. It is remarkable that if are a random sample from F, then the minimum order statistic follows the proportional hazards model, which makes the results on this model useful in broader applications. If denotes a nonnegative RV with the survival function for then in the proportional hazards model we have:where is the survival function of the baseline model and q is the proportionality constant. For an insurer, the risk-adjusted premium automatically and consistently adjusts the risk burden relative to the expected loss for different risks. Moreover, it is additive when the risk is divided into layers, which makes it very attractive for insurance layer pricing. For detailed discussions, we refer the reader to Wang [26]. The following theorem gives a sufficient condition for the FCE to be finite.
Theorem 5. If for some then for all
Proof. Applying Theorem 4, we get:
where the third inequality is obtained by virtue of Markov inequality. The last integral is finite if
, and this completes the proof. □
A normalized version of cumulative residual entropy as well as cumulative entropy were studied in Rao [
5] and Di Crescenzo and Longobardi [
18]. We will now introduce a normalized version of the FCE. For a nonnegative RV
X with finite nonvanishing risk-adjusted premium
, we define the normalized FCE as:
for all
Recalling Theorem 4, we anticipate that the normalized FCE takes values in
For the special case
we have the normalized cumulative entropy as:
which is discussed in Di Crescenzo and Longobardi [
18].
In the next result, we discuss the relation of FCE with the CE by virtue of Jensen’s inequality. Indeed, it gives an upper bound for the FCE depending on the CE. Its proof is similar to that of the proof of Proposition 2 of Xiong et al. [
23], and hence, we omit it.
Theorem 6. For X with the support and : In the following, we show a lower bound of the FCE in terms of the differential entropy (
1).
Theorem 7. For all ,where is a finite function of The proof of Theorem 7 is based on the log-sum inequality and is similar to Theorem 2 of Xiong et al. [
23], where an analogous result is given for the FCRE. Another lower bound for the cumulative entropy is given below.
Proof. Recalling that
for
and
from (
7) we obtain the result. □
Remark 3. The right-hand side of (16) can be interpreted in probabilistic terms as follows:(i) In spirit of the right-hand-side of (16) can be rewritten as:where is the risk-adjusted premium. (ii) Let X have a finite non-vanishing mean. Thanks to the use of Fubini’s theorem and recalling (14):where:is the survival function of the ‘equilibrium variable’ of with survival function (14). (iii) Since when we conclude that:being similar to that Proposition 4.3 of Di Crescenzo and Longobardi [15]. The proportional reversed hazards rate (PRHR) model of an absolutely continuous nonnegative RV
with proportionality constant
is defined by:
It is worth mentioning that if
are a random sample from
F, then the maximum order statistic
follows the PRHR model. One has
, where
is defined in (
4). For more details on the applications and properties of PRHR model, see Di Crescenzo [
27], Kirmani and Gupta [
28], and references therein. In the forthcoming theorem, we give an upper bound for the FCE of
depending on
.
Theorem 9. For all :and the inequality is reversed whenever Proof. Recalling (
7) and (
17), we have:
Since
when
, we obtain:
which completes the proof. For
we have
and hence, the desired result follows. □
By assuming that the proportionality constant is integer, the following corollary follows from Theorem 9.
Corollary 3. Let be i.i.d. random variables. Then:for all The next theorem is analogously established as Theorem 1 in Xiong et al. [
23], and states that a sum of independent RVs are greater than that of either of them.
We now provide an expression for the FCE in terms of the mean inactivity time (MIT) function (cf. [
29]). We recall that the MIT function of
X which is defined by:
for all
To this aim, we define the RV
with the PDF as:
for all
, where
is defined in (
4).
Theorem 11. Let X have MIT function Then, for all : Proof. It is clear that, for all
:
From the above relation and Equation (
7), and using Fubini’s theorem, we obtain:
where the last equality is obtained from (
18). Thus, the results finally are obtained using (
19) and the well-known relation
. □
Example 2. Let denote the maximum of a random sample of continuous nonnegative RVs , which are uniformly distributed on It is clear that the CDF of is with the PDF for all From (18), we have for Thus, Theorem 11 implies that:where the last equality is obtained by noting that:for all The following result is an interesting achievement.
Theorem 12. Let X have standard deviation and FCE Then:for all Proof. For all
by the Cauchy–Schwarz inequality, we obtain:
Applying Theorem 21 of Toomaj and Di Crescenzo [
30], it holds that:
On the other hand, we have:
which is positive for all
Therefore, the proof is then completed. □
Through a proper expression, the FCE can be described differently. In fact, we provide an expression for the FCE based on the covariance of the proportional RHR function and the RV
Theorem 13. Let X have FCE Then:for all Proof. First, from (
19) it is verified that
Now:
One can obtain:
which implies that:
where the last equality is obtained from Proposition 2.3 of Di Crescenzo et al. [
24]. Therefore, we have the results using relation
. □
The study of waiting times for events is a topic of interest in many fields. To add the possibility of considering the inspection time
t at which the system is found failed and modify the information of the updated distribution, a dynamic version of FCE can be considered. Let
X be a lifetime under the condition that the system has failed prior to the time
t. The distribution function of the inactivity time
is given as:
The FCE for the inactivity time
is:
for all
. As in Theorem 7:
where
is given as in Theorem 7 and
is the past entropy at time
t of
X; see Di Crescenzo and Longobardi [
31] and Muliere et al. [
32]. Moreover, using the FCE of inactivity time, Theorem 2.2 reformulated as:
where:
for all
4. Characterization Properties
We produce a characterization property based on the maximum of a random sample. First, a technical lemma in the spirit of the Muntz–Szasz theorem (see Kamps [
33]) is given.
Lemma 2. For any sequence of positive integers , which is increasing in j, the sequence of polynomials is complete on , iff: Let be n i.i.d. RVs with PDF f and CDF F. We recall that the cumulative distribution function of the largest value of order statistics is . Now, we state the result.
Theorem 14. F and G belong to the same family of distributions, but for a change in location, iff:for a fixed q and for all such that is infinite. Proof. The necessity is simple to prove. For the sufficiency part, if for two CDFs
F and
G Equation (
21) holds, using the probability integral transformations
and
we haveL
If (
22) holds for all
such that
from Lemma 2, we derive that
for all
It follows that
for all
This means that
F and
G belong to the same family of distributions, but for a location shift. □
In what follows, the completeness property of the characterization results of symmetric continuous distributions are applied through the FCE measures. We obtain a result for symmetric distributions based on the equality of the FCRE of the first-order statistic with the FCE of the last-order statistic. As usual, we denote by
the maximum of a random sample having size
m whose RVs are distributed as
X. In a similar way, we denote by
the minimum of a random sample having size
m whose RVs are equal in distribution with
X. The RVs
and
are known as the lifetimes of series and parallel systems in reliability engineering and there are many applications in this context (Barlow and Proschan [
34]). By appealing to the techniques used in the proofs of Theorems 4 and 5 of Ahmadi and Fashandi [
35] and further by applying Lemma 2, we acquire the following characterization results.
Theorem 15. Suppose is a sequence of positive integers which is strictly increasing such that (20) holds. Then, the following statements are equivalent: - (i)
X has a symmetric distribution;
- (ii)
for a fixed and for all
Proof. The probability integral transformation, identified by
provides that:
for all
If
X has a symmetric distribution, then by (
23) and (
24), we readily find that
for all
For the sufficiency, by substituting Equations (
23) and (
24) into
yields:
Since, according to the hypothesis, Equation (
25) holds for
, such that
, Lemma 2 implies:
Thus, by Lemma 2, the proof is completed. □
An analogue theorem can be stated as Theorem 14 for the dynamic FCE.
Theorem 16. F and G belong to the same family of distributions, but for a change in location and scale, if and only if:for a fixed , for all , and for all , where is defined as in Theorem 15. Proof. The necessity is obvious. Therefore, we prove the other part. For a fixed and for all if for all , then appealing to Theorem 14, one concludes that and follows the same distribution but for a change in location parameter, i.e., , for all , where and are, respectively, used to represent the PDFs of and Hence, , and this signifies that F and G lie within a same family of distributions, but with a change in the location and scale. □