1. Introduction
1.1. Paper Background and Motivation
Since Bandt and Pompe [
1] introduced the concept of permutation entropy (PE), it has been applied in different fields from biomedicine to econophysics (e.g., Zanin et al. [
2], and Amigò et al. [
3]) and developed into various directions. One relatively new variant of permutation entropy is based on Rényi entropies instead of the originally used Shannon entropy and is called Rényi permutation entropy (RPE). Roughly speaking, RPE quantifies the complexity of the distribution of ordinal patterns of some length
n underlying a dynamical system, where ordinal patterns describe the up and down in the dynamics. As Rényi entropies depend on a parameter
, there are also different choices of RPE depending on
q.
The central aim of the paper is to discuss the asymptotics of RPE for increasing pattern length. This is motivated by the striking fact that, under certain assumptions, asymptotic PE is equal to Kolmogorov–Sinai entropy, which was first observed by Bandt et al. [
4]. This paper shows that the situation for
is more complicated than that for
.
The paper is organized as follows. It first follows a short overview of first applications of the RPE.
Section 2 provides the main definitions. The concepts of RPE are introduced in empirical and model-based settings. Moreover, RPE is discussed for some special
q, including
as a limit case.
Section 3 is devoted to the asymptotics of RPE and PE. With Corollary 1, the section contains the main new result of the paper relating RPE to Kolmogorov–Sinai entropy for
and measures with maximal entropy. Its proof and a class of discriminating examples for
(see Example A1) are given in
Appendix A.
1.2. First Applications of RPE
To our best knowledge, the concept of RPE was first considered in the literature in 2015. In a study of monitoring the depth of anaesthesia by EEG, Liang et al. [
5] systematically compared 12 entropy measures, with RPE among them. They reported that RPE had the best performance in distinguishing different anaesthesia states. Mammone et al. [
6] discussed RPE in the context of absence epilepsy EEG. Their results suggested improved abilities in classifying ictal and interictal EEG by using RPE (with suitable parameters) instead of PE. Zunino et al. [
7] introduced permutation min entropy, which is the limit of Rényi entropy for defining parameter
q approaching to
∞ as a tool for finding temporal correlations in a time series.
Moreover, Rivero et al. [
8] combined an enhanced Bayesian approach and RPE for predicting long-term time series. Following the results of Liang et al., Park et al. [
9] used RPE for comparing anaesthetics given during a cesarean section with similar results as those for other entropy measures. Different variants of RPE, from weighting to multiscaling, have been applied to complex stock-market data (Zhou and Shang [
10], and Chen et al. [
11]). Some remarks on RPE can also be found in [
12].
2. Rényi Entropies
2.1. General Entropy Concept
Given a finite index set
I consisting of
n elements,
is called a
stochastic vector if
for all
and
. The Rènyi entropy
of a stochastic vector
for
is defined by
The Rényi entropy of a fixed stochastic vector monotonically decreases and is continuous with respect to
q. It generalizes the
Shannon entropy given in the standard case
. The larger that
q is, the more the role of the largest entries in the stochastic vector is emphasized, and the smaller that
q is, the more equal the role of all positive entries in the entropy formula is.
On the basis of the concept of Rényi entropies, we want to give precise definitions of RPE regarding both the empirical and the modelling viewpoint.
2.2. Empirical RPE
For
, we denote the set of permutations of
by
. A vector
has
ordinal pattern if
and
The latter requirement realises the uniqueness of ordinal patterns.
Definition 1. The empirical Rényi permutation entropy for and of a time series is defined bywithbeing the relative frequency of ordinal patterns π in the time series, and and being defined by 0.
2.3. RPE
On the model side, we consider a measure-preserving dynamical system , defined as a probability space being equipped with a --measurable map which satisfies for all . T and the system are called ergodic if for implies .
Generally, the dynamics of T can be related to ordinal patterns via a real-valued random variable X on by assigning the ordinal pattern of . Here, X is interpreted as observable modelling of a measuring process. If X (or, more generally, a collection of random variables) has certain separation properties, the ordinal patterns obtained via X (or all random variables) contain much information on the given system. In the following, however, we usually assume that is a subset of , and the ordinal pattern assigned to is that taken from (this is equivalent to considering an X being the identity map).
For a permutation
, we denote the sets of all points
x with ordinal pattern
by
. From these sets, we obtain the partition
of
, with this being central to considering RPE on the model side. Analogously to empirical permutation entropy, we define Rényi permutation entropy for
and
on the basis of
by
2.4. Estimation
Given an orbit
of some
, it is natural to estimate
for
and
by
and
, respectively. In the case that
T is ergodic, by Birkhoff’s ergodic theorem, the corresponding estimators are asymptotically consistent. This particularly means that
for
-almost all
.
2.5. RPE for Special Parameters q
In the following, we discuss the RPE for some special parameters q, and touch the general concept of Rényi entropies.
: Rényi entropy for is the well-known Hartley entropy, and the RPE of a measure preserving dynamical system is no more than the logarithm of the number of ordinal patterns appearing with positive probability. Rényi entropy for is also called max entropy since it is maximal among Rényi entropies.
: This case providing the standard (Shannon) permutation entropy has been discussed in various papers both from a theoretical and an application viewpoint. We particularly refer to the literature mentioned in several parts of this paper.
: Rényi entropy for
, also called
quadratic entropy or
collision entropy, is used in different fields. It is obviously related to the
Simpson index given for a stochastic vector
and used as a diversity measure in ecology (see [
13]). Given a measure-preserving dynamical system
, we look at the RPE for
.
By Fubini’s theorem, it holds that
with
Here,
stands for the indicator of a set
A assigning a point the value 1 if it belongs to
A and value 0 otherwise, and
denotes the product measure of
with itself.
So, is related to the probability that the ordinal patterns of length n of two independently (with respect to ) chosen points coincide.
A natural estimation of
based on a finite orbit
of some
is given by
providing the relative frequency of pairs in the orbit with coinciding (completely defined) ordinal patterns. This qualifies the RPE for
as a recurrence measure.
Quantity (
1) was introduced by Caballero et al. [
14] as the
symbolic correlation integral in the context of a stochastic process and studied mainly in the i.i.d. case.
: It is well-known that the Rényi entropy of a stochastic vector
for
converges to value
This fact can be used to reconstruct a stochastic vector up to permuting its components from its Rényi entropies for an unbounded sequence
(see
Appendix A). Since
for all
, number
is called
min entropy of
. Applications of min entropy in the permutation entropy context can be found in Zunino et al. [
7]. In the following, we further assume that
.
3. Asymptotics of RPE and PE
As already mentioned, there is a strong relationship between Kolmogorov–Sinai entropy and PE. The result of Takens and Verbitskiy [
15] that, for
, Kolmogorov–Sinai entropy can be expressed by a limit on the basis of Rényi entropies instead of Shannon entropies suggests the question whether PE can be similarly replaced by RPE in that relationship. This question addresses the asymptotics of RPE, and the general nature of RPE is thus in the centre of this section.
3.1. Kolmogorov–Sinai Entropy via Rényi Entropies
Definitions and statements of this subsection go back to Takens and Verbitsky [
15]. Many considerations of this paper are related to partitions of
. We generally assume that, in a context where a
-algebra on
is specified, such partitions are contained in it.
Let
now be a measure-preserving dynamical system, and consider a finite partition
of
. For
and multi-indices
define the sets
forming the partition
of
. For
, the
generalized entropy rate of
T with respect to partition is defined as
with
for a finite partition
of
. Generalized Kolmogorov–Sinai entropy for
is defined as the supremum of generalized entropy rates taken over all finite partitions:
(Standard) Kolmogorov–Sinai entropy is given by
In the case of
, the limit inferior in (
3) can be replaced by a limit and that, for
being an interval and
being the Borel
-algebra, Kolmogorov–Sinai entropy is already determined by
finite interval partitions defined as finite partitions consisting of intervals (e.g., [
16]):
The following theorem of Takens and Verbitskiy [
15] was originally proved for invertible systems; however, it also holds true for noninvertible systems (see Verbitskiy [
17]). Assumption (i) of ergodicity can be relaxed (see Takens and Verbitskiy [
18]); however, we do not go into the technical details.
Theorem 1. Let be a standard-probability space and an aperiodic and ergodic measure-preserving function. Then,holds true for all . Additionally, impliesfor all . Here,
T is called
aperiodic if the set of periodic points has measure zero with respect to
. The property of a probability space to be standard is a relatively technical one; however, it is not very restrictive since it is principally satisfied for the most common probability spaces (e.g., Walters [
16]).
3.2. Kolmogorov–Sinai Entropy and RPE
In order to discuss the relationship between RPE and Kolmogorov–Sinai entropy of a measure-preserving dynamical system
, we define
lower and
upper Rényi permutation entropies
and
for
as
and
respectively. We write
and
in the case of
. Both
and
monotonically decreasee with respect to
q by the definition of Rényi entropies.
: The celebrated result of Bandt et al. [
4] that, for piecewise continuous and monotone interval maps, the permutation and Kolmogorov-Sinai entropy coincide is the motivation for the following discussion. Here, we state the more general version of the result proved in Gutjahr and Keller [
19], but afterwards return to the case of piecewise monotone interval maps. In the following, we call a subset of
interval if it is the intersection of an interval of
with
or a one point set.
Theorem 2 ([
19]).
Let be a measure-preserving dynamical system, with being compact, and being the Borel σ-algebra on Ω. If there exists a finite partition or a countable partition with of Ω into intervals, such that T is monotone on each of the intervals, then Theorem 2 covers interval maps, since a noncompact can be replaced by compactification without substantially changing the structure of the given system.
: In light of the statement of Takens and Verbitskiy [
15] mentioned above, it is a natural question whether also
. The general answer is no. Examples with
covering all
are given by Example A1 in
Appendix B.2.
: We also look at case
in the class of maps considered by Bandt et al. in [
4]. For this, let
be an interval,
the Borel
-algebra on it, and
be a finite partition of
into intervals on each of which
T is monotone and continuous. For such a map, it was shown in [
4] that
holds true. Using the fact that Rényi entropy monotonically decreases in
q, this implies
for all
. Let us summarize:
Proposition 1. Let be a measure-preserving dynamical system, with being an interval, and being the Borel σ-algebra on it. Suppose that is a finite partition of Ω into intervals such that T is monotone and continuous on each of the intervals. Thenholds true for all . Quantity
could be considered to be a topological version of permutation entropy. This is justified by (
4) for
T, as defined above (
4), and the following: If
T is continuous on all of
, Misiurewicz and Szlenk showed that
is equal to the topological entropy of
T [
20]. For the definition of topological entropy and the following, see, e.g., [
16].
By variation principle, the topological entropy of a map
T on a compact Hausdorff space is equal to the supremum of the Kolmogorov–Sinai entropy of all Borel measures for which
T is measure-preserving (e.g., [
16]). Often, topological entropy is assumed by the Kolmogorov–Sinai of such a measure. Generally, given a continuous map
T on a metric space, a corresponding Borel measure being measure-preserving has maximal entropy if its Kolmogorov–Sinai entropy coincides with the topological entropy of
T.
On the basis of the discussion above, we show the following statement (see
Appendix B.1).
Corollary 1. Let be a measure-preserving dynamical system, with being an interval, and being the Borel σ-algebra on Ω. Suppose that T is continuous, and there exists a finite partition of Ω into intervals, such that T is monotone on each of those intervals, and μ is a measure of maximal entropy. Thenholds true for all . 4. Conclusions
In this paper, we looked more closely at the recently introduced and used Rényi variant of permutation entropy, depending on a parameter , which is called Rényi permutation entropy (RPE) here. Giving a summary of first applications of RPE, and discussing RPE for some special parameter q, we mainly focused on the asymptotics of RPE for ordinal pattern length going to ∞. This was motivated by the fact that the usual permutation entropy (PE) often asymptotically coincides with Kolmogorov–Sinai entropy, and that, for , Kolmogorov–Sinai entropy can be defined by Rényi entropies instead of Shannon entropies.
This paper showed that, for , asymptotics of RPE can be different from that of PE, meaning that, for long ordinal patterns, the nature of RPE is also not the same as that of PE. One the other hand, it is interesting that, for continuous piecewise monotone interval maps with a measure of maximal entropy and , asymptotics of RPE and PE are the same. Results indicate that the behaviour of general RPE is more specific than that of PE, although the asymptotics of PE is not completely understood. Further work for the better understanding of RPE for large pattern lengths is necessary.
The content of this paper is more or less purely mathematical, but in a certain sense, it justifies the application of RPE in dynamical systems and time series besides PE. Some of the applications mentioned at the beginning of the paper underline the benefit of using RPE. There is, however, the other interesting point that special q address special features; so, for example, is related to recurrence. The symbolic correlation integral related to case is a U-statistic, which is helpful in the statistical analysis of the corresponding entropy. Work on utilizing this fact for testing for asymmetry in temporal data is in progress.