1. Introduction
Entropy is a great tool in thermodynamics and statistical physics. Conceptualized originally by Clausius [
1] as a state descriptor, that distinguishes it from heat, it became a basic principle for statistical and informatics calculations. Its classical form, the Boltzmann entropy has a wide use [
2,
3,
4,
5]. Nevertheless, mostly in mathematical approaches to informatics, its generalizations occurred: altered, non-logarithmic formulas between the probability of a given sate and the total entropy of the system [
6,
7,
8,
9,
10].
Still, the classical logarithmic formula is of widest use, having a number of properties that make it destined to be a convex measure of information and probability. The most known generalization is due to Alfred Renyi [
6], who constructed a formula being also additive for factorizing joint probability, but abandoned the logarithm as the sole function with this property. There is a parameter, occurring as a power of probability, denoted by
q or
. The classical formula emerges in the
limit, formally.
As interesting as the Renyi entropy is, its form is not an expectation value. The
q-entropy as an expectation value was suggested by C. Tsallis [
11,
12,
13], as that form is not additive for factorizing joint probabilities (or being additive for correlated, non-factorizing probabilities). Other and further generalizations, naming more parameters, were also suggested formulated in terms of leading order corrections to the Boltzmann formula in the thermodynamical limit or just utilizing more parameters for possible deformations of the original formula [
9,
14,
15,
16,
17,
18,
19]. Properties of generalized entropy formulas were intensely studied, for two selected examples with respect to the Tsallis entropy see [
20,
21].
In the present paper another viewpoint is presented: (i) first we identify deviations from the classical logarithmic formula derived from deviations from additivity; (ii) then we demonstrate how phase space finiteness effects cause corrections to the additivity of entropy in coincidence with the factorization of probability in a microcanonical approach; (iii) and finally we ask the question of which modified entropy can be the most additive one in this respect. More closely, a general group entropy [
22] is considered following non-addition rules and a limit of its infinite repetitions on small amounts is considered as an asymptotic rule of composition [
23]. Associative rules form group operations, therefore a logarithm of the formal group can be derived from a general composition law, which is then additive. This will be the content of the next section.
There were decade-long discussions about the physical (or statistical) meaning of the parameter
q, the first non-universal parameter occurring in generalized entropy formulas. It may be bound to the sort of the system under discussion, to its material properties, but at the same time it occurs generally in a given class of statistical systems, including informatics, statistical mechanics, dynamics at the edge of chaos, and complex random networks. A few approaches in the quest of uncovering physical mechanisms determining the value of
q in particular cases in which the present author was involved, are in Refs. [
23,
24,
25,
26,
27]. Similar studies by others are copiously cited in review books, cf. [
12,
24].
Following this, a physical interpretation of the parameter
q will be established connected to the finite heat capacity of an environment [
28] and to possible fluctuations in phase space dimensionality. The latter is akin to the superstatistical approach [
29,
30,
31,
32,
33]. A balance between physical factors reducing and increasing the value of
q may ensure the classical
case, however, in a general setting it is not provided. One is then tempted to restore additivity at best—since classical thermodynamics is based on this property. The attempt is made by using a logarithm of the formal group of entropy composition instead of the original entropy,
, and deriving again the associated
parameter. Then
generates a differential equation for the
function, and due to that, a new composition rule.
Finally some examples will be discussed and families of forms will be established. The Boltzmann, Renyi, Tsallis entropies are all special cases, and they represent physical extremes in terms of the heat container capacity and the relative size of superstatistical fluctuations.
2. Logarithm of the Formal Group
It is worth starting our mathematical considerations with the composition law of entropy, or any other real valued physical quantity, when contacting and combining two systems to a bigger, unified one. Abandoning additivity, which ensures the co-extensivity properties of entropy and other extensive quantities, constructed also as expectation values, further composition rules are considered for non-extensive statistical mechanics [
12]. The Abe-Tsallis composition law,
is a particular case for a more general one, described by a two-variable function,
. In such composition rules the entropies,
(i = 1, 2, or 12) can be any state functions,
and assumed to be valid in general, both for equilibrium and non-equilibrium entropies. Here we do not address such questions, just look for the consequences of adopting non-additive rules for the entropy. The value of the parameter
q is usually in the open interval
, in measured cases very often close but not equal to
. Formally, however, it can be any real number, even negative. Certainly for
, the entropies cannot be too large in order to avoid a negative composite result. In thermodynamics we deal mostly with large systems; thus, the requirement of associativity is natural:
Having the third law in mind, on the other hand, zero entropy is a valid value, and its addition must be trivial:
and similarly
. These two requirements already circumvent composition as a group operation. The question of inverse building remains only nontrivial.
Here, the logarithm of the formal group is helpful. It can be shown that, from the associativity Equation (
2), it follows the existence of a monotonous and hence invertible mapping, which maps the general rule to the addition:
The function
is the formal logarithm; it can be constructed asymptotically from the rule
as follows. Imagine we compose from
x to
by adding small amounts,
, a number of time,
N, so that
. Then, in a general step, one proceeds as
The index
n in this change is additive, while
x and
y are not. Seeking for a continuous limit in the variable
between zero and one, we arrive at
Here
and
. Denoting the primitive function of the above integral by
, our result reads as
This form breaks the symmetry between
x and
y. We remedy this problem by re-defining
, i.e., taking steps by the additive quantity
. By this we obtain the K-additivity
This is the sought mapping to additivity; therefore,
is called the formal logarithm. We note by passing this point that due to the continuous limit in the above derivation, the new rule is only asymptotic:
does not always coincide with the starting rule,
. Such asymptotic rules, however, build attractors among all composition rules. The result Equation (
6) can also be obtained by taking the partial derivative of Equation (
4) at
, using
and integrating. The asymptotic rule is a reconstruction of the composition rule from its first derivative at a very small (zero) second argument.
Here, we mention examples. The Tsallis-Abe rule, with , leads to the formal logarithm and does not change in the asymptotics: . A more general rule, using a general function of the product of the composables, on the other hand leads back to the Tsallis-Abe rule with . Triviality requires , of course. The properties and also do hold.
Lesser-known, more complex rules can also be investigated. For example, results in a logarithm of a rational function for . Instead of listing more and more examples, however, let us close this section with a more general comment. Once we change the simple additivity to K-additivity, equivalent to the use of an associative composition rule, the entropy formula in terms of the probability also changes.
Considering an ensemble in the Gibbs sense, the state
i is realized
times, while altogether,
instances are investigated. The probability of being in state
i approaches the
ratio. Since the individual contribution to entropy would be
in the classical approach, a composition of
W such instances from which the
i-th is repeated
times shall be constructed by K-additivity. The logarithm of the probability being additive for factorizing joint probabilities, a non-additive entropy can be constructed by the inverse function of the formal logarithm:
In this way the generalized entropy formula belongs to an ensemble average value (or expectation value):
This formula may need a little more explanation. Since we replaced the original additivity assumption with K-additivity, the additive quantities, reflected in the
formula which is additive for factorizing probabilities, must be a K-function of the non-additive ones. The above Equations (
10) and (
11) are for the non-additive quantities; therefore, the inverse function,
is used on the additive log.
With the example of the Tsallis–Abe composition law, we have
and
. This delivers the Tsallis entropy,
with
as the non-additive, but expectation value-like construction, with the corresponding Rényi entropy,
as the version additive for factorizing probabilities, but not being an expectation value (ensemble average).
Consequently, equilibrium distributions when maximizing the entropy or its monotonous function,
, deliver corresponding solutions of
while keeping an average energy fixed. Both for the Rényi and Tsallis q-entropy the resulting canonical distribution becomes a cut power law in the individual energy,
:
In the
(
) limit the Boltzmann-Gibbs exponential emerges. The factor
Z ensures the normalization,
.
3. q Parameter in the Boltzmannian Approach
At a first glance, it seems arbitrary which composition rule, and consequently which formal logarithm, is to be used in our models. However, the parameters occurring in a modification of the entropy addition law need some connection to physical reality. In this section we first extend the familiar textbook derivation of the exponential canonical distribution by going a step further in the thermodynamical limit expansion and then compare the result with the cut power-law canonical distribution. This gives rise to a possible physical interpretation of the parameter q.
Following the classical argumentation, there is a factor in the probability of a subsystem having energy
out of the total
E occurring as a ration of corresponding phase space volumes:
Here, the averaging is over parallel ensemble copies of the same system, allowing for microscopical fluctuations in parameters beyond the total energy,
E, like particle numbers, charges, etc. The occupied phase space volumes are connected to the entropy,
. Expanding the expression Equation (
16) up to first order in
one arrives at the well known canonical factor
Here
Z is a normalization factor ensuring
. The temperature is interpreted as
. The information about the environment (heat bath) is comprised into this single parameter, traditionally.
Now we look at the consequences of going one step further, i.e., performing an
expansion:
This result is to be compared with the canonical distribution following from the Rényi and Tsallis entropy, to the Tsallis–Pareto distribution [
12,
34] to the same order
Term by term comparison between Equations (
18) and (
19) interprets the parameters
T and
q in the Tsallis–Pareto distribution as being
Denoting
as a fluctuating quantity, one uses its variance in the above interpretation of
q, besides the derivative of the temperature
, with
C being the total capacity of the heat bath in the formula
:
This result allows for
q values both smaller and larger than one; the
remaining a special choice. Indeed, textbooks [
35] suggest that the
is the only possible choice and then conclude that
argues for the ”one over square root law” for energy fluctuations. This argumentation is, however, misleading: the physical situation and size of the heat bath actually present must determine the value of the parameter
q.
4. Optimal Restoration of Additivity
In the present section we shall optimize the choice of instead of S in estimating the phase space volumes above in order to achieve . This requirement leads to a differential equation restricting the function . Since according to the Tsallis-Abe composition law is the additive case, seeking for we call restoration of additivity. This is the best result possible, keeping in mind that the Tsallis-Pareto distribution is also an approximation in case of finite heat bathes, although one term improved beyond the traditional Boltzmann–Gibbs exponential.
The canonical statistical factor in this case transmutes to
Expanding up to
terms, as in the previous section, with the assumption that the
function is universal, i.e., independent from the energy stored in the heat bath, we obtain
Comparing this with a Tsallis–Pareto distribution of the same approximation, we obtain another temperature and different variance parameters,
and
q:
The requirement,
, singles out a
formal logarithm for the entropy composition rule, which optimizes to the subleading order
in general. This results in a simple, solvable differential equation for
:
Here, we again used the fact that
and
. Ordering this equation to
one obtains
Such an equation can be solved by quadrature even for complicated functions of the entropy.
The simplest physical system is an ideal gas: in this case, the heat capacity is independent of the total entropy, so
is a constant. On the other hand, we may also assume that the relative variance in temperature, comprised in the term
, is also a constant with respect to the total entropy. In this simplest case, it is straightforward to obtain the optimal formal logarithm,
from Equation (
26).
Here we present the solution, which contains two parameters:
This ansatz is a “to and back” construction in a
form with
being the formal logarithm associated to the Tsallis–Abe composition rule. The reciprocal inverse of the first derivative of
is obtained as
satisfying the
condition. Substituting this function and its first derivative into Equation (
26) we conclude that
with
It is interesting to check some limits of this expression. For
or in the
case, corresponding to zero fluctuations in the thermodynamical temperature, one obtains
This generates a non-additive entropy formula
The corresponding canonical distribution is a complicated expression involving Lambert’s function.
In the other extreme,
, meaning the presence of an infinite heat capacity (ideal) heat bath, we arrive at
In this case the non-additive entropy formula delivers a Tsallis entropy, cf. Equations (
11) and (
12), and the canonical distribution is a Tsallis–Pareto distribution.
It is most intriguing that the choice
, i.e., stating that the temperature fluctuations are exactly following the inverse square root law,
; and therefore,
, always the Boltzmann–Gibbs formula emerges:
and
S itself is additive.
One realizes that the balance between ensemble fluctuations and the finiteness of the heat bath determines which is the optimally additive entropy formula. It is convenient to use the formal logarithm instead of S as an additive quantity. Still, the traditional balance derived from () is not always given in physical situations. Some might find it strange that the parameter q depends on the heat capacity controlling the reservoir. Here the relation is with the same system’s heat capacity. Analogously, a much simpler correspondence is true for a fixed volume of photon gas, where , since the equation of state is given by , stemming from and .
5. Fluctuations in Phase Space Dimension
One of the most prominent cases when fluctuations are “external”, occurring due to the way of collecting data and not stemming from the finiteness of the heat bath, is the study of single particle energy spectra in high energy collisions. Hadronization makes n particles, event by event a different number, while the total energy shared by them is approximately constant. This situation is opposite to the energy fluctuations with a fixed number of particles (atoms).
In this case the phase space to be filled by individual energies has a fluctuating dimension, while the total energy determining the microcanonical hypersurface is fixed. Then, depending on the actual probability of making n hadrons in a single collision event, , the summed distribution of single particle energy, frequently measured by the transverse momentum for very energetic particles, will differ from the traditionally expected exponential or Gaussian. In order to dwell on this problem, let us first review how the microcanonical phase space is calculated at a given total energy, E, and how dimensions for n particles move in some spatial dimensions.
Phase space is over momenta. Individual energies are functions of momenta according to the corresponding dispersion relation. A number of such relations look like a power of the absolute value, so they can be comprised into an
-norm:
with
individual momentum components, altogether n-dimension in phase space and
E total energy. The
function also reflects the dispersion relation between energy and momenta. For a one dimensional jet
, here simply
and the
norm is to be used. For nonrelativistic ideal gases
, therefore,
and the
norm is used.
For extreme relativistic particles
, and
measures the volume satisfying
The general formula reads as
Originally Dirichlet obtained this formula in a french publication [
36]. More contemporary popularizations are due to Smith and Vamanamurthy [
37] from 1989 and Xianfu Wang [
38] from 2005. Wikipedia also has an entry on this formula [
39] and a recursive proof in few lines can be obtained from [
40]. A microcanonical constrained hypersurface is the derivative of the above volume formula against the total energy
Meanwhile the surface is the derivative against
R.
We define a ratio of volumes, and consider the
case, related to relativistic particles in a 1-dimensional jet:
For normalization we use the pure environmental factor, the above ratio without the single particle factor:
This
is normalized over an integral in the single-particle phase space. This means an integral over
p between
and
, while
with the absolute value:
Due to the
absolute value expression, this integral is twice of that between 0 and
E, whence the factor 2 in the denominator becomes cancelled.
Once its integral is normalized to 1, also the mixture
is normalized to 1, provided that the
sum is also normalized.
Finally, note that the ratio of volumes and energy shells in the 1-dimensional, relativistic case are simply related:
in the
(relativistic)
-norm delivers due to
So we have
and
Finally, we have
In this case the microcanonical ratio,
is normalized to 1 for an integral over
between 0 and
E.
For ideal gases, one considers
, delivering
Here,
q is actually the measure of non-Poissonity,
For the negative binomial distribution (NBD)
, for the Poissonian exactly
. In hadronization statistics the Tsallis–Pareto distribution extracted from transverse momentum distributions and the multiplicity fluctuations event by event go hand in hand [
41,
42,
43]. Such distributions may be a consequence of dynamical random processes, too, as it is investigated in the framework of the Local Growth Global Reset (LGGR) model recently on the master equation level [
44], or earlier in the framework of the generalization of Boltzmann’s kinetic approach and the related H theorem [
45,
46,
47].