1. Introduction
Studies of a calculus based on generalized forms of arithmetic were initiated in the late 1960s by Grossman and Katz, resulting in their little book
Non-Newtonian Calculus [
1,
2,
3]. Some twenty years later, the main construction was independently discovered in a different context and pushed in a different direction by Pap [
4,
5,
6]. After another two decades the same idea, but in its currently most general form, was rediscovered by myself [
7,
8,
9,
10,
11,
12,
13,
14,
15]. In a wider perspective, non-Newtonian calculus is conceptually related to the works of Rashevsky [
16] and Burgin [
17,
18,
19,
20] on non-Diophantine arithmetics of natural numbers, and to Benioff’s attempts [
21,
22,
23,
24,
25] of basing physics and mathematics on a common fundamental ground. Traces of non-Newtonian and non-Diophantine thinking can be found in the works of Kaniadakis on generalized statistics [
26,
27,
28,
29,
30,
31,
32,
33,
34]. A relatively complete account of the formalism can be found in the forthcoming monograph [
35].
In the paper, we will discuss links between generalized arithmetics; non-Newtonian calculus; generalized entropies; and classical, quantum, and escort probabilities. As we will see, certain constructions such as Rényi entropies or exponential families of probabilities have direct relations to generalized arthmetics and calculi. Some of the constructions one finds in the literature are literally non-Newtonian. Some others only look non-Newtonian, but closer scrutiny reveals formal inconsistencies, at least from a strict non-Newtonian perspective.
Our goal is to introduce non-Newtonian calculus as a sort of unifying principle, simultaneously sketching new theoretical directions and open questions.
2. Non-Diophantine Arithmetic and Non-Newtonian Calculus
The most general form of non-Newtonian calculus deals with functions
A defined by the commutative diagram (
and
are arbitrary bijections)
The only assumption about the domain and the codomain is that they have the same cardinality as the continuum . The latter guarantees that bijections and exist. The bijections are automatically continuous in the topologies they induce from the open-interval topology of , even if they are discontinuous in metric topologies of and (a typical situation in fractal applications, or in cases where or are not subsets of ). In general, one does not assume anything else about and . In particular, their differentiability in the usual (Newtonian) sense is not assumed. No topological assumptions are made about and . Of course, the structure of the diagram implies that and may be regarded as Banach manifolds with global charts and , but one does not make the usual assumptions about changes of charts.
Non-Newtonian calculus begins with (generalized, non-Diophantine) arithmetics in
and
, induced from
,
(and analogously in
).
Example 1. According to one of the axioms of standard quantum mechanics, states of a quantum system belong to a separable Hilbert space. All separable Hilbert spaces are isomorphic, so state spaces of any two quantum systems are isomorphic. Does it mean that all quantum systems are equivalent? No, it only shows that mathematically isomorphic structures can play physically different roles. Similarly, the arithmetic given by (2)–(5) is isomorphic to the standard arithmetic of , but it does not imply that the two arithmetics are physically equivalent. Example 2. The origin of Einstein’s special theory of relativity goes back to the observation that the velocity of a source of light does not influence the velocity of light itself, contradicting our everyday experiences with velocities in trains or football. Relativistic addition of velocities is based on a fundamental unit c and the dimensionless parameter β, related to velocity by . while the bijection reads . The velocities are added or subtracted by means of (2) and (3), Interestingly, (4) and (5) are not directly employed in special relativity. The presence of the fundamental unit c is a signature of a general non-Diophantine arithmetic (which typically works with dimensionless numbers). Numbers play the roles of infinities, . The velocity of light is therefore literally infinite in the non-Diophantine sense. The neutral element of multiplication, (i.e., ), does not seem to play in relativistic physics any privileged role. Sometimes, for example in the context of Bell’s theorem, one works with mixed arithmetics of the form [
13]
Mixed arithmetics naturally occur in Taylor expansions of functions whose domains and codomains involve different arithmetics, and in the chain rule for derivatives (see Example 6).
In order to define calculus one needs limits “to zero”, and thus the notion of zero itself. In the arithmetic context a zero is a neutral element of addition, for example,
for any
. Obviously, such a zero is arithmetic-dependent. The same concerns a “one”, a neutral element of multiplication, fulfilling
for any
. Once the arithmetic in
is specified, both neutral elements are uniquely given by the general formula:
for any
. Therefore, in particular,
,
. One easily verifies that
for all
, which extends also to mixed arithmetics,
If there is no danger of ambiguity one can simplify the notation by
or
. Mixed arithmetics can be given an interpretation in terms of communication channels. Mixed multiplication is in many respects analogous to a tensor product [
13].
Example 3. Consider , , , , , . “Two plus two equals four” looks here as follows,where , . From the point of view of communication channels the situation is as follows. There are two parties (“Alice” and “Bob”), each computing by means of her/his own rules. They communicate their results and agree the numbers they have found are the same, namely, “two” and “four”. However, for an external observer (an eavesdropper “Eve”), their results are opposite, say and . Mixed arithmetic plays a role of a “connection” relating different local arithmetics. This is why, in the terminology of Burgin, these types or arithmetics are non-Diophantine (from Diophantus of Alexandria who formalized the standard arithmetic). Similarly to nontrivial manifolds, non-Diophantine arithmetics do not have to admit a single global description (which we nevertheless assume in this paper). A limit such as
is defined by the diagram (
1) as follows,
i.e., in terms of an ordinary limit in
. A non-Newtonian derivative is then defined by
if the Newtonian derivative
exists. It is additive,
and satisfies the Leibniz rule,
A general chain rule for compositions of functions involving arbitrary arithmetics in domains and codomains can be derived [
12] (see Example 6). It implies, in particular, that the bijections defining the arithmetics are themselves always non-Newtonian differentiable (with respect to the derivatives they define). The resulting derivatives are “trivial”,
A non-Newtonian integral is defined by the requirement that, under typical assumptions paralleling those from the fundamental theorem of Newtonian calculus, one finds
which uniquely implies that
Here, as before,
is defined by (
1) and
denotes the usual Newtonian (Riemann, Lebesgue, etc.) integration. To have a feel of the potential inherent in this simple formula, let us mention that for a Koch-type fractal (
24) turns out to be equivalent to the Hausdorff integral [
12,
36,
37]. In applications, typically the only nontrivial element is to find the explicit form of
. It should be stressed that (
24) reduces any integral to the one over a subset of
. The fact that such a counterintuitive possibility exists was noticed already by Wiener in his 1933 lectures on Fourier analysis [
38].
3. Non-Newtonian Exponential Function and Logarithm
Once we know how to differentiate and integrate, we can turn to differential equations. The so-called exponential family plays a crucial role in thermodynamics, both standard and generalized [
39,
40,
41,
42,
43]. Many different deformations of the usual
can be found in the literature. However, from the non-Newtonian perspective, the exponential function
is defined by
Integrating (
25) (in a non-Newtonian way) one finds the unique solution
In thermodynamic applications, one often encounters exponents of negative arguments, . In a non-Newtonian context the correct form of a minus is . The example discussed in the next section will involve and (r). In consequence, it will be correct to write , but in general such a simple rule may be meaningless (because “−”, as opposed to , may be undefined in ).
Example 4. Let , with the arithmetic defined by , , . Then The same number can be both positive and negative, depending on the arithmetic.
A (natural) logarithm is the inverse of
, namely,
,
Expressions such as
are in general meaningless even if
and
. However, formulas such as
make perfect sense. For example, if
, then an entropy can be defined as
Many intriguing questions occur if one asks about normalization of probabilities. We will come to it later.
Non-Newtonian constructions of Exp and Ln are systematic, general, and flexible. There seems to exist a relation between the arithmetic formalism and the method of monotone embedding discussed in information geometry [
44], but the problem requires further studies.
Example 5. In order to appreciate the difference between Newtonian and non-Newtonian differentiation let us differentiate the function , , but in two cases. The first one is trivial, , with the arithmetic defined by the identity . Then, the non-Newtonian and Newtonian derivatives coincide, so The second case involves, as before, the codomain , with the arithmetic defined by the identity . However, as the domain we choose , with the arithmetic defined by , , . Now, As, , we find , and conclude that , belongs to the exponential family. Indeed, To understand the result, write , so that . Then, by the second form of derivative in (18), The map A does not affect the value of x, but changes its arithmetic properties. It behaves as if it assigned a different meaning to the same word. The example becomes even more intriguing if one realizes that logarithm is known to approximately relate stimulus with sensation in real-life sensory systems (hence the logarithmic scale of decibels and star magnitudes) [35]. Example 6. Many calculations in thermodynamics reduce to formulas of the formbeing equivalent to the derivative of a composite function of several variables. The latter has a unique formulation in non-Newtonian calculus: One only needs to specify the arithmetics. For example, let U be a map , and let , . Then, Under the usual assumptions about continuity of inwe reduce (40) toand then to two instances of the non-Newtonian chain rule,valid for the compositionof maps. Finally, Effectively,is the non-Newtonian formula for a differential. The next section shows that the above mentioned subtleties with arithmetics of domains and codomains have straightforward implications for generalized thermostatistics.
4. Kaniadakis -Calculus Versus Non-Newtonian Calculus
Kaniadakis, in a series of papers [
26,
27,
28,
29,
30,
31,
32,
33,
34], developed a generalized form of arithmetic and calculus, with numerous applications to statistical physics, and beyond. In the present section, we will clarify links between his formalism and non-Newtonian calculus. As we will see, some of the results have a straightforward non-Newtonian interpretation, but not all.
Assume
, with the bijection
given explicitly by
Kaniadakis’
-calculus begins with the arithmetic,
As
, the case
corresponds to the usual field
, which we will shortly denote by
. The neutral element of addition,
, is the same for all
s. The neutral element of
-multiplication is nontrivial,
. The fields
are isomorphic to one another due to their isomorphism with
,
Kaniadakis defines his
-derivative of a real function
as
We will now specify in which sense the
-derivative is non-Newtonian. First consider a function
A,
Its non-Newtonian derivative
if compared with (
55), suggests
. Setting
,
, we find
as
for
. Denoting
we find
, and
in agreement with the Kaniadakis formula. However, as a by-product of the calculation we have proved that
-calculus is applicable only to functions mapping
into
. Kaniadakis exponential function satisfies
with
,
. Accordingly,
which is indeed the Kaniadakis result. Recalling that
, we find the explicit form of the logarithm,
,
which again agrees with the Kaniadakis definition.
Yet, the readers must be hereby warned that it is
not allowed to apply the Kaniadakis definition of derivative to
. The correct non-Newtonian form is
because
maps
into
. Kaniadakis is aware of the subtlety and thus introduces also another derivative, meant for differentiation of inverse functions,
a definition which, from the non-Newtonian standpoint, must be nevertheless regarded as incorrect (‘/’ should be replaced by
typical of the codomain
). As a result,
This is probably why (
64), as opposed to (
55), has not found too many applications.
Let us finally check what would have happened if instead of (
61) one considered the exponential function mapping
into itself,
,
As in thermodynamic applications one typically encounters
of a negative argument, one expects that physical differences between
and
should not be essential. Moreover, indeed,
Figure 1 shows that both exponents lead to identical asymptotic tails.
5. A Cosmological Aspect of the Kaniadakis Arithmetic
Kaniadakis explored possible relativistic implications of his formalism. In particular, he noted that fluxes of cosmic rays depend on energy in a way that seems to indicate
. It is therefore intriguing that essentially the same arithmetic was recently shown [
14] to have links with the problem of accelerated expansion of the Universe, one of the greatest puzzles of contemporary physics.
Cosmological expansion is well described by the Friedman equation,
for a dimensionless scale factor
evolving in a dimensionless time
t (in units of the Hubble time
yr). The observable parameters are
,
[
45,
46].
is typically interpreted as an indication of dark energy. Equation (
67) is solved by
Now assume that
whereas the Friedman equation involves no
,
for some
. Its solution by non-Newtonian techniques reads
so, comparing (
71) with (
68), we find
Accelerated expansion of the Universe looks like a combined effect of non-Euclidean geometry and non-Diophantine arithmetic. The resulting dynamics is non-Newtonian in both meanings of this term.
The presence of the inverse bijection
and
raises a number of interesting questions. It is related to the fundamental duality between Diophantine and non-Diophantine arithmetics. Namely, any equation of the form, say
can be inverted by
into
suggesting that it is ⊕ and not + which is the Diophantine arithmetic operation. Having two isomorphic arithmetics we, in general, do not have any criterion telling us which of the two is “normal”, and which is “generalized”.
6. Kolmogorov–Nagumo Averages and Non-Diophantine/Non-Newtonian Probability
Another non-Diophantine/non-Newtonian aspect that can be identified in the context of information theory and thermodynamics is implicitly present in the works of Kolmogorow, Nagumo, and Rényi. Let us recall that a Kolmogorov–Nagumo average is defined as [
47,
48,
49,
50,
51,
52,
53,
54]
Rewriting (
75) as
where
, one interprets the average as the one typical of a non-Diophantine-arithmetic-valued probability. Apparently, neither Kolmogorov nor Nagumo nor Rényi had interpreted their results from this arithmetic point of view [
7].
The lack of arithmetic perspective is especially visible in the works of Rényi [
49] who, while deriving his
-entropies, began with a general Kolmogorov–Nagumo average. Trying to derive a meaningful class of
fs he demanded that
be valid for any constant random variable
c, and this led him to the exponential family
(up to a general affine transformation
, which does not affect Kolmogorov–Nagumo averages). In physical applications, it is more convenient to work with natural logarithms, so let us replace
by
,
,
. With this particular choice of
f one finds
As is well known, the standard linear average is the limiting case
that includes the entropy of Shannon,
, as the limit
of the Rényi entropy
Still, notice that
for any
f, so had Rényi been thinking in arithmetic categories, he would not have arrived at his
. Yet,
is an interesting special case. For example,
The random variable
is, according to Shannon [
49,
55], the amount of information obtained by observing an event whose probability is
. The choice of
b defines units of information. Therefore, Rényi’s non-Diophantine probability
is the amount of information encoded in
.
7. Escort Probabilities and Quantum Mechanical Hidden Variables
Non-Diophantine arithmetics have several properties that make them analogous to sets of values of incompatible random variables in quantum mechanics. Generalized arithmetics and non-Newtonian calculi have nontrivial consequences for the problem of hidden variables and completeness of quantum mechanics.
Example 7. Pauli matrices and represent random variables whose values are and , respectively. However, it is not allowed to assume that represents a random variable whose possible values are , even though an average of ia a sum of independent averages of and . In non-Diophantine arithmetic one encounters a similar problem. In general it makes no sense to perform additions of the form even if and . One should not be surprised if non-Diophantine probabilities turn out to be analogous to quantum probabilities, at least in some respects.
Normalization of probability implies
In principle,
. An interesting and highly nontrivial case occurs if both
and
are probabilities in the ordinary sense, i.e., in addition to (
81) one finds
,
, and
. What can be then said about
f? We can formalize the question as follows.
Problem 1. Find a characterization of those functions that satisfy In analogy to the generalized thermostatistics literature we can term
the escort probabilities [
56,
57,
58]. Notice that we are
not in interested in the trivial solution, often employed in the context of Tsallis and Rényi entropies, where
is replaced by
and then
renormalized,
as
for a single function
g of one variable. As we will shortly see, the solution of (
82) turns out to have straightforward implications for the quantum mechanical problem of hidden variables, and relations between classical and quantum probabilities.
The most nontrivial result is found for binary probabilities, .
Lemma 1. for all if and only ifwhere . The lemma has profound consequences for foundations of quantum mechanics, as it allows to circumvent Bell’s theorem by non-Newtonian hidden variables. For more details the readers are referred to [
13,
15], but here just a few examples.
Example 8. The trivial case implies , where and .
Example 9. Consider . Then, Now let be the probability of finding a point belonging to the overlap of two half-circles rotated by θ. Then,is the quantum-mechanical law describing the conditional probability for two successive measurements of spin-1/2 in two Stern–Gerlach devices placed one after another, with relative angle θ. Escort probability has become a quantum probability. Example 10. Let us continue the analysis of Example 9. Function , , is one-to-one. It can be continued to the bijection by the periodic repetition, Now let . (88) leads to a non-Diophantine arithmetic and non-Newtonian calculus. Let , , be an angle between two vectors representing directions of Stern-Gerlach devices. Quantum conditional probability (87) can be represented in a non-Newtonian hidden-variable form,where . Here, ρ is a conditional probability density of non-Newtonian hidden-variables (the half-circle is a result of conditioning by the first measurement). Non-Newtonian calculus shifts the discussion on relations between classical and quantum probability, or classical and quantum information, into unexplored areas.
Example 11. In typical Bell-type experiments one deals with four probabilities, corresponding to four combinations , of pairs of binary results. The corresponding non-Newtonian model is obtained by rescaling , with . The rescaled bijection satisfies for any . Explicitly, The resulting hidden-variable model is local, but standard Bell’s inequality cannot be proved [15]. Why? Mainly because the non-Newtonian integral is not a linear map with respect to the ordinary Diophantine addition and multiplication (unless f is linear), whereas the latter is always assumed in proofs of Bell-type inequalities. A generalization to arbitrary probabilities,
, leads to an affine deformation of arithmetic, an analogue of Benioff number scaling [
21,
22,
23,
24,
25]. Affine transformations do not affect Kolmogorov–Nagumo averages.
Lemma 2. Consider probabilities , . are probabilities for any choice of if and only if , .
The bijection g implied by Lemma 2 depends on n. In infinitely dimensional systems, that is when n can be arbitrary, the only option is and thus is the only acceptable solution. However, in spin systems there exits an alternative interpretation of this property: The dimension n grows with spin in such a way that with is a correspondence principle meaning that very large spins are practically classical. The transition non-Diophantine → Diophantine, non-Newtonian → Newtonian becomes an analogue of non-classical → classical.
Example 12. Limitations imposed by Lemma 2 can be nevertheless circumvented in various ways. For example, let for a solution g from Lemma 1, so that . Obviously, Replacing each of the 1s by an appropriate sum of binary conditional probabilitieswe can generate various conditional classical or quantum probabilities typical of a generalized Bernoulli-type process, representing several classical or quantum filters placed one after another. 8. Non-Newtonian Maximum Entropy Principle
Let us finally discuss the implications of our non-Newtonian form (
32) of entropy for maximum entropy principles. Assume probabilities belong to
. Define the Massieu function [
43] by
where
, and
,
are Lagrange multipliers. Explicitly,
Vanishing of the derivative of
,
is equivalent to the standard formula for probabilities
(see the second form of non-Newtonian derivative in (
18)),
Accordingly, the solution reads
and involves the exponential function
we have encountered before. The normalization,
implies the usual relation
.
Equivalently, directly at the level of
,
All the standard tricks one finds in thermodynamics textbooks will work here. For example,
for some function
we yet have to determine. Clearly,
where
,
. Ultimately,
9. Final Remarks
Non-Newtonian calculus, and the non-Diophantine arithmetics behind it, are as simple as the undergraduate arithmetic and calculus we were taught at schools. Their conceptual potential is immense but they remain largely unexplored and unappreciated. Apparently, physicists in general do not feel any need of going beyond standard Diophantine arithmetic operations, in spite of the fact that the two greatest revolutions of the 20th century physics were, in their essence, arithmetic (i.e., relativistic addition of velocities and quantum mechanical addition of probabilities). It is thus intriguing that two of the most controversial issues of modern science—dark energy and Bell’s theorem—reveal new aspects when reformulated in generalized arithmetic terms.
One should not be surprised that those who study generalizations of Boltzmann–Gibbs statistics are naturally more inclined to accept non-aprioric rules of physical arithmetic. Anyway, the very concept of non-extensivity, the core of many studies on generalized entropies, is implicitly linked with generalized forms of addition, multiplication, and differentiation [
54,
59,
60,
61].