1. Introduction
The title of this volume is ‘On the role of geometric and entropic arguments in physics’, and the title of this article could well be ‘On the role of physical arguments in geometry’, where geometry means differential geometry, the mathematical tool which made general relativity possible, and subsequently the theory of symmetry in quantum mechanics and gauge theories of elementary particle physics. Underlying these theories is the assumption that space-time is a differentiable manifold. Our question—which we shall try to answer—is the following: Is the differentiable structure of space-time an invention of mathematicians, or does it follow from physical principles, or to be precise, the principle of relativistic causality?
Simply asking this question represents a departure from the view that mathematics is invented and not discovered. Let us take a brief look—without pretensions to historical completeness—at how this view came to be articulated. For physicists, the marker is surely Wigner’s 1960 paper entitled ‘The unreasonable effectiveness of mathematics in the natural sciences’ [
1]. As an answer to the question ‘What is mathematics?’, Wigner wrote the following: ‘… I would say that mathematics is the science of skillful operations with concepts and rules
invented just for the purpose’ [emphasis added]. Mathematicians may trace it to Georg Cantor, founder of the theory of sets and transfinite numbers, who claimed in 1883 that ‘the very essence of mathematics is its freedom’ [
2]. In citing this sentence, Fraenkel asserted in 1953 that ‘… we shall make plain the possibility of
free creation in mathematics which is not equaled in any other science’ (p. 3 of [
3]; the quotation from Cantor is on p. 4). In 1926, Hilbert had already declared that ‘No one shall expel us from the paradise which Cantor has created for us’ ([
4], p. 170).
However, a year before the publication of Wigner’s paper, the Soviet geometer A. D. Alexandrov pointed out that the interiors of double cones (Note: By a
double cone, we mean the nonempty intersection of a forward and backward light cone.) formed a base for the usual topology on Minkowski space [
5]. In 1964, Zeeman published a paper with the title ‘Causality implies the Lorentz group’ [
6]. The abstract of this paper reads as follows: ‘Causality is represented by a partial ordering on Minkowski space, and the group of all automorphisms that preserve this partial ordering is shown to be generated by the inhomogeneous Lorentz group and dilatations.’ These results suggest the possibility that
some mathematical structures—here a Lie group structure—may be traceable to the laws of physics rather than the human imagination.
Then, in 1983, Donaldson announced a result, using gauge theory, which implied that the differentiable structure of
was not unique [
7]. (Note: The following notations are standard:
,
denote the sets of real numbers, rational numbers and integers, respectively. There is some ambiguity with
, as some authors use it to denote the set of all
nonnegative integers, which includes zero, and others use it to denote the set of positive integers. To avoid confusion, we shall denote the set of positive integers by
and the set of nonnegative integers by
.) In the same year, Gompf established that there were at least three inequivalent differentiable structures on
[
8]. Since
is of particular interest to physicists, these results prompted the late H.-J. Borchers and the present author to undertake a systematic,
mathematical investigation of the mathematical implications of relativistic causality. A full report of these investigations was published in 2006 as a volume in the Springer
Lecture Notes in Physics series [
9].
The end result of these investigations can be summed up as two theorems (omitting some fine points which can be ignored for the present without harm).
Theorem 1. A causal structure can be defined on a countably infinite point set M which has no predefined mathematical structure to it. This causal structure defines a Tychonoff topology on M and therefore a uniformity (called the order uniformity) on it.
The construction begins with defining distinguished, totally ordered subsets called light rays on M (In physical terms, these will be paths of light rays in space-time.). The total order on the light ray l is denoted by . A crucial property of a light ray is the density property, where between any two distinct points x and z on a light ray l lies a third point y, and if then there exists such that .
Theorem 2. The order uniformity can be uniformly completed and the order extended to the uniform completion. According to a theorem of Shirota, the completed space can be embedded as a closed subspace of some , inheriting the differentiable structure of the latter.
The mathematics used in the proof of Theorem 1 was developed ab initio; it does not use results from the existing corpus of mathematics. The proof of Theorem 2 uses a completion process which creates new points, as well as the fact that has a differential structure. Is the completion process a discovery or an invention? Is the differentiable structure of a discovery or an invention?
The aim of this paper is to discuss these two questions, which are interesting, not strictly mathematical and (paraphrasing Bertrand Russell) may not have definite answers. They were not discussed in [
9]. This discussion is carried out in the following section.
2. Why Is Mathematics Considered an Invention?
To answer this question, we have to recapitulate the basic concepts of set theory. We begin with Cantor’s definition of a set and his power set construction.
2.1. What Cantor Did
Cantor defined a set to be ‘a collection of different, distinct objects of our intuition or of our intellect, to be conceived of as a whole’ ([
3], p. 6). Two sets were defined to be
equivalent if they could be put in one-to-one correspondence with each other. For example the sets
and
are equivalent, but the sets
and
are not. The property of equivalence divides the aggregate of finite sets into
equivalence classes, with an equivalence class being characterized by the
number of elements in a member of the set. (Here and in the following, we shall ignore some subtleties. This will not affect our argument materially.)
Using the definition of equivalence, Cantor gave a precise definition of an infinite set: a set is infinite if and only if it is equivalent to a proper subset of itself. For example, the set of positive integers is equivalent to the set of positive even integers, and the invertible map establishes the equivalence. It is an easy exercise to show that the sets (positive integers) and (all integers) are equivalent. Sets equivalent to are called countably infinite or simply countable or enumerable. It is only slightly more difficult to prove the counterintuitive result that the set of rational numbers is countable.
One of Cantor’s most important results was that some infinities are ‘bigger’ than others. The set of real numbers, which is infinite, is not countable. It will suffice to prove this for the reals values in the interval . The proof is carried out by contradiction.
Assume that
is an enumeration of the real values in
. Then,
can be expressed as
where
is the
digit after the decimal point in the decimal expression for
. Now, we form the number
where
Clearly, . This differs from in the jth digit after the decimal point, contradicting the assumption that all real values in are contained in the enumeration Note that there are many such numbers b. The (arbitrary) choice of and works just as well. The sets and are both infinite, but appears to be, in some sense, ‘bigger’ than .
Equivalent finite sets have the same
number of elements. Cantor generalized the notion of numbers to sets equivalent to
and
. He named this generalization
cardinality, or
cardinal numbers, and assigned the symbols
(from the Hebrew letter
ℵ, pronounced ‘aleph’) to the cardinality of
and
c to that of
. (Note: Fraenkel [
3] used
ℵ for the cardinality of
, reserving
for the generic cardinal. Hausdorff in his classic text [
10] also used
ℵ for the cardinality of
.) The cardinality of a finite set is, of course, the number of elements in the set.
Now we come to the critical point: Cantor’s power set construction. According to Cantor’s definition of a set, given
any set
S, its
power set, defined as the
set of subsets of
S and denoted by
or
, is a well-defined set. He showed that
the cardinality of was always greater than that of S. Thus, by starting with a set
S of cardinality
and iterating the power set construction, one obtains a non-terminating sequence of sets
of increasing cardinality, which were labeled
by Cantor. The alephs were called
transfinite cardinals by him. The power set construction displayed the power of ‘free creation’ in mathematics.
2.2. Russell’s Paradox and Its Resolution
In 1901, Bertrand Russell discovered that Cantor’s definition of a set was so general as to be unsustainable. (A good elementary account with references can be found in [
11].) Sets are usually not members of themselves. The set
R of all physicists is definitely not a physicist, and therefore
. According to Cantor, one may define the set
C of
all sets which are
not members of themselves. Then,
C is a member of
C (
), meaning that
C is
not a member of
C (
), which is a contradiction!
Mathematicians quickly realized that Cantor’s notion of sets was too general, and some restrictions had to be put in place. The axiom system (for sets) which gained the greatest acceptance was set up by Zermelo and Fraenkel, and it is known as
ZF [
12]. The power set construction and Cantor’s theory of transfinite numbers remain valid in
ZF. Mathematicians generally work with
ZFC, where
C stands for the independent
axiom of choice.
The conjecture is that is known as the continuum hypothesis. Cantor tried in vain for the rest of his life to prove the continuum hypothesis. In 1940, Kurt Gödel showed that could not be proven in ZFC, and in 1963, Paul Cohen showed the same was true of . Both the continuum hypothesis and its negation were consistent with ZFC.
2.3. Physics and the Power Set Construction
Early in his book, Fraenkel asserted that ‘… we shall make plain that possibility of
free creation in mathematics which is not equaled in any other science’ ([
3], p. 3). This assertion clearly pointed to the power set construction on which the theory of transfinite numbers is based, but how relevant is this construction to physics? We claim that the power set construction does not produce physically meaningful sets when applied to space-time manifolds, which are the most important uncountably infinite sets used in physics.
In physics, the notion of a geometrical point is an idealization. The mere recording of the infinitely many digits of an irrational number will demand infinite physical resources and infinite time. The position of a point in space can only be determined approximately. From the empirical point of view, the important notion is the slightly fuzzy one of the neighborhood of a point. While the notion of the continuum is not directly accessible to experimentation, it is the foundation on which the notions of differentiable and analytic functions are based, and these notions are the basic tools of discovery of the theoretical physicist. The predictive power of general relativity would not be diminished if its differential equations were to be replaced by difference equations, but could they have been discovered in the difference equation form?
In brief, in physics one can never separate a geometrical point from its neighborhood. (Note: To be precise, here neighborhood is taken to mean neighborhood in the standard topology of space-time.) But that is exactly what the power set construction does; it destroys the essential, necessarily fuzzy notion of the neighborhood when applied to any continuum!
2.4. The Mathematics Needed by Physics
For this reason, the present author contends that the mathematics which derives from application of the power set construction to infinite sets is not relevant to physics. The only infinities which physics needs are the integers and rational and real numbers (the continuum). It may be useful to know that their cardinalities and are different, but the continuum hypothesis has no relevance.
The moment one drops the power set construction (for infinite sets), one loses the raison d’être for the contention that mathematics is an invention of the human mind.
3. From the Discrete to the Continuous
As was already pointed out, the mathematics used for establishing Theorem 1 was developed ab initio. This is no longer the case with Theorem 2.
The mathematics used to establish Theorem 2 is, first and foremost, the process called completion in mathematical analysis. It would suffice to consider metric completion, the archetype of which is the completion of the rational numbers to the real numbers under the metric .
To explain the process, we need to recall the following well-known definitions and examples:
Definition 1. A sequence of real numbers is called a Cauchy sequence
if, given any
, there exists a positive integer N such that Example 1. Consider the sequence . This is a Cauchy sequence, because for any , there exists a positive integer N such that . Then, for , we have such that Definition 2. Let be a subset of . A sequence in is said to converge to s if there exists such that, given ϵ, there exists N such that for all .
The sequence converges to 0 in any interval which contains the point 0. It does not converge in intervals and , which do not contain the point 0.
We need one last definition, namely that of completeness:
Definition 3. A metric space S (with metric is said to be complete if every Cauchy sequence in S converges (to a point in .
For uniform spaces, (A
uniform structure in a space is weaker than a metric structure, but it is a space which admits a differentiable structure [
13].) the notion of completeness still holds. One has to only replace the sequences with filters.
Example 2. The space of real numbers , with the metric , is complete. The space of rational numbers (with the same metric) is not complete. All closed intervals are complete, and all open intervals are incomplete.
Example 3. Consider the sequence defined by It can be shown, with some effort, that the sequence defined by Equation (1) is a Cauchy sequence which converges to . It converges in but not in (since is irrational). This algorithm of approaching with successive approximations (calculating from ) was known in several ancient civilizations.
The process of completing to —passing from the discrete to the continuous—is exemplified perfectly by the simple example described above, which defines the irrational number with a Cauchy sequence of rationals, namely the sequence . More generally, in a complete metric space, every Cauchy sequence (Cauchy filter in complete non-metric Tychonoff spaces) converges, and one completes an incomplete space by defining new points with Cauchy sequences which do not converge in the original space. Every point in the complete space may equally be regarded as a Cauchy sequence. In the completed space, all Cauchy sequences converge.
4. Putting the Pieces Together
Assuming we have agreed that the power set construction is irrelevant to physics, the two questions raised at the end of
Section 1 remain to be answered. Is the continuum (space-time, for example) a discovery or an invention? In other words, is the convergence of every Cauchy sequence in space-time a fact of nature or merely a mathematical artifice? Is the differentiable structure of
a discovery or an invention?
4.1. Is the Continuum a Discovery or an
Invention?
The totality of the experimental data can be represented as a finite set of n tuples of rational numbers. The convergence or divergence of an infinite series depends on its infinite ‘tail’ and not on its finite ‘head’. It is not possible to determine through experimentation (rather, observation) whether or not an arbitrary sequence of numbers is a Cauchy sequence for the simple reason that it is impossible to write down explicitly all the terms of an infinite sequence on a piece of paper.
The laws of physics—classical and quantum mechanics and electrodynamics, relativity theory as well as most forms of quantum field theory—are based on the assumption that space, time and space-time are continua. However, this assumption may no longer be tenable if Einstein’s theory of gravitation turns out to be quantizable. Although these attempts have not succeeded thus far, they are still continuing in the form of string theory. The present situation may therefore be summed-up as follows.
Mathematical continua are well-defined entities. Whether or not they are identifiable with physical space, time or space-time has not yet been settled. An affirmative answer—which, in the present state of knowledge, would require the possibility of examining infinite sequences on a blackboard—seems to be out of reach. A negative answer may be possible if gravity can be quantized successfully.
4.2. Is the Differentiable Structure of
a Discovery or an Invention?
Given the one-dimensional mathematical continuum , one may argue that differentiable functions on it are provided ‘by nature’, and no human artifice like the completion process is needed; that is, the differentiable structure of , and likewise that of , is a discovery and not an invention. This would seem to be reinforced by the existence of ‘exotic’ differentiable structures on . On the other hand, if the continuum itself is a human invention, it is surely legitimate to ask how nature can provide mathematical structures on objects which do not ‘exist’ in nature.
It appears that the question posed in the title of this paper may well be unanswerable. It is, however, surprising how far the principle of relativistic causality has led us toward an answer.