2.1. Minkowski Spacetime
The causal structure on
to be preserved is the null cone, embodied in a metric tensor
of
, a symmetric and non-degenerate bilinear form that defines a scalar product on
. The metric
is such that for any vectors
, there exists a basis
of
such that
where the scalar
c is the speed of light, and
and
define the components
and
of
and
., respectively. Note the summation convention, with Greek indices taking values in
, with letters
near the beginning of the Greek alphabet preferred for dummy indices, and letters
from later in the alphabet preferred for free indices. Let
be the dual space of
and
the basis dual to
. With respect to this dual basis, it is apparent from Equation (
1) that
The null cone of
is the set of all vectors
such that
.
The particular basis
considered above is not the only one for which the scalar product
takes not only the value, but the algebraic form exhibited in Equation (
1). Let
be the Minkowski matrix
and
the matrix
collecting the components
of
. Note that latin indices take values in
, and that letters
near the middle of the alphabet are preferred for free indices; letters
near the beginning of the alphabet will be preferred for dummy indices. With respect to the particular basis considered above, it follows from Equation (
2) that the matrix representing
is the Minkowski matrix:
Let
and
be the
n-column representations of
with respect to the considered basis (see
Appendix A). Then, Equation (
1) is expressed by the matrix equation
Lorentz transformations are the invertible linear transformations
of
that preserve the scalar product defined by
(without also transforming
):
The Lorentz transformations constitute a subgroup of
. With respect to the considered basis, this preservation of the scalar product reads
(Recall from
Appendix A that
is taken to act on the basis elements rather than on the
n-columns collecting the vector components). With slight ambiguity, refer to both the set of Lorentz transformations, and the set of matrices whose elements
faithfully represent them and are such that the Minkowski matrix is preserved according to the relation
as the Lorentz group
. Under (suitable representations of) Lorentz transformations, the matrix expression of Equation (
6) for the scalar product is indifferent to a change of basis
:
Call a ‘Minkowski basis’ any basis of
for which Equation (
5) holds so that the inner product
is given by Equation (
6) with η the Minkowski matrix of Equation (
3), which yields the arithmetic form of Equation (
1).
The definition of the null cone of
as the set of all vectors
such that
, together with the invariance of the scalar product as the defining property of Lorentz transformations (Equation (
7)), implies that the null cone is preserved under Lorentz transformations.
Elements
of the identity component
of the Lorentz group (the connected component containing the identity), also called the restricted Lorentz group or proper orthochronous Lorentz group, can be uniquely factored into a ‘boost’ and a ‘rotation’. With respect to a Minkowski basis
of
,
Here,
with
a rotation of the subspace
of
spanned by
;
is the orthogonal complement (relative to
) of the 1-dimensional subspace spanned by
. Moreover,
is a boost that can be parametrized as
where the 3-column
is the boost velocity parameter,
is the Lorentz factor associated with
, and
is the squared Euclid norm with respect to an orthonormal basis of
(naturally appropriate to a Minkowski basis of
). Thus,
The inverse is
where
is obtained from
via
.
Because
is non-degenerate, its matrix representation
of Equation (
4) has an inverse
written here in a way suggestive of the fact that
collects the components
resulting from the evaluation of the inverse metric
on elements of a basis of
. The inverse Minkowski matrix is
so that
with respect to a Minkowski basis—say, the same basis used to obtain Equation (
8) from Equation (
7). Given a Lorentz transformation
acting on
, the dual space
is transformed by the algebraic adjoint of the inverse transformation,
; see
Appendix A. The inverse metric
defines a scalar product on
, also Lorentz-invariant, in the sense that for any
,
This implies the preservation of the inverse Minkowski matrix,
consistent with Equation (
8).
Equipped with a metric and its inverse , the affine spacetime (and flat differentiable manifold) with its underlying vector space enjoys a fullness of the apparatus of tensor algebra (and tensor calculus).
The tensor algebra includes metric duality between vectors and linear forms manifest in conventions for the raising and lowering of indices. Associated with a vector
, which is a
tensor, is a linear form
, which is a
tensor. This is expressed in matrix notation as
(so that
is represented as a 4-row
), and in terms of indexed components as
. Associated with a covector
, which is a
tensor, is a vector
, which is a
tensor. This is expressed in matrix notation as
(so that
is represented as a 4-column
), and in terms of indexed components as
. Consider also a bilinear form
, which is a
tensor, taking values
for
. It is associated by metric duality with a
tensor
defined by
and a
tensor
defined by
, where in both cases,
. In matrix notation,
and
. In terms of indexed components,
and
. In terms of the infix dot operator (·),
express the above relations describing metric duality.
As for tensor calculus,
is a pseudo-Riemann manifold; the natural affine connection
∇ possessed by an affine space mentioned at the end of
Appendix A becomes a Levi–Civita connection associated with
. An orientation on
is specified with a volume form on
, the Levi–Civita tensor
defined such that
with components
for a right-handed Minkowski basis. With respect to another right-handed but otherwise arbitrary basis
, Equation (
A11), together with the matrix relation
, show that in the more general basis the components are given by
where
. Raising all four indices yields
with respect to a general basis, or
with respect to a Minkowski basis.
A metric
that makes the volume form
also a Levi–Civita tensor makes available the Hodge star operator that provides a bijection between
p-forms and
-forms. In particular, a 2-form
is related to a complementary 2-form
by
where
is the version of
with both indices raised, and the expression with a double contraction
reads
. The bijective nature of the Hodge duality relation is manifest in its ‘invertibility’, in the sense that (again for a 2-form)
One application concerns 4-vector fields
and
, for which the identities
and
hold, where
and
.
2.2. Galilei–Newton Spacetime
Galilei–Newton spacetime
might in a more or less literal sense be regarded as a ‘degeneration’ of Minkowski spacetime
as
. In many respects, one obtains a smooth limit, but crucially, the limit of the metric
does not exist so that a qualitatively different geometric structure results. In particular, Equation (
2) asymptotes as
This indicates that the covector (linear form)
becomes the fundamental causal structure on
. With respect to what once was a Minkowski basis—now to be called a Galilei basis—it is represented by the 4-row
The other remnant that survives from Minkowski spacetime is a limit that does exist as
, namely,
from Equation (
13). With respect to a Galilei basis it is represented by
and does not qualify as an inverse metric on
because for
(and similarly for any scalar multiple thereof)
for any
, that is, it is degenerate in the technical sense.
The homogeneous Galilei group
consists of the linear transformations
of
that preserve these structures, and it turns out that they are the
limit of the Lorentz transformations. Require first that
for any
. When expressed with respect to a Galilei basis, this requirement implies
Require also that for any
,
which implies
when expressed with respect to a Galilei basis (compare Equation (
14)).
As with the restricted Lorentz group, the elements
of the identity component
of
can be uniquely factored into a boost and a rotation. With respect to a Galilei basis, these read
where
is the same as in Equation (
9), and the Galilei boost
is the
limit of Equation (
10) so that
with
and
. The inverse is
where
is obtained from
via
. It is easy to see that the matrices
satisfy the above conditions for the invariance of
and
.
Without a spacetime metric, the tensor algebra and tensor calculus on the affine spacetime are more limited. In particular, there is no metric duality, no ‘raising and lowering of indices’, for tensors on or tensor fields on : the type of a particular tensor is fixed. As will be discussed shortly, there is metric duality on a subspace of , and for later notational consistency a double arrow adorns the degenerate inverse ‘metric’ . For regarded as a tensor on , however, this must not be associated with metric duality, but simply as an integral part of the symbol denoting this particular tensor of fixed type .
A volume form exists, given by Equation (
A10) with the ‘preferred basis’ being a Galilei basis, but is not the traditional Levi–Civita tensor associated with a metric. There is no Hodge star operator because there is no inverse metric, though one can define something partly comparable—a ‘slash-star operator’—using the Galilei-invariant
instead of
available only on
. For a 2-form
, the analog of Equation (
15) is
where now the raised-index object is interpreted as
and has had its time components projected out, and
in contrast to Equation (
16) as a consequence of the degeneracy of
.
2.3. Spacetime Foliation and Tensor Decomposition
Humans and their measuring instruments do not apprehend spacetime directly, but only perceive happenings in nearby ‘space’ at successive instants of ‘time’. Thus, if a physical theory is formulated in terms of tensor fields on spacetime, comparison with human observations requires a means of decomposing spacetime and tensor fields thereon into structures compatible with perceptions experienced and recorded in this way. The key tensor structures on the vector spaces and underlying the affine spacetimes and , along with the symmetry groups compatible with them, enable such decompositions. The manner in which these structures describe time implies a notion of space: given an event in or , the subset of events ‘simultaneous’ to constitutes ‘space’ according to an observer at . A notion of time also embodies ‘causality’: if the value of a physical variable at event in or is to influence the value of a physical variable at event , event must ‘precede’ event .
Affine spacetimes permit ‘inertial observers’ with straight worldlines and no rotation, and the splitting of spacetime into space and time as perceived by a single inertial observer is formally similar on
and
. Select an event
of
or
as origin, and a Minkowski basis of
or a Galilei basis of
accordingly, designated
in either case. Such bases are determined by the metric
and the Lorentz transformations, which preserve it in the case of
, or by the covector
and the
tensor
and the homogeneous Galilei transformations, which preserve them in the case of
, as discussed in
Section 2.1 and
Section 2.2. For these affine spacetimes, such choices of origin and vector basis determine a global coordinate system, as in Equation (
A2); call these coordinates
, with
t the time coordinate and
the space coordinates. The
t coordinate curve passing through
is the straight line
Interpret
as as the worldline of a fiducial (and inertial) observer, whose location in
or
when the ideal clock (which marks time at a constant rate) reads time
t is the event
. Let
be the subspace of
or
spanned by
. For a given time
, consider a one-to-one mapping
The image of this mapping,
is a hyperplane (a 3-dimensional affine subspace) of
or
through the event
. Interpret
as ‘space’, that is, position space, according to the fiducial observer with straight worldline
at the time
t: each of its points also has time coordinate
t, and together they constitute a surface of simultaneity with the fiducial observer. Each hypersurface
is a level surface of (abusing notation) the coordinate function
t; these hypersurfaces partition spacetime, and the complete collection
is said to be a foliation of
or
.
For a given inertial observer—one with a straight worldline
—the structure of position space, that is, of the leaf
of the foliation of spacetime she encounters at time
t, is the same for
or
: it is a 3-dimensional affine space whose underlying vector space
is rotationally invariant. This is apparent from the expressions for Lorentz transformations
on
and homogeneous Galilei transformations
on
exhibited in
Section 2.1 and
Section 2.2, respectively: the symmetry transformations
and
both reduce to a rotation of
for vanishing boost parameter
. As a rotationally invariant vector space,
is naturally endowed with a flat Euclid metric defining the usual scalar product; call it
.
While much is the same for the split of
and
into space and time for a single inertial observer, an important difference becomes apparent in comparing these splits for different inertial observers. In a conventional spacetime diagram for
, the fiducial time and space axes—here aligned with
and
—are vertical and horizontal, respectively, and for
, the trace of the null cone makes a
angle midway between them. Under a pure boost of magnitude
u aligned with
, and temporarily setting
, the basis relation (see
Appendix A)
yields
for the transformation of the basis vectors. According to these equations, the time axis and the first space axis of another inertial observer moving with speed
u relative to the fiducial observer undergo a pseudo-rotation governed by
, each tilting towards the null cone by an equal amount so as to maintain (pseudo-)orthogonality (see, for instance, Figures 1 and 2 of Ref. [
30]). That the worldline
of the second observer is tilted relative to
according to its velocity is expected; the new result of Poincaré physics is that the hyperplane
, which reflects simultaneity according to the second observer at
, is tilted relative to
. This is the geometric origin of the relativity of simultaneity. For
, the degeneration of the spacetime metric
and its inverse
into the covector
and the
tensor
, related by the degeneracy condition of Equation (
21), can be understood by returning
c to its value in, say, SI units; this is a large number representing the rapid speed of light propagation as perceived in ordinary human experience. Then, the trace of the null cone opens wide (large distance for small time) until it nearly coincides with the horizontal axis; indeed, this coincidence is complete in the limit
. What was the invariant null cone for
is now the invariant hypersurface
, corresponding to the invariant covector
on
. The basis vectors
transformed by a pure Galilei boost
confirm that
is tilted but
is not (see, for instance, Figures 3 and 4 of Ref. [
30]). This is the geometric origin of absolute simultaneity in Galilei physics, and the corresponding ‘floppiness’ of straight inertial observer worldlines relative to a fixed surface of simultaneity results in the degeneracy of
.
Having split spacetime into space and time for a given observer, a means of decomposing tensor fields on or into pieces ‘pointing along ’ and ‘tangent to ’ is needed. On these are ‘orthogonal decompositions’ thanks to the spacetime metric ; this allows flexibility in the raising and lowering of indices of decomposed pieces, but is not the fundamental source of the uniqueness of the decomposition. On unique decompositions are still possible even though they are not ‘orthogonal’, because the uniqueness that matters is the uniqueness inherent to expansion with respect to a particular basis. As will be seen explicitly below, the point is that even without a metric, one always has an identity operator that preserves an entire vector and a dual basis that can be used to pick off particular pieces.
What is needed is a projection operator
that subtracts off the portion of a vector field lying along
, which is parallel to
; the result is necessarily a vector tangent to
, because
is spanned by the remaining basis vectors
. To emphasize the status of
as the value of the 4-velocity field of the fiducial observer associated with the selected Minkowski or Galilei basis, label it
and call it the ‘fiducial observer vector’. (The notion of 4-velocity will be introduced in
Section 3.1). The other key element is the dual-basis covector
, for which
and
. Thus,
corresponds to a covector field ‘pointing completely away from’
, in the sense that it vanishes when evaluated on any vector tangent to
. Because
is a level surface of the coordinate function
t, the covector
also corresponds to the exterior derivative or (covariant) gradient of this function. Thus, at each point of
, the relation
holds, with components
in the selected basis. With
and
, the dual-basis relationship reads
Thus, any vector field
on
or
can be uniquely decomposed as
with
Here,
where
is the identity tensor. This is the desired projection operator: the second term in Equation (
26) removes the part along
, leaving a vector field tangent to
.
The same projection operator can be used to decompose covector fields on
and
. Writing
with
decomposes
into pieces that do and do not vanish when evaluated on a vector parallel to
, namely,
and
, respectively.
For
, the ‘covector pointing away’
can be characterized in terms of a ‘dual-observer vector’
the metric dual of
characterized by
This relation is what motivates the names ‘dual-observer vector’ for
and ‘dual-observer covector’ for
. Moreover, because
while
, it is clear from the non-degeneracy of
that
that is, that the dual-observer vector is a rescaled and oppositely directed version of the observer vector, and unlike a 4-velocity is characterized by
In terms of the dual-observer covector
, Equation (
26) becomes
as follows from Equations (
26) and (
27).
For
, the ‘covector pointing away’
is already a fundamental structure, the previously encountered invariant covector
: any Galilei basis must conform to this fundamental structure by having
. With
and
, the dual-basis relationship requires
and the projection operator reads
as follows from Equation (
26).
Naturally one has a different projection operator
relative to a different Minkowski or Galilei observer vector
. In the case of
,
with the dual-observer vector
pseudo-rotating along with
to maintain the pseudo-orthogonality of
and
. In contrast, for
the projection operator relative to a different Galilei observer includes the same invariant covector
:
That is, a
different projection is made to the
same invariant hypersurface
embodied by the covector
pointing away from it. Despite this ‘degeneracy’, the decompositions relative to
and
are unique.
In summary, appropriate contractions project out desired parts of decomposed tensors. Contraction of vectors and the ‘vector-like parts’ (contravariant indices) of more general tensors with or projects out the ‘time’ parts parallel to , while contraction with projects out the ‘space’ parts belonging to . Contraction of covectors and the ‘covector-like parts’ (covariant indices) of more general tensors with projects out the ‘time’ parts that do not vanish when evaluated on vectors parallel to , while contraction with projects out the ‘space’ parts that vanish when evaluated on vectors parallel to .
Another issue is the question of how to extend a multilinear form originally defined only on
to
or
. A case in point is the Euclid metric
defined on
by virtue of its rotational invariance, as described above. An extension to
or
, denoted by the same symbol
, is defined with the help of the projection operator defined above, which is used to enforce tangency to
. For
in
or
, the
tensor
on
or
is defined by
The
on the left is the tensor extended to
or
, and the
on the right is the original tensor on
. The notation may seem a bit odd, but it evades a proliferation of symbols, and the meaning is generally clear from the context.
The notational subtleties of the various tensors
,
, and
can now be explained. When denoting tensors on
, these are simply the 3-metric; the 3-metric with first index raised, that is, the identity tensor on
; and the inverse 3-metric. When denoting tensors on
, it turns out that
and the versions adorned with arrows reflect index raising with
. (On
the identity tensor is related to the metric and its inverse by raising an index of the metric itself:
or
). When denoting tensors on
, each of the tensors
,
, and
can be defined as distinct projection tensors, but they are not related by metric duality; the arrows must simply be considered integral to the symbols defining those particular tensors.
A word on a unified presentation of the volume form
on
and
is in order. This is defined in terms of a right-handed Minkowski or Galilei basis, respectively, with respect to which the components are
in either case, where the right-hand side is the alternating (permutation) symbol. Given some fiducial Minkowski or Galilei basis, it may be useful to employ coordinates that include curvilinear space coordinates on
and/or observers in (generally non-inertial) motion relative to the fiducial observer, with (generally curved) worldlines exhibiting a 3-velocity
with rectangular components given by
according to the fiducial Minkowski or Galilei observer. The matrix representing a basis change governing this case on either
or
is of the form
with
and
. Since the 3-metric
on
is represented by the identity matrix with respect to a Minkowski or Galilei basis, the matrix γ representing it in the curvilinear/moving basis has components
so that γ
. Thus,
, where
γ, and the components of the volume form on either
or
become
according to Equation (
A11). (As an aside, in the case of
, it is interesting to note what the matrix
representing
becomes under such a transformation). It may help to put
in the space part of η, and note that the 3-column
collects the components of
relative to the curvilinear basis, with the 3-row
representing the covector
on
. The result is
which up to setting the lapse function
is of the same form as in the
(traditionally,
) formalism of Poincaré general relativity).
It is useful to consider further consequences of spacetime foliation for the spacetime volume form
and the spacetime exterior derivative operator
. Because
, the contraction
yields the space volume form
on
, with components
Conversely, because
and
on
and
, respectively,
is a useful factorization of the spacetime volume form
. For vectors
, the cross-product
familiar from
with Euclid metric
—or more precisely, the covector
, that is, the metric dual of
—is defined by
The spacetime exterior derivative, represented symbolically as
as in Equation (
A12), breaks naturally into
where
is the exterior derivative operator on
. Combining the volume form and the exterior derivative on
enables contact with the vector calculus familiar on
with Euclid metric
. For a vector field
tangent to
, the expression
defines
, the 3-divergence of
, and the expression
where
, defines
, the curl of
.
Finally, a word about causality. Recall that for , vectors are classified as timelike for , spacelike for , and null for . These correspond to vectors ‘inside’ the null cone, ‘outside’ the null cone, and ‘on’ the null cone, respectively. It is well known that for two events separated by a spacelike vector, it is possible for the sign of the time interval between them to be reversed by a Lorentz transformation. In contrast, while simultaneity is relative for two events separated by a timelike vector, the time ordering of the events is invariant. Particles, and signals transmitted by field disturbances, must have timelike worldlines (curves in with tangent vectors everywhere timelike) or straight null worldlines (curves in with unchanging null tangent vector) directed toward the future. However, in the case of , the distinction between spacelike and null vectors vanishes as the past and future light cones merge with the invariant surface of simultaneity. Time intervals between events are invariant under Galilei transformations. There is no upper limit to the speed of particles or of signals transmitted by field disturbances, and indeed, forces affecting instantaneous action at a distance are not excluded. For , vectors are classified as timelike for and null/spacelike for .