1. Introduction
The mathematical
symmetries related to Lie groups are key elements of the standard model of elementary particle physics (SM) [
1,
2], which describes the electroweak (
) and strong
interactions among lepton and quark fermions. The way of including the associated gauge bosons in the quantum mechanical framework of modern field theory was first described in the seminal paper by Yang and Mills [
3]. However, the physical origin of these important symmetries has remained somewhat obscure or of a mainly empirical nature until today. They were chosen in the case of
to describe the effects of parity violation by assuming that weak interactions only involve the left-handed fields [
4] of massless fermions or were just introduced ad hoc in the case of
by Gell-Mann [
5] in order to bring some schematics into the complex variety of hadrons found empirically in accelerator experiments.
The aim of this paper is to cast somewhat more light on the genuine nature of the
symmetry (weak interactions) and
symmetry (strong interactions) in particle physics and mathematical physics with the help of (1) the mathematical and physical principles of Lorentz invariance, (2) the two associated Casimir operators of the Lorentz group (hereafter, LG), and (3) their relations to the notion of isospin. This paper is meant to extend and deepen the recent work by Marsch and Narita [
6,
7] on these topics. A similar approach to understanding space-time hidden symmetry was used by Hestenes in his development of space-time algebra, e.g., interpreting electron spin direction and spin magnitude as geometric properties [
8].
We first prepare some necessary ingredients of the theory in a section on the generators of the Lorentz group for the transformation of four-vectors in Minkowski space-time. Then, new versions of an extended Dirac equation and the associated Clifford algebra are presented. The related physical spin and rapidity of the resulting fermion spinor field are discussed. A version of the extended Dirac equation on the Weyl basis is also derived in the
Appendix A. The notion of isospin is introduced and derived in the context of the spinorial representation of the Lorentz group. The consequences for the origin of
and the related fermion lepton doublet, such as an electron and a neutrino, are elucidated.
Furthermore, on the basis of the Pauli–Lubański operator, we develop a concept for the possible origin of the color symmetry. It may stem from a new intrinsic isospin of 1/2, which is assumed to describe the lepton–quark fermion doublet. Thereby, use is made of the powerful concept of spinor helicity by employing the Pauli matrices, which permit one to decompose the square of any three-vector into a Pauli matrix product. The related symmetry turns out to be , which contains as a subgroup.
Finally, we discuss some possible schemes for the unification of these symmetries. A short discussion section concludes the paper.
2. The Generators of the Lorentz Group
To begin with, we discuss the generators of the Lorentz group in the vectorial representation. The Lie algebra for the Lorentz group [
9,
10,
11] is decomposed into two commuting independent sub-algebras as
. Here,
denotes the algebra associated with the special orthogonal group
, and
denotes the algebra associated with the special unitary group
. That is, they define the generators of the irreducible
representation of the LG. We introduce the standard hermitian rotation operator
and the anti-hermitian boost operator
. These three-vector generators of the LG are component-wise
matrices in Minkowski space-time. According to their definitions, the rotation and boost operators obey the well-known linked three-vector equations of the Lorentz algebra:
where the cross-product sign stands as an abbreviation for the commutator
. The four-vector LG generators can be written as tensors in Minkowski space-time [
12], a well-known subject. In the
Appendix A, the components of the matrix vectors
and
are quoted for completeness and reference. From them, we straightforwardly obtain the absolute value of the rotation operator as
and that of the boost matrix as
Consequently, we find
. Then, for any three-vector
, one obtains
and similarly, one obtains
Therefore, one finds that
. Moreover, we obtain the relation
This important result will play a key role and be exploited in the subsequent section.
3. New Versions of the Dirac Equation and Clifford Algebra
Recently, Marsch and Narita [
13] derived an extended Dirac equation on the basis of the vector representation of the Lorentz group. In this section, we present a new route to obtain such an extension of the standard Dirac equation [
14]. Historically, the key question was that of how to derive a linear relativistic wave equation. This task requires linearizing the kinetic energy for a massive particle, which goes with the momentum squared in the basic relativistic dispersion relation and is given by
This is the so-called mass–shell condition for a free particle of mass
m, energy
E, and momentum
, and it is just the first Casimir operator of the Lorentz group. Here, we use the covariant four-momentum
as a variable in Fourier space. According to relativistic quantum mechanics [
1,
2], the covariant four-momentum operator is associated with a temporal or spatial derivative:
We abbreviate the contravariant space-time coordinates with x and conveniently use the units for which and . The differential operator will be used later when we discuss the desired relativistic wave equation.
If we now multiply the above relativistic dispersion relation (
7) by the four-dimensional unit matrix, by means of (
6), we obtain the algebraic matrix result:
This equation should then be linearized in the energy and momentum variables
E and
. For that purpose, we introduce three
Pauli-type matrices:
which are real, obey
and
, and mutually anticommute with each other. Thus, we can write the linearized dispersion equation as follows:
When squaring this equation and exploiting Equations (
6) and (
9), we retain the original dispersion relation (
7) times the
unit matrix, which corresponds to the eight degrees of freedom obtained by the linearization procedure. Two of them belong to the lambda matrices. Their two dimensions relate to the two possible signs of the energy and, thus, correspond to the particle and antiparticle, as in the standard Dirac equation. The other four degrees of freedom stem from the space-time coordinates of Minkowski space and the Lorentz transformation. Their physical meaning will become clear below.
It is convenient and appropriate to rewrite Equation (
11) in covariant form by introducing the subsequent Lambda matrices corresponding to Dirac’s gamma matrices. We obtain
They obey
and
, where use was made of Equations (
4) and (
5). The three space components and the single time component of the contravariant Lambda four-vector
anticommute, which is obvious for
and
, and this follows for the
components after some lengthy calculations with the help of the Lorentz algebra of the three-vectors
and
after Equation (
1). Therefore, as shown in the subsequent section, we then obtain the Clifford algebra for the Lambdas:
from which it follows that one can take, so to speak, the root of (
7) and obtain
Finally, by inserting the differential operator of Equation (
8) here, we obtain the Dirac equation in a new non-standard form as
The spinor wave function has eight components, of which two correspond to particles and antiparticles. The other four are related to their associated spin and isospin doublets, as shown in the next section.
We would like to emphasize that other equivalent forms of the Lambda four-vector
are possible. They have been extensively discussed by Marsch and Narita [
6]. In the present framework, there are essentially three options, which correspond to the possible ways in which the
matrices can occur in Equation (
11). They are obtained through the cyclic permutation of the lambda positions in that equation. It was argued in [
6,
15] that these three possibilities reflect the physical fact that there exist exactly three families of fermions in the SM. The number three just corresponds to the three dimensions of the real physical space, which are revealed by the three Pauli matrices (or lambdas in our case) as the generators of the rotation group in its fundamental bi-spinor representation.
4. Spin and Rapidity
The aim of this section is to first calculate the spinorial analogs of the Lorentz group generators
and
in Minkowski space. All one needs to do is use the properties of the Clifford algebra (
13). The cartesian spin
x component is given by
and the y and z components are obtained through cyclic index permutation. Multiplication of the three spin components gives
Thereby, we only used for x, y, z and the fact that the Lambdas anticommute. Moreover, one gets and the same result for the y and z components. In conclusion, here, we are dealing with a spin one-half fermion.
The rapidity or boost operator is adequately defined as
from which it follows that
Similarly, we also find that
As a result of all of these calculations, we obtain, in full analogy to Equation (
1), the spinorial Lorentz algebra
Let us then evaluate more explicitly the components of spin and rapidity matrix vectors. For the spin x component (the y and z components are obtained through cyclic index permutations), we get the matrix
and for the rapidity, we obtain the matrix three-vector
At this point, we should explicitly calculate the involved matrices
. By using their expressions in Equations (
A1) and (
A2) of the
Appendix A, with the definition
, we obtain the three real Lambda
matrices, which are also quoted in the
Appendix A. In terms of these matrices, we can rewrite the spatial Lambda matrix vector in the form
The components (with
x, y, z) of the Lambda vector obey the important metric condition:
which guarantees that the spatial Lambda components obey the Clifford algebra. At this point, we can also define the chiral Lambda matrix:
Multiplying the above product of the lambdas out, one obtains
where the new diagonal matrix Delta corresponds to the metric in Minkowski space and reads as follows:
, which obeys
. This permits one to switch the sign of the superscripts attached to the lambdas because one finds that
. In terms of the matrix vector lambda, the spin and rapidity can be written as
Upon the insertion of the expressions of (
25), we obtain the spin and rapidity vectors in the form
The connection of the Sigmas to the previously used vector lambdas is as follows:
. These matrices are defined in terms of
and are quoted in the
Appendix A. Both of the Pauli-type (but
) matrices that appear here obey
, yet for opposite superscripts, they commute component-wise with each other:
. Moreover, we get
. One finds that
, which is a relation that is very useful for validating the commutation relations of the spin and isospin addressed below. Finally, note that
. This is required to show that
is hermitian.
Using the Sigma matrices is appropriate to rewrite the previous Lambda matrices of Equation (
12) in a new form. We obtain
This version of the extended Dirac equation is formally the same as that derived recently by Marsch and Narita [
6], yet it differs slightly in the definition of the Sigma matrices. However, their way of obtaining this result was also somewhat different. In the
Appendix A of their paper, they provided six equivalent versions of Equation (
31), which are all connected by similarity transformations.
5. Isospin and
Using the results on the Sigma matrices of the previous section, we are now in the position to define a new entity, the isospin, as follows:
This obviously commutes with the spin because Sigma matrices with opposite superscripts commute. However, it is less trivial to show that
, yet when using the algebraic properties of
, with some lengthy matrix multiplications, one can show that this is the case. Moreover, we find more easily that
, and therefore,
,
, and
can have common eigenfunctions in the rest frame of the fermion. Consequently, the multiplet state
can, in that frame, be fully determined by the quantum numbers of those matrix operators. Since their squares are proportional to the unit matrix
, according to Equations (
12), (
A7), and (
A8), we obtain the eigenvalues
for the energies of the Dirac Equation (
14) in the fermion rest frame corresponding to particles and antiparticles. The values
for the spin (or isospin) are obtained as evaluated by means of their block-diagonal
x components, which correspond to the physical spin up and down states and to the up and down components of the isospin doublet. Marsch and Narita [
6] extensively discussed a possible gauge theory based on the isospin (
32) to which we refer and which, therefore, shall not be elucidated any further here.
6. Intrinsic Spin and
In their seminal work, Wigner [
9] and Bargman and Wigner [
10] emphasized the important role played by the two Casimir operators of the Poincaré and Lorentz group, namely, the four-momentum squared (
7) and the square of the Pauli–Lubański operator
, which involves the intrinsic spin
of the particle, where spin just means a physical quantity that obeys the usual angular momentum algebra
The square of the Pauli–Lubański operator is the product of the squared mass with
, and thus, for a massive particle, it reads:
Here, the squares of the space-time Fourier variables
E and
and the intrinsic isospin three-vector
are connected in a simple multiplicative way, which is consistent with the constraint placed by the Coleman–Mandula theorem [
16]. The Casimir operators turn out to be relevant in the derivation of covariant and first-order (with respect to the derivatives) relativistic wave equations for massive charged particles of arbitrary intrinsic spin
.
It was Dirac [
14] who noticed that the first Casimir operator (
7) can be written as the square of an expression that is linear
with the help of the famous contravariant gamma matrices that he introduced. Here, we have extended his approach to include the
-isospin related to the Lorentz group in the Lambda matrices of (
12), which leads to (
14). Furthermore, Dirac [
17] later also noticed how one can write the square of the intrinsic spin
with the quantum number
l as the following product:
which uses the concept of what we now call [
2] spinor helicity
. To define it, we use the standard Pauli matrix three-vector
. We can thus express the three-vector
as a bi-spinor in
-matrix form. Then, we obtain the following result for the spinor helicity:
It is important to note that for any intrinsic spin operator
in its standard representation, the x and z components are real, and the y component is purely imaginary. Consequently, the above spinor helicity is a real (
)-matrix operator and has real eigenvalues. We abbreviate the two operators involved in (
35) as
It is obvious that
. Consequently, we can rewrite, with the help of (
14) and (
35), the Pauli–Lubański operator (
34) in an algebraically new form as
We recall that all operators appearing in this equation commute with each other. In addition to the eight kinetic degrees of freedom associated with the particle–antiparticle doublet, the spin up and down doublet, and the isospin up and down doublet, we now have the spinor helicity multiplet of with degrees of freedom. This is reflected in the product of the unit matrices at the mass term.
However, in what follows, we shall restrict the discussion to
, i.e., we shall consider only two intrinsic configurational or species degrees of freedom. Empirically, the spin-one-half fermions come in only two species, namely, as leptons and quarks, which we can accommodate in the doublet given by
. It was shown by Marsch and Narita [
7] that the operators
can, for
, be brought into diagonal forms based on their four common orthogonal eigenfunctions, and thus, they take the simple form
It is interesting that
is apart from the normalization identical to the fifteenth element of the
Lie group.
can be considered as the hypercharge matrix operator of a unified lepton–quark gauge theory, which was discussed by Marsch and Narita [
18] in their study of the connections existing in the Dirac equation between the Clifford algebra of Lorentz invariance and the Lie algebra of
gauge symmetry. Apparently, the state space of the spinor helicity operator
decomposes into two orthogonal Hilbert spaces with dimensions of three and one. The corresponding wave equation applies to the four-component super-spinor
, where the spinor fields
are solutions of the extended Dirac wave Equation (
15). By using Equation (
37), we can thus write a new second-order (in
) wave equation for
as follows:
which can, again, be decomposed into two linear wave equations:
We recall that all operators commute with each other and relate to the 32 independent inner and kinetic degrees of freedom. Close inspection of the above two equations shows that they are, in fact, identical if we choose
and
for
.
then just act as unit matrices in their respective subspaces, and in this way, we obtain two entirely decoupled Dirac equations. One has three intrinsic degrees of freedom, which we associate with the three colors of the quarks, and the other has one intrinsic degree of freedom, which we associate with the “single color" of the lepton. So, we write the corresponding Dirac equations as
Here, the spinors are
for the lepton and
for the quarks, with the usual color indices of red, green, and blue being used as in the SM. Of course, the related symmetry gauge groups are
for the leptons and
for the quarks. In conclusion, each of the four fields associated with the four degrees of freedom of the spinor helicity obeys the extended Dirac Equation (
15) that yields the
symmetry for the isospin doublet. Thus, we have covered all three known fundamental fermion symmetries of the SM. The color symmetry is a consequence of the Pauli–Lubański operator involving the simplest possible isospin with quantum number
, corresponding to a fermion doublet, which splits into four states by means of the spinor helicity mechanism being applied to
according to (
35).
7. Symmetry Unification Schemes
In the previous section, we showed that the simplest possible internal isospin
for a fermion doublet with a quantum number of
yields, via the spinor helicity formalism, a quadruplet assembling three colored quarks and a single lepton into a four-component super-spinor,
, where each of the four spinor fields obeys the extended Dirac wave Equation (
15). So, we can concisely rewrite the unified extended Dirac equation as
This equation has the symmetry
stemming from Lorentz invariance, which determines the four-vector
, and the symmetry
stemming from the spinor helicity
of (
36), which is associated with the internal isospin. It formally describes a quark–lepton couple that splits into a quadruplet involving a single lepton (with symmetry
) and three colored quarks (with symmetry
). As a result, we obtain the unified fermion symmetry
. Such a unification model was developed before by Marsch and Narita [
19,
20], but on the basis of combinatorial symmetries of the standard Dirac equation, which comes in two main versions, as evaluated on the basis of Dirac and Weyl.
The
symmetry suggested here is equivalent to the symmetry of the orthogonal
group as described by Fritzsch [
21,
22]. The related 16-component spinor representation of
includes all spin one-half fermions of the first generation in the SM. The symmetry
was proposed long ago by Pati and Salam [
23,
24], who considered the lepton number as the fourth color, which was long then an ad hoc assumption, whereas here, we give a good physical reason for such a quadruplet. It is based on the spinor helicity of an intrinsic spin one-half in connection with the Pauli–Lubański operator, which is the second Casimir operator of the Lorentz group.
However, the unification scheme employing
requires 15 gauge fields, which are sometimes called leptoquarks, to link leptons and quarks in the multiplet. There is no observational evidence for the related gauge bosons, and therefore, it is desirable to reduce their number. Such a reduction is offered by the use of the less complex
instead of
in the spirit of Occam’s razor. We suggest installing the
symmetry, which requires nine fewer gauge fields than
. The mathematical reason is that the spinor helicity multiplet involves a real operator acting in the Euclidian space spanned by the four simplest possible and real eigenfunctions
,
,
and
. The six purely imaginary matrices of the
generators are quoted in the
Appendix A. We may assemble them into two three-vectors and name them
or
. They obey the linked algebra
This algebra resembles the Lorentz algebra of Equation (
1), aside from a minus sign. Of course, the group
includes
as a subgroup. Using it instead of
means that the symmetry of the strong SM interactions would be considerably simplified and would not require eight gluons, but just three new gauge bosons. Admittedly, this is a rather speculative new scheme that needs further scientific investigation.
8. Summary and Conclusions
In this paper, we outlined a new route to the extended Dirac equation and its symmetries on the basis of the four-vector representation of the Lorentz group (LG). It turned out that the symmetry emerges naturally via the linearization of the first Casimir operator of the LG and is intimately connected with the isospin. This is an outcome of the chiral nature of the LG. Similarly, the intrinsic isospin related to the second Casimir operator can be linearized by means of the spinor helicity formalism, which yields two independent singlet and triplet fermion states. We suggest interpreting them as being related to the symmetry and the lepton or to the symmetry and the three colored quarks. These results indicate the very different origins and physical natures of these basic symmetries of the SM, but they do not genuinely support the idea of grand unification. However, when combining them in the product group , and then by combining all groups into , one gets a combined symmetry scheme that seems to support unification by the group . It is found that the smaller group, instead of , also seems appropriate for achieving unification, and it offers the advantage that it simplifies the theory and reduces the number of gauge fields required.
Concerning the basic wave Equation (
40) based on the second Casimir operator, we stress again that the space-time differential operator
, the algebraic operators
of the Clifford algebra related to the Lorentz group, and, finally, the operators
related to the
symmetry of the intrinsic spin
do all commute with each other, and therefore, their sequence when operating on the state spinor
does not matter. Its full dimension is
according to Equation (
40). At this point, we are just dealing with a Klein–Gordon equation for each of the many components of
. However, when we linearize that equation with respect to
, we have to use the matrix representations of the algebraic operators. They are then connected through tensor multiplication, where their sequence does matter.
We have two options for constructing the resulting extended Dirac equation. Either the intrinsic spin operator acts first from the left on
and is followed the space-time operators, or vice versa. In the first case, the field equation looks like a single Dirac equation with many internal degrees of freedom; in the second case, it looks like a multiplet of Dirac spinors assembled in an
N-tuple of the
symmetry group in matrix representation. The latter case gives the standard picture of the Yang–Mills theory. The mathematical connections between the two approaches were extensively discussed in [
18]. The way that is is advantageous needs to be worked out and may depend on the symmetry group involved. The analysis of this problem is beyond the scope of this paper.