1. Introduction
After the discovery of the muon in 1936, Rabi is said to have exclaimed: “who ordered that?” But to this day, the same question can still be asked of the proton and neutron which are at the nuclear heart of the observed material universe, and of the other baryons. We do have a very good understanding that the proton and neutron and other baryons are composed of three confined “colored” quarks in the fundamental representation of SU(3), with highly non-linear gluonic interactions among these quarks, wherein baryons interact with one another by exchanging a variety of mesons. But we still do not have a good dynamic answer, rooted in fundamental physics principles, to Rabi’s very basic question: who ordered the baryons? Nor is there a good understanding of the dynamic origin of quark and gluon confinement.
One of the most notable features of Maxwell’s U(1) differential and integral equations
and
is that magnetic monopoles do not exist and that there is never a net flux of magnetic fields across any closed two-dimensional spatial surface. But in the dynamic Maxwell equations for an SU(N) Yang–Mills [
1] theory of non-commuting gauge fields these monopoles do exist, and there is a net flux of the analogs to magnetic fields through closed surfaces surrounding and within the monopole. Given the extraordinary success of Yang–Mills gauge theory in describing weak and strong interactions, and its presumably significant role in any “grand unified theory” which might become generally accepted in the future, the question how the magnetic monopoles of Yang–Mills might manifest themselves in the natural world must be given due consideration.
What is shown here is that when the Yang–Mills (YM) analog of Maxwell’s electric charge equation is inverted, then inserted into the analog of Maxwell’s magnetic equation for what are now non-vanishing monopoles—effectively combining both of Maxwell’s covariant equations into a single indivisible equation—the non-perturbative state of these monopoles contains exactly 3 fermions arising from the monopole density being an antisymmetric tensor of rank 3. Treating the monopole as a “system” to which the Exclusion Principle must be applied, and so using SU(3) to enforce an exclusive quantum state for each of these three fermions, and following a form of spontaneous symmetry breaking which moves a degree of freedom from gauge bosons to fermions, makes the bosons massless and renders SU(3) an exact symmetry, these YM magnetic monopoles acquire the same SU(3) antisymmetric color singlet wavefunction as baryons, and the magnetic field analogs which flow through the monopole surfaces obtain the same symmetric color singlet wavefunction as mesons. This enables the three fermion states inside the monopole to be identified as confined colored quarks, the gauge bosons to be identified as confined colored gluons, the YM magnetic monopoles to be identified as baryons, the mesons to be identified as quanta of the YM magnetic fields which net flow in and out of these monopoles, and the symmetry breaking to be identified with ultra-high-energy hadronization from a plasma of free quarks and gluons.
From this we also answer Rabi’s question: baryons were ordered by Maxwell, Yang and Mills, with an assist from Weyl via gauge theory itself, from Fermi–Dirac–Pauli via Dirac’s quantum theory of the electron and the Exclusion Principle, from Gauss’s law for fluxes of magnetic fields through closed surfaces, from Einstein’s generally covariant formulation of magnetic monopoles as third-rank antisymmetric tensors, and with credit to Hamilton for pioneering non-commuting quaternions which later became the foundation of YM gauge theory in the form of Pauli matrices and their extension to SU(N).
Finally, we employ the foregoing development to fill the Yang–Mills Mass Gap [
2].
2. A Brief Review of Maxwell’s Equations Using Duality and Differential Forms
We start with Maxwell’s equations in covariant form, in flat spacetime. A gauge field/vector potential
with dimensionality of energy/charge is used to first define a field strength tensor
which in turn is used in the two differential Maxwell equations:
Additionally, from (1a) we obtain the continuity equation:
Above, is a current density four-vector in which has dimensions of charge per volume (charge density), is the vacuum permeability of free space, and is a third-rank antisymmetric tensor defining a magnetic charge (monopole) current density.
In flat spacetime where
and the Riemann tensor
, the commutator
for covariant derivatives simplifies to
with partial derivatives commuting. As a result, in (1a) we obtain the configuration space operator
operating on
; and in (1b) we find the magnetic monopole density
, by identity, which is understood to mean there are no isolated magnetic charges in nature (setting aside the monopoles theorized as a possibility by Dirac in [
3]). Likewise, (2) is also true by identity and governs the conservation of electric charge sources. If we impose the covariant gauge condition
then (1a) simplifies to
. If, instead, we introduce a Proca mass
so (1a) becomes
, then because of (2) we find that
, thus
, which is no longer a gauge condition but a continuity requirement.
Duality and differential forms express the differential Maxwell’s Equations (1) in a compact form which simplifies obtaining the integral Maxwell’s equations. Using duality reviewed, e.g., at pp. 87–89 of [
4], we may rewrite these as:
Then, via differential forms reviewed, e.g., in Chapter 4 of [
4] and pp. 218–220 of [
5], we compact:
Likewise, the continuity Equation (2) becomes:
Both (4b) and (5) apply the differential forms relation
that the exterior derivative of the exterior derivative is zero, see, e.g., Section 4.6 of [
4]. (Above, we adopt the convention that the indexes of a dual object match those on the left side of the Levi-Civita tensor, so
,
and
for any vector
and antisymmetric tensors
and
. This is to ensure that (3a) and (4a) maintain the same relative sign between
and
.)
The form relations (4) are ideal for casting Maxwell’s equations into integral form. Specifically, for any
p-form
H on a
-dimensional manifold
M with boundary
and with
:
see, e.g., pp. 218–220 of [
5]. So, using (4) in (6), we arrive at the integral Maxwell’s equations:
In Section 18.3 of [
6], Close uses Gauss’s theorem for electric charge contained in (7a) to “consider the chromodynamics case which is analogous to the above.” Similarly, (7b) contains Gauss’s law for magnetism,
. The reason these surface integrals are of interest is because one way to state the confinement of net color charge inside a baryon is via the schematic expression
over the baryon
B surface. Likewise, with only color-neutral objects (e.g., mesons) net flowing across baryon surfaces, we may write
. Because confinement is fundamentally about what can and cannot flow across baryon surfaces, Gauss’s theorem will be of keen interest in the development to follow.
3. Maxwell’s Yang–Mills Canonic Equations
Maxwell’s electrodynamics is a U(1) abelian gauge theory, so named because its gauge fields are commuting, . Yang–Mills gauge theory, on the other hand, uses gauge fields we denote generally as to distinguish from AV, which are non-commuting, . Specifically, for a simple group SU(N) with traceless N×N Hermitian generators with normalized to for each , and a commutator , these gauge fields are constructed via and so are likewise N×N Hermitian matrices. We may use the commutator to find that . Likewise, while , it can be generally shown that for any , , which means that for a product of two gauge fields. Although weak interactions use SU(2) and strong interactions SU(3), at the outset we shall not examine any particular gauge groups. Our immediate interest is to develop the counterparts to Maxwell’s equations generally, for any SU(N) Yang–Mills (YM) gauge theory.
In general, a gauge-covariant derivative is defined by
, where
g is a charge strength. We then use these to define a field strength tensor in natural units
by:
Were these gauge fields to commute, that is, were we to have
, this would reduce to
which recovers the template
for U(1) abelian electrodynamics noted prior to (1). Of course, however, these
are non-commuting. Using differential forms, the above compacts to:
In Yang–Mills gauge theory we obtain dynamic equations starting with a canonic replacement
of ordinary with gauge-covariant derivatives. Likewise promoting electric and magnetic charge densities
and
to capitalization, Maxwell’s (1) become: (Note that
remains the constant factor in (10) below as it was in (2) et seq. In U(1) electrodynamics
is the ratio between the running fine structure coupling
and charge strength
e. For SU(N) it remains the ratio
between dimensionless running couplings
and charge strengths
g, generally.)
Likewise, the continuity Equation (2) generalizes to:
which in momentum space
becomes
. Unlike (1b) and (2), the zeros above arise not from derivative commutation, but from the Jacobian identity
and
, in view of the further identities
and
operating on any field
, see, e.g., [
7], with careful attention given to the product rule. In the bottom line of (11), we define
to be a “perturbation tensor,” so named because its trace
is the standard expression for the perturbation in the Klein–Gordon (relativistic Schrödinger) equation, and houses the difference
between ordinary and gauge-covariant Laplacians.
4. Maxwell’s Yang–Mills Dynamic Equations
One of the major lessons of the General Theory of Relativity [
8] which Hermann Weyl later adapted to gauge theory [
9,
10,
11] is the canonic prescription of invariantly maintaining the original form of a field equation or Lagrangian density while merely replacing all ordinary derivatives with suitable covariant derivatives. Then, by separating the original equation or Lagrangian and its ordinary derivatives from the new terms arising via covariant derivatives, we ascertain the physical,
dynamic impacts of this prescription. For example, in General Relativity the covariant promotion of derivatives in the flat spacetime commutator
produces
with the Riemann tensor; and Newton’s first law of motion written as
with a four velocity
produces
, i.e.,
, which is the gravitational geodesic equation. And in gauge theory, the Klein–Gordon equation
promotes to
, i.e.,
with the perturbation
noted after (11); while Dirac’s equation
promotes to
, i.e.,
with a Dirac (D) perturbation
. In all instances, this canonic prescription is to (1) replace ordinary with covariant derivatives, then (2) segregate the original equation with ordinary derivatives to see the dynamic impact, whereby what was originally a “zero” becomes a “non- zero.” In
Section 3 we took the first step of promoting ordinary derivatives in Maxwell’s equations to covariant derivatives to obtain canonic equations. Now, we segregate the original Maxwell equations to see the dynamic Yang–Mills content, and what the zeros become as non-zeros.
To do so, we first define lowercase-denoted electric and magnetic charge densities by
and
exactly as in the Maxwell equations (1), using ordinary derivatives of the field strength
, which is now the Yang–Mills (8). Then, we use
and
in (10) which have exactly the same form but for
and
, together with
in natural units, to separate the ordinary derivatives of
, as such:
Comparing (12) with (10) we see the foregoing implies definitions and between lowercase and uppercase source densities, also mindful from (10b) that . In (12b) the ordinary derivative commutator cancels terms just as in Maxwell’s monopole Equation (1b). What is very important, however, is that the Yang–Mills dynamic Equation (12b) contains a non-vanishing magnetic monopole density , versus in (1b) for Maxwell’s U(1) electrodynamics. The “zero,” which here has turned into a “nonzero,” is the magnetic monopole density .
Cast into differential forms, (12) may be compacted to (compare (4)):
These are Maxwell’s dynamic equations in differential forms, for any Yang–Mills gauge group SU(N). Although we continue to apply the exterior calculus relation to remove , again, these magnetic monopoles do not zero out entirely. There remains a residual term which arises directly out of the two-form , that is, directly from the non-commuting nature of Yang–Mills gauge theories.
Another reason it is important to segregate ordinary derivatives is because the
d in
in (6) remains an ordinary, geometric, calculus derivative, and does not change for Yang–Mills theory. So, having placed (12) into the compact form (13) with segregated ordinary derivatives, we may use (6) to recast these into integral form:
These are the Yang–Mills counterparts to (7). The YM “electric” equation (14a) contains a new term
which does not appear in (7a), again, because
. The YM magnetic equation written as
indicates something quite unique in contrast to Maxwell’s electrodynamics: Whereas (aside perhaps from Dirac’s [
3]) there is no net flux of any U(1) magnetic fields through any closed two-dimensional surface, we learn from (14b) that Yang–Mills SU(N) “magnetic” fields can and do exhibit a net flux through such closed surfaces. This is because (13b) does provide Yang–Mills theory with non-vanishing magnetic monopoles.
Yang–Mills gauge theories have proved to be very successful for understanding the natural world. Weak interactions are correctly understood using SU(2), strong using SU(3), and electroweak using SU(2)xU(1). It is plausible that grand unification will eventually start with some larger SU(N) and spontaneously break symmetry in stages down to the phenomenological SU(3)xSU(2)xU(1). In short, we take Yang–Mills gauge theories very seriously for their ability to render what we observe in nature. Accordingly, if Yang–Mills theories also predict magnetic monopoles as in (12b) and (13b), and non-zero magnetic field surface fluxes as in (14b), we must ask equally serious questions about these monopoles. Most importantly: in the natural world, in what form do we observe these YM monopoles? Furthermore, what are the objects which in (14b) net flow across closed surfaces around and within the monopole?
As we shall see, these monopoles are observed as baryons and these objects which net flow across surfaces of these monopoles are observed as mesons. Together, they are the hadrons.
5. Populating Yang–Mills Magnetic Monopoles with Source Currents, by Inverting the Yang–Mills Electric Source Equation and Then Combining Both Maxwell Equations into One
It is common practice to start with an electric equation of the form (1a), or, presently, (12a) in which the source density
is a function of the gauge field
or
, then invert the configuration space operator to obtain the gauge field as a function of the source current. It is well-known, however, that the inverse of (1a) is infinite and cannot be obtained without removing some of the gauge freedom, typically through the gauge condition
. Alternatively, a finite inverse can be obtained with a Proca mass
m added by hand, whereby (1a) is written as
. Of course, a theory with a vector boson mass added by hand is no longer renormalizable, but it is also known how to cure this by revealing a mass through spontaneous symmetry breaking of the sort which underlies electroweak theory. With this in mind, we now add a mass by hand to (12a), and with
write:
With this same
m, (10a) becomes
. Then we find from (11) that
is now a required covariant condition—not merely an optional gauge condition—through which the scalar degree of freedom is removed from
. So, for a massive vector boson in Yang–Mills theory:
which we already used in going from (12a) to the bottom line of (15). It is interesting—and a precursor to solving the Yang–Mills Mass Gap problem [
2] in
Section 17—that this produces a correctly signed term
in the configuration space operator of (15), because this is the form in which the electroweak Lagrangian
reveals renormalizable gauge boson masses following spontaneous symmetry breaking at the Fermi vacuum expectation value (vev).
From here the inverse calculation is straightforward: Via (15) we define a tensor
by:
Because
, we see that
is the left-side inverse of the configuration space operator
. Further, because the inverse
of any
N-dimensional square matrix
M must commute with
M, i.e.,
where Id is a like-dimensioned identity matrix, the full specification of this inverse for operation on either side is:
with the further requirement that
. Then, once we have an
which satisfies (18), we may insert (17) written as
into (12b) with suitable index renaming, and use
where
is the free space vacuum permittivity, to obtain:
This is important for two reasons: First, the U(1) Maxwell Equations (1) are two distinct equations because there are no magnetic monopoles. But in (19)—courtesy of the non-vanishing YM monopole (12b) and the YM current sources (12a) with Proca mass in (15), the latter inverted using (17)—the resulting YM monopole (19) combines both Yang–Mills Maxwell equations into a single equation. Second, this YM monopole
has now been populated with a triplet of YM source currents
,
,
having three distinct indexes. So when expanded with
, and with each source being related to fermion wavefunctions courtesy of Dirac’s [
12], the net result of (19), as we shall see, is that we have populated the Yang–Mills monopole with a triplet of fermions. Via the exclusion principle following symmetry breaking these will end up in the fundamental representation of an exact SU(3) group, and thus have the same character as a quark color triplet. This is the route to discovering that these YM monopoles possess all the Quantum Chromodynamics (QCD) properties of baryons.
6. Nonlinear Recursive Interactions Contained in the Inverse Yang–Mills Electric Equation
The next step is to explicitly calculate the inverse using (18), then insert this in (17) and (19). This inverse calculation is carried out in detail in
Appendix A. The result including the
prescription is (A13). Before proceeding, however, let us establish notation conventions for representing the energy momentum of a particle four-vector: Whenever the energy-momentum is that of a fermion we shall use the notation
. For a massive vector boson we use
. And for a massless boson such as a photon or gluon we use
. Accordingly, with
c restored, because (A13) is for a massive vector boson, with a parenthetical
for highlighting reasons to be momentarily reviewed, and using the quoted “denominator” from (A13), whereby we represent an inverse by
with that inverse placed at the spot denoted by a subscripted
, we shall write this result in (A13) as:
Next, we use (20) in (17) to write:
When we contract the above from the left with another to form the Lagrangian density term , the result represents the propagator for a YM vector boson mediating between two source currents, which interactions are routinely represented with Feynman diagrams.
Now, were it not for the Yang–Mills terms with
parenthetically highlighted in the above, we could say (21) inverts (16) to obtain
as a function
exclusively of
. However,
is in (21), so that is
not what (21) does. Rather, in (21)
is a recursive function of
and of itself. For example, at the first recursion we substitute
on the left into the
to obtain:
In the bottom denominator we use the shorthand . This type of substitution can and must be done indefinitely, approaching an infinite number of substitutions, before we truly have and not . We may symbolically represent this infinite recursive series by , which we condense into the notation in (20) to (22). Obviously, the Lagrangian term has a similarly recursive character.
Likewise, this recursion introduces an infinite number of occurrences of into (22), amplifying the nonlinearity of interactions amongst the themselves. This should not be surprising, because it is well-known Yang–Mills gauge theories are highly non-linear: Using (8), the Lagrangian density in which and , provides the basis for three- and four-gluon vertices. This is often used to illustrate the inherent non-linearity of particle interactions in Yang–Mills gauge theory, versus linear behaviors in QED where photons do not interact among themselves. So, the recursion in (22) is just another manifestation of this Yang–Mills non-linearity.
7. Introducing the Inverse Yang–Mills Electric Source Equation into the Yang–Mills Magnetic Monopoles, then Populating These Monopoles with Dirac Fermions
Next, we substitute the inverse (20) with renamed indexes as needed and recursion-highlighting parentheses removed, into the monopoles (19), to obtain:
This provides a complete, explicit description of the way in which the YM monopoles are populated with the three source currents , , , keeping in mind that every needs to be filled with an unlimited-approaching-infinite number of recursions as in (22).
When we further expand using
obtained from the Yang–Mills continuity equation as reviewed in (A17) of
Appendix B, we can directly populate (23) with Dirac fermions, as such:
Then, by an indefinite-number-approaching-infinity of recursions as reviewed at (21) and (22), the fermion and boson interactions inside these monopoles are seen to be highly nonlinear to infinite order. The final step is to show that (24) indeed represents the QCD properties of a baryon with three quarks and highly nonlinear gluon interactions among these quarks.
8. The Yang–Mills “Signal” Magnetic Monopole, without Perturbative “Noise”
It is well established that protons and neutrons, which are the two most important baryons insofar as they form the nuclei of the observed material universe, are teeming with non-linear interactions. For example, 2019 Particle Data Group (PDG) data [
13] informs us that the free proton and neutron rest masses are
and
, but that the up and down current quark masses in an
renormalization scheme at a scale
are
and
, all respectively. With quark content p(duu) and n(udd), these “current quark” masses contribute about 1% to the overall rest energies of these baryons, with the other 99% arising from nonlinear gluon-mediated interactions among these quarks and from internal kinetic energies. These are distinguished from “constituent quark” masses which stem from attributing about a third of the total rest energy of a nucleon (≈313 GeV) arising from their quark and gluon energies, to each quark. So, for a baryon, we may similarly coin “current baryon” to refer to the bare quark structure of the baryon with all nonlinearity stripped away, and “constituent baryon” to refer to the baryon including all its nonlinear behaviors. Here, going forward, we shall borrow the electrical engineering terms “signal” and “noise,” and use the term “signal baryon” to refer to a baryon with all non-linear behaviors stripped away (the “current baryon”), and use “signal-plus-noise baryon” to refer to the entire observed baryon with all of its nonlinearity (the “constituent baryon”). Using this language, with each
Gμ in (24) treated recursively in the manner of (22), what we have in (24) is clearly a “signal-plus-noise” monopole density. To explore the underlying QCD behaviors of these monopoles, we now shall study just the “signal” monopole density with all “noise” (which comprises about 99% of the observed rest energy of the proton and neutron and a large share of the rest energy of other baryons as well), removed.
It is fair to say that Yang–Mills gauge theory is a theory of perturbations added to Maxwell’s linear electrodynamics in
Section 2. For example, in the opening paragraph of
Section 4 we find this in
arising from the second-order structure of the Klein–Gordon equation, and in
arising from the first-order Dirac equation. This is the “noise” of Yang–Mills theory. This is what Jaffe and Witten in [
2] refer to as “excitations of the vacuum.” If we wish to study just the “signal” without “excitations,” the way to do so is to set the perturbations to zero and see what is left. So we do just that: Working from (23), the sources
can be related to Dirac fermions via
obtained in (A17). Then we set
which likewise means we have set
, thus
. This removes from (23), all recursive non-linearity reviewed at (22), and it turns “denominators” into regular denominators without quotes. Still remaining are objects of the form
. However, as seen following (A17), when
the continuity equation is
which in notation reviewed at the start of
Section 6 means that
. So, all terms of this form as well as
can be removed from (23). At this point all that remains are six numerator terms for which indexes can be raised via
, and signs cancelled. We finally segregate commutators in the numerator, so the pure “signal monopole” density inside the “noisy” (23) simply becomes:
It is helpful to contrast (25) with (12b) from which it originated: All that has changed is that the gauge boson commutators
have been replaced by the electric source commutators
, with the newly appearing denominators reflecting the inversion of Maxwell’s Yang–Mills electric Equation (15) with mass
m into
for the non-recursive signal monopole. Because the monopole is itself an N×N matrix for SU(N), with generators each normalized to
we may additionally use the relation
for
and
, thus
, to take the trace of both sides of (25), with the result that:
9. Populating the Yang–Mills “Signal” Magnetic Monopole with Dirac Fermions: Two Alternatives, Each of Which Shows That the Signal Monopole Contains Exactly Three Fermions
The next step is to populate the “signal” monopole trace (26) with Dirac fermions, similarly to what we did going from (23) to (24). The general relation between each and its fermion wavefunctions is (A17). However, since (25) is a signal monopole in which we have set thus , (A17) becomes . Because is an N×N matrix, the expansion of this is with . For SU(N) these are Nx4 column vectors, with N arising from the Yang–Mills and 4 from the Dirac structure of each fermion. Because these sit in the fundamental representation of SU(N) we need to have N distinct SU(N) state labels for each and adjoint . There are two logical possibilities:
First, because
, for example, has a
index, we may assign the label
to this fermion and its adjoint and write
. Likewise for
and
. We may also label
and
and
(the last with indexes doubling as labels) in the denominator, ditto for
and
. Doing this, and expanding commutators in (26), we obtain:
These fermions as well as objects in the denominators are now labelled with the index of the , or which contained them when we inserted fermions via source currents at (24), prior to removing the “noise.”
Second, alternatively, although the fermions were introduced into the signal-plus-noise monopole at (24), once introduced, they have become part and parcel of a monopole system merging both Yang–Mills–Maxwell dynamic Equations (12), with (12a) given mass at (15) then inverted at (21). So, once this monopole system is established, we can change the labeling in (27) so each fermion and related denominator objects are labelled, not by the index of the source which brought them into the monopole, but the index of the partial derivative acting on the fermion once it is in the monopole system, thus rendering the system indivisible. In this alternative (26) becomes:
As we shall see, (27) and (28) are related through a form of symmetry breaking analogous in some ways to what is used in electroweak theory, which will become identified with hadronization.
Most important—in both (27) and (28)—is that this signal baryon is now seen to contain exactly three fermions , , which arise from the rank 3 antisymmetric tensor which is the monopole of (12b) and its three additive terms. This “three-ness” is structurally fundamental to the covariant representation of Maxwell’s magnetic equations whether in U(1) Maxwell or in SU(N) Yang–Mills gauge theory. Consequently, the non-perturbative signal monopoles (27), (28) each describe a system—and (28) an indivisible system—built upon three distinct fermions. Just like a baryon.
10. Using the Gauge Group SU(3) to Establish Three Distinct Quantum States for the Three Fermions Populating a Yang–Mills Magnetic Monopole
The Fermi–Dirac–Pauli Exclusion Principle mandates that the fermions contained in any system of more than one fermion, e.g., an atom, nucleus, nucleon or baryon, must be distinguishable from all other fermions in that system by assignment of an exclusive quantum state. Accordingly, each of the fermions in (27) and (28)—being part of the monopole system—must have an exclusive quantum state. Because both (27) and (28) contain three fermions, the YM SU(N) gauge group used to provide this exclusion must have
, so it cannot be SU(2). On the other hand, because there are exactly three fermions in both (27) and (28), there is no need for
. Accordingly, we now use the group SU(3) to enforce Exclusion on the three fermions
,
,
in these alternative signal monopole systems (27) and (28). (We note without further detail here, that for these monopoles to be topologically stable, we must eventually employ
following spontaneous symmetry breaking from a larger group, see Cheng and Li [
14] at 472–473 and Weinberg [
15] at 442. This U(1) generator provides the foundation for introducing hadron flavor, which is the next developmental step following the results in this article regarding hadron color.)
So, for what has heretofore been
, we now use the 3 × 3 SU(3) Gell-Mann generators via
with
, normalized to
. We use
, so that the
SU(2) subset is embedded in the lower-right portion of
. We may of course choose any labels we wish for these three exclusive states, so we may as well call these Red, Green and Blue, then see whether and how these can be made synonymous with the colored quark states of Quantum Chromodynamics (QCD). With
T denoting the transpose, we place these in the fundamental SU(3) representation with the explicit column vectors and consequent adjoints:
Each of these
is now a 3 × 4 column vector (ket) and each
a 3 × 4 row vector (bra), with the 3 owing to the YM SU(3) internal symmetry and the 4 owing to the four-components of Dirac wavefunctions and spinors. From here we will use (29) in (27), then in (28).
It is profoundly important that the Exclusion Principle taken together with the aforementioned “three-ness” of magnetic monopoles, combine to put exactly three fermions into the signal monopole for SU(N), and so lead directly to SU(N = 3) as the symmetry group required to establish three exclusive fermion states. Normally, SU(3) with R, G, B states is the starting point upon which QCD is founded. Here, in contrast, we can be entirely agnostic a priori about the N in SU(N), until we find that the inherent structure of a Yang–Mills signal magnetic monopole requires that fermions inside the monopole be placed into the fundamental representation of SU(3). This raises the prospect that QCD has its dynamic physical origins in the non-vanishing magnetic monopoles of Yang–Mills gauge theory. But again, for now, R, G and B are just labels: SU(3)QCD is an exact symmetry because gluons in its adjoint representation are massless. However, for example, early theories of baryon flavor similarly placed (u, d, s) into the fundamental representation of SU(3) with an approximate flavor symmetry which is distinct from the exact color symmetry of SU(3)QCD. So, we must establish that this SU(3) group arising from the monopoles is truly synonymous with the exact SU(3) group of QCD, and not some independent SU(3).
11. The Yang–Mills Signal Magnetic Monopole Prior to Symmetry Breaking
Proceeding, we observe that at the center of each numerator in (27) are terms in which a wavefunction is immediately to the left of a differently labelled adjoint e.g.,
, while in (28) we see the same-labelled e.g.,
. In U(1) gauge theory these are 4 × 4 Dirac matrices, and in view of (29) these are 3 × 3 Yang–Mills matrices of 4 × 4 Dirac matrices. Specifically, focusing on the
, etc. “backbone” centering each numerator in (27), and carrying through antisymmetric signage between any pair of
indexes, we first observe using (29) that:
Then, we use (27) to flesh out the above, obtaining:
where
, etc. is defined as a shorthand for the denominators simply to save space. Taking a second trace of the 3 × 3 matrix in (31), we see clearly that
.
If we look at any of the six off-diagonal entries in (31), for example, the term with a numerator and a denominator (second row, third column) while referring to (29), we see something of a mismatch, with two colors of fermion in the numerator and two massive vector boson propagators in the denominator. Fermions, of course, contain four degrees of freedom (spin up and down, particle and antiparticle), while massive vector bosons contain three degrees of freedom (two transverse, one longitudinal). So with both and , in this numerator there are two colors of fermion totaling eight degrees of freedom, while with both and in the denominator there are two massive vector boson propagator denominators representing a total of six degrees of freedom.
Moreover, SU(3)QCD requires massless gluons, whereas the denominators in (31) are all for massive vector bosons. So this looks like an approximate SU(3) flavor-type rather than an exact SU(3) color symmetry. Moreover, (31) has vanishing trace. Furthermore, knowing that all particle propagators have the general form with spin sum being the completeness relation, we see that at the center of this numerator looks like a spin sum , but is not. This is because and in are not the same fermion but are two differently-colored fermions. This final point indicates the way forward, because if we can turn and in into the same fermion, we can use this as a fermion spin sum. Then, having a spin sum in the numerator and two massive boson propagators in the denominator, we can shuttle a degree of freedom from a boson to a fermion to simultaneously produce a fermion propagator and a massless boson propagator. This entails a form of spontaneous symmetry breaking which starts with (27) then breaks symmetry using (28), because the backbone of (28) does have the requisite same-fermion terms , and needed to use the completeness relation.
12. Spontaneously Breaking Symmetry inside the Yang–Mills Signal Magnetic Monopole
Keeping the fermion state definitions (29) exactly as is, we now examine the “backbone” of (28) formed by the three terms
,
and
. Each wavefunction
has adjoint
, and because these are back-to-back with no
between them, their
are the same. Here, however, unlike with (30), because these are same-labelled with the same
, we may not only write
, but given that
for the spin sum over fermion particle states, we may use
as the basis for a spin sum which can lead to a fermion propagator. So, if we additionally take the sum
over particle spins, what we find in contrast to (30) is the now-diagonalized backbone:
Next, we use (32) to flesh out (28), as we did with (30) for (27) to obtain (31). In doing so, we take the spin sums over all of the fermions inside of (28), which implies taking
over the entire signal monopole system as well. Consequently, with this spin sum included and
c = 1:
This is the diagonalized counterpart of (31). Because of this, we may take another trace which, in contrast to
in (31), is non-vanishing, namely:
Next, we focus on the Yang–Mills aspect of the numerators irrespective of Dirac matrices, which we can do because
and
with
are independent operators acting on independent parts of each Nx4 bra or ket in (29). We find the three structural combinations
,
and
. So we use
to calculate eight column vectors
, eight
, and eight
. The adjoint
,
and
are the Hermitian conjugates of these. The net result, easily confirmed, is: (For SU(N) in general, the factor
shown for SU(3) in (35) below, is equal to ½ times the number of states in the adjoint representation over the number of states in the fundamental representation, that is
. This is also equal to the magnitude of the principal Casimir operator for any SU(N), that is,
.)
We then use (35) in (34) and multiply through by
i to obtain:
Above, the mismatch between numerators containing fermions and denominators with massive vector boson propagators becomes crystalized. However, so too does the solution: In the numerators we have spin sums
for fermions, mismatched in the denominators with a pair
of massive vector boson propagators. Indeed, the reason we carefully respectively established the energy-momentum notation conventions
,
and
for fermions, massive vector bosons and massless vector bosons at the start of
Section 6, was so when we arrived at (36) this mismatch would be clear.
Now, with the
i, let us work with the (36) term operated upon. e.g., by
, ditto for the others. In the denominator we revert to indexes/labels showing the currents
and
which were initially associated with these fermions in (26), which appear in the denominator of (27). This
represents a total of six degrees of freedom, three for each of two massive vector bosons. Although the massive bosons were introduced by hand at (15), it is well-known how to give a renormalizable mass to these via
using the Higgs mechanism. We then break symmetry by releasing one degree of freedom from the
denominator and shuttling this over to be swallowed by the
denominator, or vice versa—it does not matter. Doing so, we demote
into a
massless vector boson such as a QCD gluon while consequently setting
. Simultaneously, we promote
to the energy momentum of the fermion and
and
to the mass and
of the R fermion. Finally, we rename denominator indexes back to those for
in (36). We summarize this symmetry breaking as follows:
Above, the original 3 + 3 = 6 degrees of freedom in the denominator are maintained, but redistributed to 2 + 4 = 6 degrees of freedom, two for the now-massless vector boson labelled with , and four for the red fermion. Doing the same and accounting for all three additive terms in (36) which now contain the three R, G, B fermion states, we multiply by 3 so that 9 + 9 = 18 degrees of freedom at the outset in six massive vector boson propagators have become redistributed as 6 + 12 = 18 into three massless vector boson and three massive fermion propagators. If we suppose that the massive vector bosons on the left of (37) originally come from a scalar field via the Higgs mechanism, then (37) is a later step in a “cascade” wherein a degree of freedom is first passed from a scalar to a massless vector boson to make the latter massive, then is passed from the vector boson to a fermion whereby the vector boson reverts to being massless. Mindful that , the upshot is that we now have a fermion propagator with the prescription sitting in the middle of (37).
Finally, replicating (37) with necessary reindexing and relabeling twice over then inserting into (36) we obtain:
This is now the Yang–Mills magnetic monopole with symmetry broken in two stages: First, by taking (27) where each fermion is inserted into the monopole via a current , or and turning it into (28) whereby the fermions now form part of an indivisible monopole system, no longer distinguishing fermions based on the index of their initial current. In this stage, (31) is diagonalized and turned into (33). Second, by using (37) to transfer longitudinal vector boson degrees of freedom over to fermions thus rendering the remaining vector bosons massless and revealing complete fermion propagators. Now, with the vector bosons being massless, the SU(3) symmetry introduced at (29) becomes an exact symmetry like that of SU(3)QCD, rather than the approximate symmetry of SU(3) flavor, bringing us closer to these R, G, B being true QCD states.
13. Incorporating the Massless Vector Boson Propagators into the Fermion Normalizations
We can further simplify (38) by suitable normalization of the three fermion spinors. Often, a covariant normalization with an energy-dimensioned
is employed for this. If we write this with
as
and place this with the corresponding label in front of each of the three terms in (38), we end up with
in the top line, ditto for the others. Then, because the YM monopole is an indivisible system, we may now choose
as a modified normalization in which
scales with the
of a massless vector boson within the overall system. Doing this, (38) simplifies to:
This becomes our final expression for the signal (non-perturbative) Yang–Mills magnetic monopole, which clearly embeds a propagator for each of its three fermions. Now we are ready to show why these monopoles have the same color and confinement properties as baryons, with interactions mediated by entities which have the same color properties as mesons.
14. Yang–Mills Magnetic Monopoles Have the Color-Neutral Singlet Wavefunction of Baryons, and Interact via Objects with the Color-Neutral Singlet Wavefunction of Mesons
We know that the RGB SU(3) symmetry in the monopole (39) is exact, because the vector bosons were made massless at (37). We know that in its unperturbed signal state, this monopole contains exactly three fermions. We know too, that this monopole is a third rank antisymmetric tensor. However, a key step in going from (27) to (28) in the first symmetry breaking stage, was to assign each fermion, not to the current which caried it into the monopole, but to the index of the partial derivative commonly operating on those fermions once they were already inside the monopole. This led to the associations , and , as very clearly seen in (39).
So, writing out the antisymmetry of the monopole indexes and relating these to their color associations once the fermions are all inside the monopole, the wavefunction symmetry of the now-indivisible monopole system may be schematically represented by:
This is precisely the antisymmetric color-neutral singlet wavefunction of a baryon, see eq. [2.70] of [
16]. Indeed, one can argue that the antisymmetric indexes in
should have been a tip-off that YM magnetic monopoles would make good baryons. Though individual fermions and vector bosons inside the monopole carry color charges, the entire monopole system is a color singlet.
Next, we return to the differential forms relation
of (14b) and again ask as we did at the end of
Section 4: what are these
entities which
do net flow across a YM magnetic monopole surface? Lowering all free indexes in (39), then using
to obtain the monopole 3-form, and using the antisymmetry among each infinitesimal element in
along with index renaming as needed, and also using
while reconnecting this to (13b), we obtain:
We then take the triple integral of all sides and apply (6) via
, to find:
Above, the Gaussian integration has removed the
operator from (41). Extracting the integrands from the surface integrals in (42) and using (9) written as
, then using
, restoring all spacetime indexes and removing
, we find:
By inspection, (42) and (43) have the respective schematic color wavefunctions:
This is precisely the required symmetric color-neutral singlet wavefunction for a meson. So in contrast to the U(1) magnetic monopoles of Maxwell for which there is no net magnetic field flux across any closed surface, there is a net flux of “chromo-magnetic” fields across the surface surrounding a Yang–Mills magnetic monopole, namely the first identified at (14b). However, as seen in (44), these objects are color-neutral, so there is still no flow of net color charge across any closed surface surrounding or within the monopole. We now see these have the required color-neutral wavefunction of mesons known to mediate baryon interactions.
Accordingly, we conclude that Yang–Mills magnetic monopoles have the antisymmetric color-neutral singlet wavefunction of baryons, and objects which net flow across their closed surfaces have the symmetric color-neutral singlet wavefunction of mesons. Together, these are the hadrons.
15. Act of Confinement: Dynamical Hadronization from Maxwell’s Yang–Mills Equations
In his review of the MIT bag model in
Section 18 of [
6], Close reviews Gauss’s theorem for electric charge—contained in
from (7a)—then “consider[s] the chromodynamics case which is analogous to” Gauss’s theorem. He states: “if the demand that no quark current crosses the boundary is supplemented by the demand that colour gluons are also confined then Gauss’s theorem implies that the system have zero colour charge.” He continues that in the bag model, “the introduction of a pressure
B that counterbalances the flow of colour flux automatically requires the system to be colour neutral. If colour symmetry is exact then the system must be a colour singlet.” This is precisely true of (39): its color symmetry is exact because the symmetry breaking in (37) made its gauge bosons massless, and it is a color singlet.
Close then makes the critical points, emphasis added, that “quark confinement arises as a result of colour confinement,” and that the bag model “imposition ad hoc of a boundary condition that confines the coloured gluons has, by Gauss, confined the coloured quarks.” Importantly, he concludes that “a dynamical origin for this boundary condition has not been presented” by the bag model. Or it appears, by any other theory to date.
Here, (43) and (44) demonstrate “by Gauss” that the objects which net flow across the monopole surface are color-neutral. This means, conversely, that objects which are not color neutral, i.e., which do have a color charge, do not net flow across the surface but are confined. (Because the confined objects are those with non-neutral net color, there is nothing in this result which prevents the electroweak photon, W± and Z from flowing across the monopole surface, because these are color-neutral.) Inside these signal monopoles, each of the three fermions has a net color charge in the fundamental representation of an SU(3) gauge group which is exact because its vector bosons are massless, and each of these massless vector bosons has a net bi-colored charge in the adjoint representation of SU(3). We therefore conclude that these fermions and massless gauge bosons are confined. Consequently, we further conclude that: the fermions in (39) are quarks; the now-massless gauge bosons are gluons; the signal monopole (39) is a baryon in a non-perturbative state with all “noise” filtered out; and the object (43) which net flows across the monopole-now-baryon surface is a meson.
Crucially, this is not an ad hoc result. It has a “
dynamical origin” in the very fundamental physics of Maxwell’s equations extended to the non-commuting gauge fields of Yang and Mills, coupled with Dirac’s quantum theory of fermions [
12] and the requirement that each fermion in a system such as an atom or a nucleus or a nucleon or a baryon must occupy an exclusive quantum state. There is nothing new or unsettled in any of the individual elements which are combined to reach this dynamical result. What is new is simply understanding how these all combine together to produce the hadronic phenomenology of QCD, and how the rank-3 antisymmetric structure of a magnetic monopole is dynamically responsible for SU(3)—not SU(2) or SU(4) or anything else—being the gauge group underlying hadronic physics. So, we do not need to postulate SU(3)
QCD as has been done ever since Gell-Mann [
17] and Zweig [
18] first discovered the quark model. The Yang–Mills magnetic monopoles dynamically make that postulate for us, all by themselves.
In cosmology, it is widely believed that hadronization occurred shortly after the Big Bang when the quark-gluon plasma cooled to the temperature below which free quarks and gluons cannot exist. In view of all the above, we can now identify the symmetry breaking of
Section 12 with hadronization of a free quark and gluon plasma believed to exist only at ultra-high GUT energies above ~10
15 GeV, not far below the Planck scale
. Specifically, the signal monopole obtained in (31) prior to symmetry breaking has
and is associated with energies above 10
15 GeV where quarks are free and can mingle with leptons, and where baryons with confined quarks and gluons are not yet formed. The signal monopole obtained in (39) after symmetry breaking has
and is associated with lower energies where free quarks and gluons no longer exist but are confined in color-neutral hadrons. So, the symmetry breaking to go from (27) to (28) in concert with (37) is now seen to take place at some energy
a few orders of magnitude below the Planck energy. The pre-symmetry breaking (27) in which quarks are labelled with the spacetime indexes of the current densities which carry them into the monopole shows the pre-hadronization baryon above
, which at (31) has
. We shall refer to these as the “plasma” labels. The post-break (28) in which the monopole is made indivisible with quarks now labelled independently from their current of origin shows the baryon below
once hadronization is complete, which at (39) now has
with confined color. We shall refer to these as the “confinement” labels. The symmetry breaking from (27) to (28), which via (37) takes us from (31) to (39), then becomes synonymous with
-scale hadronization which is dynamical, not ad hoc. This is what Close refers to as the “act of confinement.”
16. What Is a Baryon, and Who Ordered That?
If we overlook Rabi’s question, and take baryons and mesons as an empirical given without inquiring about their theoretical genesis, then we are pressed into asking questions about how quarks remain confined inside of baryons without knowing what baryons really are, and perhaps, to engineer ad hoc models of attractive and repulsive forces which yield quark and color confinement from a data fitting exercise, rather than obtaining the dynamic understanding referenced by Close [
6]. We are also pressed with only incomplete knowledge to ask how the transition takes place from a quark/gluon plasma at ultra-high energies and temperatures, to baryons with confined, asymptotically free quarks observed at ordinary energies and temperatures. Nor do we have a basis for truly understanding the quark–antiquark pairs which form the mesons that mediate baryon interactions, and which permeate the space inside a baryon. Further, although we do know that QCD stems from an exact SU(3) color symmetry, we do not know
why SU(3), as opposed to some other gauge group, provides this basis for strong interaction physics. All the while, lattice QCD provides only convincing numeric computation, not fully analytic proof, that our theories about strong interactions are correct, because of the perceived intractability of analytically studying the foregoing phenomena. This has produced a substantial body of literature exemplified by [
19,
20,
21,
22,
23,
24,
25], all seeking to answer these questions about confinement and hadronization and quark–antiquark objects and even the core nature of strong interactions, without the benefit of knowing what baryons really are. However, if we do know the true theoretical origin of baryons, then hadronization leading to confinement and to baryon interactions via mesons and even to SU(3) QCD itself ought to flow effortlessly from the natural inherent properties of these baryons.
By attending to Rabi’s question as has been done here, and discovering that baryons are in fact the magnetic monopoles of Yang–Mills gauge theory populated by the fermions in the electric charge sources of Yang–Mills gauge theory, Maxwell’s equations—extended from U(1) abelian to SU(N) Yang–Mills gauge theories—are naturally placed at the center of how we describe the behaviors of these baryons. Then, the integral form of Maxwell’s equations, particularly Gauss’s law for magnetism, simply becomes a statement of what does and does not flow across any closed surface around or within a baryon, in the basic spirit of the MIT bag model.
Consequently, one can examine quark and color confinement by simply deciphering what these Maxwell/Yang–Mills equations tell us about net fluxes across monopole boundaries, without having to construct elaborate models of attractive and repulsive forces engineered to bring about confinement. Specifically, in U(1) electrodynamics, Gauss’s law implies the inverse-square force of Coulomb’s law. So by approaching the confinement of quarks via what does and does not flow across closed surfaces, and finding that there is no flow of net color but there is a flow of colorless antisymmetric mesons in accordance with (44), we are implicitly ensuring that the forces inside the baryon linked to these Gauss’s law surfaces will operate to confine the quarks. That is, if we start with a theoretical monopole baryon in which no net color can ever cross a closed Gaussian surface around, partially through, or entirely within this baryon due to the surface flux properties inherent to monopoles which have been known since the time of Maxwell, then we are assured that the attractive and repulsive forces associated with this Yang–Mills extension of Gauss’s law for magnetism will balance precisely as needed to confine net color without ad hoc engineering. So, because each quark does have a net color charge and each gluon a net bi-colored charge, this means that the quarks and gluons are naturally confined.
In sum, once we engage Rabi’s question and understand that baryons are the Maxwell magnetic monopoles of Yang–Mills gauge theory, the question why quarks remain confined obtains a very simple and natural answer: they are confined because this is how Yang–Mills magnetic monopoles naturally behave. No more and no less. YM monopoles naturally bar net fluxes of color not only across what one might define as an infrared confinement boundary near about 1 fermi (the rough size of a baryon), but also across any smaller closed sub-surfaces wholly or partially inside of the baryon. Because quarks and gluons have net color and bi-color charges, with net color confined in this way, quarks and gluons will also be confined in the very same way.
Moreover, with such an understanding, rather than hypothesize a supersymmetry between bosons and fermions which has long been conjectured but never obtained any empirical support, we discover that hadronization employs a type of spontaneous symmetry breaking where longitudinal degrees of freedom in massive vector bosons are transferred over to fermions during a type of symmetry breaking not dissimilar to the Higgs mechanism which moves degrees of freedom from scalar bosons to vector bosons. This mechanism, particularly via the electroweak
W and
Z bosons [
26,
27,
28] and later through the direct observation of the Higgs boson [
29], has found clear experimental support and so gained wide acceptance. Such hadronization symmetry breaking makes those vector bosons massless and simultaneously renders the underlying symmetry of fermions in the fundamental representation of SU(3) exact, which is the foundation [
17], [
18] of settled QCD. This all proceeds in the ordinary four dimensions of spacetime without the need for additional space dimensions and compactification down to four spacetime dimensions, which originated with the work of Kaluza and Klein [
30,
31,
32].
With this understanding, SU(3) QCD itself becomes no longer just a highly successful postulate both theoretically and experimentally; rather it comes to rest, fully intact, on a deeper dynamic foundation: Because the ground state “signal” monopole is a system containing precisely three fermions, the Exclusion Principle naturally places these fermions into the fundamental representation of SU(3), which becomes exact once symmetry is broken and hadronization of the plasma is complete. What ties this all together very simply, is the fact that magnetic monopoles are third-rank antisymmetric tensors as first elaborated by Einstein in his seminal paper [
8]. So, when these monopoles are promoted from abelian to non-abelian status via Yang–Mills gauge theory [
1], then populated with source currents from the inverted Maxwell’s YM charge equation, they not only contain exactly three fermions in their ground state, but they become color-neutral, having the three-quark antisymmetric wavefunction (40) which carries a fingerprint identical to that of the generally covariant index structure of the magnetic monopole.
Indeed, the third-rank antisymmetric tensor structure of magnetic monopoles having no net magnetic charge and permitting no net magnetic field flux through closed spatial surfaces is perhaps the strongest clue that these monopoles are naturally suited to replicate the three-quark antisymmetric net-colorless wavefunction of a baryon with no net color flux through any closed surface. Thus, when understood through the integral formulation of Gauss’s law for magnetism which again motivates the MIT bag model, confinement boils down to the Yang-Mills magnetic monopoles barring a net flow of color through closed surfaces in the same way that U(1) abelian monopoles bar a net flow of magnetic fields through such surfaces. Meanwhile, hadronic interactions occur via colorless mesons which are enabled to and indeed do net flow through closed surfaces around these monopoles, while inside of monopoles all fluxes through closed sub-surfaces must likewise be color-neutral. The general result is that net color may not flow through closed Gaussian surfaces anywhere, analogously to magnetic fields in U(1) electrodynamics, and any net fluxes which do occur must be by color-neutral objects such as mesons and jets.
It is also important to be mindful that the complete magnetic monopole which corresponds to the physically observed baryons with all non-linear perturbative behaviors included is that of the infinitely recursive (24) which contains both the monopole “signal plus noise,” that (39) shows the “signal” rendition of this monopole with all “noise” removed, and that this “signal” monopole (39) is what directly reveals the color-neutral antisymmetric wavefunction (40) of a baryon, and following Gaussian integration, the color-neutral symmetric wavefunction (44) of a meson for fluxes through closed surfaces. The is the analytical way to understand what is ordinary studied using the numerical approach of lattice QCD. Indeed, the perceived intractability of analytical QCD is the result of the highly nonlinear nature of strong interactions and a large running coupling at low probe energies on the order of 1 GeV, because this seems to foreclose the small-coupling perturbative treatments which are effective, for example, in QED with its small running “fine structure” coupling ~1/137.036 and inverse-square force. In the results here, however, the non-linearities of QCD including the jets required to maintain color neutrality and color confinement are fully embedded in the infinitely recursive nature of (24), which provides an exact analytic expression for the monopole baryon in closed recursive form. So to carry through the analytics, one would resubstitute the gauge fields (21) into (24) in the manner of (22) over and over. Even more preferably, one would try to discern in closed from, the infinite series that emerges from these recursive resubstitutions, because this would finally produce the analytic QCD solution that lattice QCD only achieves numerically.
So, returning to Rabi, who ordered the baryons? Fundamentally, because a baryon is described by Maxwell’s equation for a magnetic monopole in covariant form, populated by fermions housed in Maxwell’s inverted equation for an electric charge, all generalized to non-abelian SU(N) Yang–Mills gauge theory, the answer is that they were primarily ordered by Maxwell and Yang and Mills, whose theories have gained universal confidence as to their underlying validity. Moreover, the similarly-established surface flux laws first developed by Gauss play a crucial role, because they specify both what is confined within closed YM monopole surfaces (net color thus colored quarks and bi-colored gluons) and what is allowed to net flow across these surfaces (colorless quark-antiquark pairs including colorless mesons and jets). A role is also assumed by Dirac whose quantum theory of the electron first exposited the existence and nature of fermions, and by Weyl for elaborating gauge theory itself. That these monopoles contain exactly three fermions in their ground state which fit the fundamental representation of SU(3) also uses the Exclusion Principle of Fermi, Dirac and Pauli. This “three-ness” itself, as well as the color-neutral nature of a baryon, is rooted in the third-rank antisymmetric tensor first taught by Einstein for the generally covariant description of magnetic monopoles. Finally, one should not overlook Hamilton who first developed non-commuting quaternions in order to compactly describe simple rotations in three space dimensions. This is because non-commuting mathematical objects subsequently came to play a central role in many areas of physics, including the extension of quaternions and their Pauli spin matrix descendants into the non-commuting vector potentials of Yang–Mills gauge theory. Importantly, all of these theories and theoretical tools are well-settled, uncontradicted and universally accepted. All that is novel in this present work, is the illumination of how these all combine to produce the observed QCD physics of hadrons.
17. Filling the Yang–Mills Mass Gap
It is well-known that Maxwell’s electric charge Equation (1a) has no inverse, or, to be precise, that the inverse
of its operator on
is infinite (singular). To deal with this, one of several approaches is typically required. A first option is to impose a gauge condition, often the covariant
. Then it is easy to obtain
alone. Another is to introduce a Proca mass by hand, which is what we did at (15). However, doing so means the theory is no longer renormalizable, so we must eventually find a way to remove this mass and introduce it some other way. Using
to obtain a “signal” inverse, the inverse of the operator in (15) becomes the familiar finite (A8). We also see from (A8) that when
this inverse becomes infinite, which directly demonstrates why (1a) has no inverse. Using notations reviewed at the start of
Section 6, this is not just because of
in the numerator which can be removed when contracted with a source current because of the signal continuity equation
reviewed near the end of
Appendix B. More importantly, it is because
in the denominator becomes zero when the vector boson is “on-shell,” meaning that
. When we remove the Proca mass so the boson is again massless, the denominator becomes
which on-shell is also
. These singularities are why we also need the
prescription to take a particle “off-shell,” i.e., to render it “virtual” about the singular “pole.”
In view of this, we sum the inverse (20) from the left with electric source density
and from the right with
, both from (10a). Recall, using
, that
as reviewed following (12). As obtained in (A15) the continuity equation
whereby the term
and so drops out. So, with
we obtain:
Now, we see how to remove the Proca mass thus restoring renormalizability while maintaining a finite inverse: Because the term
arises naturally from the Yang–Mills gauge theory and is itself square-mass-dimensioned, and because this term has the same form as what is in
whereby vector boson masses arise in a well-known way from spontaneous symmetry breaking, we simply use
to replace
by removing the latter entirely. In other words, Yang–Mills gauge theory puts a mass-producing term right where it needs to be in the form it needs to have, so we can set the mass added by hand at (15) back to
in the above without adverse consequence. Likewise, neither do we need
. So, removing these, and expanding
, (45) now becomes:
What is important now is that
is not an ordinary spacetime scalar, but rather, is an N×N matrix for any Yang–Mills gauge group SU(N), with adjoint structure established by the
products
. Furthermore, some of the matrices
and thus
contain imaginary components, which can be seen even in the simplest case of SU(2) where
and the Pauli identity
. The
which are also N×N YM matrices will contain additional imaginary components. Moreover, nothing restricts (46) to SU(3). As reviewed in
Section 10, this restriction is imposed by the magnetic monopoles. Here, we are dealing with Maxwell’s Yang–Mills Equation (12a) for electric source densities independent of magnetic monopoles, with the Proca mass of (15) now removed.
Because (46) is an N×N Yang–Mills matrix and a spacetime scalar, and additionally houses a finite-matrix (
f) inverse which by definition is invertible, i.e.,
, it has finite eigenvalues
which are calculated in the usual way for a square matrix
M via the determinant relation
, where Id is an N×N unit matrix for any SU(N). These
are obtained by:
However, (A8) provides the basis for seeing what these eigenvalues look like in the non-perturbative limit with all recursion reviewed at (21) and (22) removed via
so that
becomes
and
becomes
. In
Section 6 notation with what is now a continuity equation
from
Appendix B, these eigenvalues become:
Finally, combining (47) and (48) and reintroducing the infinite recursive series notation of (22) with … wherever a substitution
or
is required, and noting the similarity of
to
discussed at the start of
Section 4 whereby the perturbation is the difference “noise = (signal + noise) −
signal,” we obtain:
This is the mass gap solution. Specifically, referring to page 6 of [
2], for “any compact simple gauge group G,” that is, for any SU(N), “a non-trivial quantum Yang–Mills theory exists on
and has a mass gap > 0 namely there must be some constant
such that every excitation of the vacuum has energy at least
”. “Excitations of the vacuum” is another phrase for non-zero perturbations
defined following (11), and these arise from the canonic
promotion of
reviewed in
Section 3.
It is also another phrase for what we have referred to here as the “noise” of Yang–Mills dynamics over Maxwell’s U(1) electrodynamics. Indeed,
in the determinant (49), like
V, is also a direct
measure of “noise = (signal + noise) − signal.” When these excitations of the vacuum
, the inverse
has the infinitely recursive, highly nonlinear form of (22). However,
is clearly invertible (again,
), so it will have a non-zero determinant and non-zero
eigenvalues.
Further, is a correctly signed term for positive vector boson rest energy , has the same form as what appears in the symmetry-breaking , and at (46) was used to remove the Proca mass to restore renormalizability while maintaining finite invertibility. Because contains both real and imaginary numbers, these eigenvalues can be real, imaginary, or complex. So, when the eigenvalues (48) and then the in the eigenvalue denominator are computed using (49) for “any compact simple gauge group G,” we will find that “every excitation of the vacuum has energy at least ”. Moreover, the imaginary parts of in (49) can produce which corresponds physically to finite particle lifetimes. This is how Yang–Mills gauge theory reveals masses and finite lifetimes via in the (48) denominator while maintaining renormalizability by removing Proca masses and , thereby filling the Yang–Mills mass gap.
The manifest recursion highlighted in (49) is also important for understanding how to carry out exact closed analytic calculations in Yang–Mills theory, as opposed to using numerical methods such as those of Lattice QCD. In this regard, Jaffe and Witten state on page 7 of [
2] that “since the inception of quantum field theory, two central methods have emerged to show the existence of quantum fields on non-compact configuration space (such as Minkowski space). These known methods are (i) Find an exact solution in closed form; (ii) Solve a sequence of approximate problems, and establish convergence of these solutions to the desired limit.” The foregoing (49) suggests a third method which is really a hybrid of (i) and (ii): find an exact recursive kernel in closed form, and then expand that kernel in successive iterations approaching the limit of infinite recursive nesting to identify the underlying infinite series in a closed form. As noted toward the end of
Section 16, in the specific context of QCD rather than the general context of any compact simple Yang–Mills gauge group, this is the basis for developing an exact, closed analytical form of QCD, versus having to resort only to the numerical methods of lattice QCD.
18. Conclusions
For an entire century we have known experimentally about the existence of protons. For almost 90 years we have known about neutrons. However, beyond knowing baryons contain three quarks with exact SU(3) chromodynamic symmetry, contain massless gluons in the adjoint representation of SU(3), have an antisymmetric color-neutral singlet wavefunction, and interact via symmetric color-neutral singlet mesons, we still cannot answer Rabi’s simple query “who ordered that?,” and we do not understand the dynamic basis of quark and gluon confinement and hadronization.
Here, we have shown that the magnetic monopoles of Yang–Mills gauge theory in their “signal” state contain three fermions in the fundamental representation of SU(3). Following symmetry breaking which moves a degree of freedom from the gauge bosons to the fermions, the gauge bosons become massless, SU(3) becomes an exact symmetry, and a propagator is established for each fermion. The monopoles then have the same antisymmetric color singlet wavefunction as a baryon, and the field quanta of the magnetic fields fluxing through their surface have the same symmetric color singlet wavefunction as a meson. Consequently, we can identify these fermions with colored quarks, the massless gauge bosons with gluons, the magnetic monopoles with baryons, the fluxing entities with mesons, and the symmetry breaking with hadronization, while establishing that the quarks and gluons remain confined following hadronization. The result is a quantum chromodynamic (QCD) theory of the hadrons. Using analytic tools developed along the way, we also fill the Yang–Mills mass gap.
Finally, as previewed in the introduction and further detailed in
Section 16, Rabi’s question is answered: Protons, neutrons and other baryons were ordered primarily by Maxwell, Gauss, Yang and Mills. Additional foundations were provided by Weyl who was the father of gauge theory, as well as Fermi, Dirac and Pauli via Dirac’s quantum theory of the electron and the fermion Exclusion Principle, and Einstein’s generally covariant formulation of Maxwell’s magnetic monopoles as a third rank antisymmetric tensor which directly accounts for ground state baryons containing three colored quarks while remaining color-neutral. Finally, seminal credit must be attributed to Hamilton for pioneering non-commuting quaternions which more than a century later became the foundation of Yang–Mills gauge theory.