1. Introduction
The importance of the proper choice of the degrees of freedom can hardly be underestimated in physics. Taking the form of canonical transformations and generalized coordinates formalism in classical mechanics, the transformations of degrees of freedom had been taken to a qualitatively different level in quantum physics and are responsible for such dramatic phenomena as phase-transitions, superconductivity, superfluidity and quark-hadron duality. Representations of physical theory in terms of different generalized coordinates in classical mechanics were emphasized and elaborated in the theory of canonical transformation (CT) using Hamiltonian dynamics and the Jacobi Equation. The theory of CT plays a central role in classical mechanics both by providing a powerful tool for solving dynamics and by setting up a conceptual framework to establish bridges between various physical models. However, the general theory of CT fails in quantum mechanics due to the operator nature of the generalized coordinates that leads to the ordering ambiguities in generating functions. Although some conjectures existed that identified CT in quantum mechanics with unitary transformations and even some classes of non-unitary transformations, the true relevance of CT in quantum physics is yet to be established. Out of a large body of CT known in classical mechanics, the only transformation that survived the quantum transition is what is known today as the Bogoliubov Transformation (BT).
In this work, we consider the role of BT in the field theory of mixing and the definition of the flavor quantum states and make an attempt toward a complete formulation of such a field theory of mixing. This involves an introduction of general formalism as well as phenomenological implications. Quantum mixing of particles is among the most interesting and important topics in particle physics [
1,
2,
3,
4,
5,
6]. The standard model involves quantum mixing in the form of the Kobayashi–Maskawa (CKM) mixing matrix [
7], a generalization of the original Cabibbo mixing between
d and
s quarks [
8]. Also, recently, convincing evidence of neutrino mixing have been provided by Super-Kamiokande and SNO experiments [
5,
6,
9,
10,
11,
12,
13,
14], thus suggesting neutrino oscillations as the most likely resolution for the solar neutrino puzzle [
15,
16,
17,
18] and the neutrino masses [
19,
20,
21,
22].
Since the middle of the century, when the quantum mixing was first observed in meson systems, this phenomenon has played a significant role in the phenomenology of particle physics. Back in the 1960s, the mixing of
and
provided evidence of CP-violation in weak interactions [
23] and till today meson mixing is used immensely to experimentally determine the precise profile of the CKM unitarity triangle [
7,
8,
24]. Upgraded high-precision B-meson experiments would be vital to search for deviations from the unitarity in a CKM matrix, which can put important constraints on the physics beyond the standard model. At the same time, in the fermion sector, the discovery of neutrino mixing and neutrino masses challenged our fundamental understanding of the P-violation and, in part, of the standard model itself.
Regarding the vanishing magnitudes of the expected new physics effects (such as the unitarity violation in the CKM matrix and/or neutrino masses), it is imperative that the theoretical aspects of the quantum mixing are precisely understood. In this respect, it was noticed recently that the conventional treatment of flavor mixing, where the flavor states are defined in the Fock space of the energy eigenstates, suffers from the problem with the probability conservation [
25,
26]. This suggested that the mixed states should be treated independently from the energy eigenstates. It was found, indeed, that the flavor mixing in quantum field theory introduces very nontrivial relationships between the flavor and the energy quantum states, which lead to unitary inequivalence between the Fock space of the interacting fields and that of the free fields [
27,
28,
29,
30]. This is quite different from the conventional perturbation theory where one expects the vacuum of the interacting theory to be essentially the same as one of the free theory (up to a phase factor
[
31,
32]).
Recently, the importance of mixing transformations has prompted their fundamental examination from a field-theoretical perspective. The investigation of two-field unitary mixing in the fermion sector by Blasone and Vitiello [
25,
30,
33,
34] demonstrated a rich structure of the interacting-field vacuum as a SU(2) coherent state and altered the oscillation formula to include the antiparticle degrees of freedom. Subsequent analysis of the boson case revealed a similar but much richer structure of the vacuum of the interacting fields [
35,
36]. Especially, the pole structure in the inner product between the vacuum of the free theory and the vacuum of the interacting theory was found and related to the convergence radius of the perturbation series [
36]. Attempts to look at the mixing of more than two flavors have also been carried out [
37,
38]. Also, a mathematically rigorous study of 2-flavor quantum field mixing has been offered by Hannabuss and Latimer [
39,
40].
In this article, we present a brief review of CT and apply the quantum linear CT (or BT) to the advances in quantum field theory of mixing. The mixing relations for quantum states here are derived directly from the definition of the mixing for quantum fields and the unitary inequivalence of the Fock spaces of energy and flavor eigenstates is shown by a direct algebraic method. The time dynamics of the interacting fields is explicitly solved and the flavor oscillation formulas are derived in a general form with emphasis on antiparticle content and the effect from the nontrivial flavor vacuum.
The paper is organized as follows. In
Section 2, we discuss the theory of CT with a special emphasis on quantum linear CT (or BT) as they appear in classical and quantum physics discussing their novel applications even in superfluidity and low-energy QCD. In
Section 3, we analyze quantum mixing from a general perspective of quantum field theory and introduce most general field-theoretical linear mixing relations. We then analyze implications of these relations for the flavor vacuum state, flavor Fock space and time dynamics of flavor states in quantum field theory. In
Section 4, we present the mixing matrices and discuss the mass parameterization issue. We obtain the mass parameters in rather general form for boson/fermion mixing with an arbitrary number of flavors and discuss the issue of the mass parametrization. Conclusions follow in
Section 5.
Supplemental materials are placed in the Appendices.
Appendix A is devoted to explicitly solving the structure of the nontrivial flavor vacuum state in the general quantum field theory of flavor mixing. In
Appendix B, we list explicitly the mixing parameters for the most important cases of particle mixing. In
Appendix C, we present the results of application of our general formalism to 2-flavor mixing for spin 0, 1/2 and 1 particles.
2. Canonical Transformations in Quantum Physics
The theory of CT is a powerful tool of classical mechanics. Yet, the strength of CT is yet to be fully realized in quantum mechanics [
41,
42,
43,
44]. The ground for CT in classical mechanics is laid down by the Hamilton formalism in which the Lagrangian function and the dynamics in terms of generalized coordinates
is translated into dynamics in terms of twice as many coordinates and momenta and the Hamiltonian function
Hamiltonian formalism is advantageous over Lagrangian formalism in that it reduces the system’s dynamics to the solution of a set of first order differential equations. On the other hand, one has to deal with twice as many variables. The main advantage of the Hamiltonian formalism is, however, in the conceptual framework. Note that in Lagrangian formalism the only “fundamental” degrees of freedom are the generalized coordinates and the momenta are merely derivatives
. In Hamiltonian formalism both coordinates and momenta are treated on equal footing and are independent except for the dynamical link established by Hamilton’s Equation (
2). Such independence provides the most important piece of the foundation for the theory of canonical transformations.
Thus, in classical mechanics CT is such a change of the phase space variables
that preserves the Poisson bracket
The Poisson bracket is defined as usual,
The main property of CT is the way the
action is transformed
where the full differential is given by
As long as Equation (
5) holds, the dynamics of the system in new coordinates is described with Hamilton formalism and, in this sense, is similar to that of the original description.
is often used to characterize classical CT and is typically called a generating function [
45]. CTs are extremely helpful tools that allow to change the system’s Hamiltonian to a simpler form thus leading to great simplifications in the equations of motion. A textbook example in this respect is the oscillator dynamics, which can be transformed to a trivial problem with decoupled variables with [
45]
conventionally denoted as
a and
. CT can also be used to generate families of exactly solvable Hamiltonians out of a single Hamiltonian where the dynamics are known, thus, providing a set of “toy” models in which the properties can be studied exactly. Finally, CT plays a central role in the Jacobi theory where a special transformation is sought that reduces the dynamics to a trivial one
and thus provides immediately the full solution to the classical equations of motion [
45].
In quantum mechanics, however, the use of CTs is practically completely lost. Three major problems exist in the translation of the formalism of CTs to the quantum mechanics [
46,
47,
48]: the ordering of operators must be specified, the inverse and fractional powers of operators that may appear in the transformation must be handled and the possibility of non-unitary CT must be addressed. While in classical mechanics three major roles of canonical transformations (evolution, physical equivalence and solving theory) are blurred together, in quantum mechanics they are distinct. In quantum mechanics the evolution is produced by unitary transformations, while the physical equivalence is proved with isometric transformations (norm preserving isomorphisms between different Hilbert spaces) and the solution of a theory is achieved by general transformations, which may involve non-unitary transformations [
49,
50].
A number of approaches had been pursued in a general attempt to resolve these issues [
41,
42,
43,
44,
47,
48,
51,
52,
53,
54,
55,
56,
57,
58,
59]. The conjecture that CTs in quantum mechanics are one and the same with the unitary transformations is one of the oldest such attempts [
51,
52]. Non-hermitian linear transformations [
56] and general form integral transformations [
53,
54,
55] had also been considered in this respect. Such approaches typically experience the problem that the procedure necessary to build and apply a CT is not at all simpler than finding the solution of the original Schrodinger equation itself, thus undermining the very first idea of the use of CT for simplification of the original problem [
51,
52,
53,
54,
55].
A rather different approach embraces the path integral as the base for further development of CT in the quantum theory [
57,
58,
59]. The Feynman path integral (PI) written in the phase space of
p and
q provides one of the most startling paradoxes in the breakdown between classical and quantum CTs. Conventionally, one writes the PI in the Euclidean space as
The similarity between Equations (
5) and (
8) may tempt one to use this relation as the basis for the program of quantization of CT [
57,
58,
59]. Nonetheless it had been noted in the literature that the main obstacle along this line consists in the discretized nature of the expression (
8) [
44]. Specifically, one needs to remember that Equation (
8) is only a formal representation in which the time derivatives, for example, should be properly defined. In this sense Equation (
5) can not hold if
in a finite-difference form. Also, an explicit application of canonical transformation to formal expression (
8) to derive, for example, the analog of the Jacobi equation for the propagator
immediately yields an inconsistent answer.
The breakdown of Equation (
6) in the quantum case can be seen as the primary source of the lack of correspondence between quantum and classical CTs. It may be of interest to examine this point more closely. Consider, for example, the point transformation
in the usual Lagrangian form of the path integral
This transformation is widely applied in the field theory and generally known to work well even though Equation (
9) is only a formal representation of a properly discretized expression. One reason for this is that the contributions to the PI in Lagrangian form come only from the trajectories that are continuous, thus, justifying the use of transformation (
9). Really, one considers a trajectory that has a discontinuity
at some time
. If sampled with time step
, such trajectory would contribute to the integral a quantity
As
, this contribution becomes exponentially suppressed relative to the contributions coming from the comparable continuous trajectories. Then, it can be said that the support of the integral in Equation (
9) consists only from the continuous trajectories
and this is why the formal operation with the integrand in Equation (
9) works.
In the case of the phase space form of PI (
8), one may immediately observe that integration over discontinuous trajectories
is not suppressed. Really, the contribution of such a trajectory would come with merely a factor of
where
is the discontinuity in
. Moreover, to derive the Lagrangian PI from Equation (
9), one needs to integrate each
from
to
regardless of the values of
at the neighboring points. Thus, in Equation (
8) the discretized nature of the integral is important and Equation (
6), infinitesimally correct, cannot be generally used. It is useful to note, however, that Equation (
6) will retain its general validity in finite-differenced form if the canonical transformation is
linear. Indeed, it had been known for quite some time that linear canonical transformation can be applied successfully in quantum mechanics (most typical example is, again, quantum oscillator) [
41,
42,
43].
Despite such severe limitations of the apparatus of CTs in quantum mechanics, the use of linear CT had proved to be of great advantage in the study of nonperturbative features of quantum systems. The quantum linear CT was first put forward in 1947 by Russian physicist N. Bogoliubov in order to build a microscopic theory of superfluidity and often bears his name [
60]. The method was then extended to superconductivity in 1958 [
61,
62]. The theory of BT is usually formulated in terms of creation/annihilation operators of the quantum many-body problem. Two forms of BT (one for fermions and another for bosons) are known. For bosons BT reads
where
and
are transformation amplitudes such that
. For fermions
and
.
The theory of BT can be viewed mathematically as a rotation of basis in the linear space of quantum fields built on
. BT is a unitary transformation and can be represented in the form
Physically, due to the manifestation of fundamental physical properties as observable effects inherent to quantum physics, this transformation describes an alternative set of degrees of freedom of the field-theoretic model that appear as quasi-particles with different properties than those of the original particles.
Over the years, BT found a wide range of applications in various areas of quantum physics from condensed matter theory [
63] to strongly interacting QCD [
64,
65,
66,
67]. It provided a powerful nonperturbative tool that helps to understand many central features of macroscopic behavior of field-theoretic models. To illustrate the power of BT, let us begin by considering its first success in the theory of superfluidity in more detail.
Superfluidity is the phenomenon of loss of viscous friction in a flowing fluid at superlow temperatures. Superfluidity was discovered by Petr Kapitza and Arno Allen Penzias and Robert Woodrow Wilson in 1938 [
68]. P. Kapitza later received the Nobel prize for this discovery. The theoretical explanation of superfluidity was obtained in the works of L. Landau and N. Bogoliubov [
60,
69,
70]. According to Landau, viscous friction is due to the transfer of momentum between neighboring fluid elements toward the walls of the fluid container via the gradient in the fluid velocity. At low temperature, the viscous drag is transferred to the fluid from the stationary walls by means of elementary excitations. If such elementary excitation is created at the wall with momentum
p, it will transfer to the fluid a momentum
and an energy
so that the momentum and energy of the fluid become
The central point of this argument is the notion that the viscous drag excitations can be created at the walls only if there is free energy to do so, i.e., if the energy of the fluid will decrease when such excitation is created. That means that if
then none such elementary excitation can be created. For any fluid in which elementary excitations at low temperature are phonons with speed
such condition occurs when
.
In our argument, we intentionally dismissed the role of momentum transfer by the population of thermal elementary phonons that always exist in the liquid and that is capable of carrying momentum from one point to another and thus provide a source of viscous friction. The momentum, which thermal elementary excitations may support, is finite and decrease with the temperature. It may be shown that such momentum is proportional to fluid velocity and is
Whenever the momentum capacity of thermal phonons becomes insufficient to decelerate the fluid, i.e., superfluidity is observed. In these conditions, one component of mass acts like an ordinary fluid being subject to viscous forces and the remainder experiences zero viscous effect and forms the superfluid component. The amount of viscous components at a given temperature can be calculated in the microscopic theory of superfluidity, which we will describe below, and can be shown to fall as as the temperature decreases.
The microscopic theory of superfluidity had been suggested originally by N. Bogoliubov [
60,
63] and made use of three fundamental points, which are thought to be valid for low-temperature real fluid. The three fundamental assumptions of Bogoliubov’s treatment are that at low temperature a macroscopic number
of particles in the fluid occupies one single-particle state, for example,
, the interaction between particles is essentially short range and its main effect consists in scattering particles in/out of the Bose condensate, thus making pairs of particles with momenta
k and
as the second largest population in the system. With these assumptions the original many-body Hamiltonian
can be transformed into
where we also neglected “small” commutator
relative to
. Here,
g is the strength of effective short range interaction between the particles in the superfluid fluid and
is the particle density.
It was further observed by N. Bogoliubov that Hamiltonian (
19) can be exactly diagonalized with the linear canonical transformation of the form (
11) where
and
After BT, the Hamiltonian becomes
Equation (
22) solves the original problem in its entirety yielding the energy spectrum of the elementary excitations, ground state energy and vacuum structure. In particular, all elementary excitations correspond to coherent superpositions of particles
and holes
in the Bose condensate with the wave vector
k and energy
. In a long-wavelength limit (
) the elementary excitations represent sound waves with the propagation speed
For short wavelength, the spectrum is that of a free particle shifted upward by a constant arising from the interaction with the Bose condensate.
From Equation (
21), the thermal spectrum of the excitations can be easily derived and shown to be that of a Bose gas and all other thermodynamic properties of superfluid can be found [
63]. In particular, one can easily show that the momentum, which can be carried by thermal excitations, at low temperatures is proportional to
where
is the Bose–Einstein distribution. In this brief example, the power of the application of CT can be vividly seen because an exact solution to a highly nontrivial problem is obtained with ease and full information about the system becomes available.
Even when BT does not lead to a full diagonalization of the model Hamiltonian, the reduction gained by its use may be beneficial. In QCD, BT has been used continuously to describe nontrivial structure of a QCD vacuum, its superconducting properties and values of quark and glue condensations [
64,
65,
66,
67,
71]. In a number of variational and field-theoretic works, it was suggested that the quantum configurations with quark pairs may have a lower free energy than a perturbative QCD vacuum and thus be energetically preferred leading to existence of nontrivial color condensation in a QCD vacuum. Such models had also been extended to describe properties of dressed constituent quarks and to derive from fundamental QCD hadron structure and mass spectrum [
71]. In BT treatment, the QCD vacuum
is modeled as the BT vacuum annihilated by quasi-particle operators obtained from the original current-quark ladder operators with a rotation
Here,
is the helicity of the quark. Such vacuum can be explicitly related to the perturbative vacuum by
where the gap angle
is a free parameter used to minimize the energy of the trial QCD vacuum. The condition of the energy minimization typically results in a nonlinear integral gap equation, which needs to be solved in order for the explicit structure of the vacuum to become transparent. Such approach is able to describe the vacuum condensations in QCD as well as superconducting BCS features of the QCD vacuum. Operators
can be seen to describe the dressed quarks and may be further used to model dynamical mass generation of the constituent quarks in the Constituent Quark Model (CQM) and to produce a CQM-like description of the hadrons starting from the current quarks and fundamental QCD Hamiltonian [
71]. The BT application to the QCD in 1+1 dimensions [
72] has been exhibited for the mass gap of the quark [
73,
74,
75] and the quark–antiquark bound-state equations [
76,
77,
78,
79].
Along this line, one employs the above QCD vacuum to construct meson states as produced on top of
by means of the meson creation operator
The application of a variational principle, or Tamm–Dancoff truncation, to
leads to a Schrodinger type equation, which can be solved for the spectrum and wavefunctions of the mesons [
71]. Further improvements of this approach, which rely on the QCD vacuum improved by an introduction of two and four quasi-particle correlations, is known in QCD as Random Phase Approximation (RPA) [
71,
80,
81]. RPA, based on taking into account particle–particle and particle–hole correlations in the mean field, also applied in many other areas of quantum physics.
In all of these applications, BT serves as a powerful tool to help gain nonperturbative knowledge about quantum-mechanical and field-theoretical properties of physical systems. BT, stemming from the classical apparatus of CTs, is more an exception rather than a rule given that the systematic translation of classical CTs into quantum framework experiences detrimental difficulties. Nonetheless, since its introduction in 1947, BT found a wide area of applications ranging from problems of condensed matter physics to strong interacting QCD as well as meson physics. It is repeatedly employed to describe nontrivial correlations in physical systems responsible for some dramatic physical behavior such as superfluidity, superconductivity, phase transitions, nontrivial vacuum condensations. In this article, we will now focus our attention on the application of BT that appeared in the field theory of flavor oscillations. In the upcoming sections, we will examine this development in greater detail.
3. General Theory of Quantum Field Mixing
Quantum field mixing is a fascinating phenomenon, first observed in weak interactions, where the interacting states of a particle are dramatically different from the free-propagation states. As a result, the particle, say, produced in a weak decay evolves over time into a drastically different weak-interaction state with a very different weak decay signature. One usually thinks of this phenomenon in terms of weak-interaction (or flavor
) eigenstates and free-propagation (or energy
) eigenstates. The flavor state produced in weak interaction shall be represented as a superposition of energy-eigenstates, which then propagate independently from each other. Should weak decay happen once more, however, the evolved superposition of energy-eigenstates should be represented again in terms of flavor-eigenstates to find appropriate weak-decay modes. In quantum mechanics, mixing of flavors is described by the interaction Hamiltonian
1
where
are creation (annihilation) operators for quantum flavor-states. The full Hamiltonian
can be straightforwardly diagonalized by introducing quantum energy states
where
is an appropriate unitary mixing matrix. This simple transformation allows one to immediately solve for time dynamics of flavor states in quantum mechanics and arrive to oscillation formulas for probability for a flavor state
A to appear as a flavor state
B after time t;
In the case of only two flavors, one recovers the famous Pontecorvo oscillation formulas
where
is a 2-flavor mixing angle.
In quantum field theory, analogously, mixing is described with interaction Hamiltonian density
Full classical Hamiltonian can be similarly diagonalized with appropriately chosen linear transformation from flavor-fields
to mass-fields
Here
is a free-theory Hamiltonian and the free-fields
are given in terms of their Fourier transform as usual
and
with the standard equal time commutation/anticommutation relationships. In Equation (
37),
and
are the free particle and antiparticle amplitudes, respectively, and
is the helicity quantum number given by
where
is the spin operator and
.
Differently from the quantum mechanics, in the quantum field theory the transformation (
35) does not immediately imply a specific form for the mixing relations between flavor and energy eigenstates. In fact, the intrinsic presence of antiparticle degrees of freedom in quantum field theory introduces a dramatic difference. In field theory any operators with the same
conserved quantum numbers can mix. This means that in general in Equation (
31) not only the flavor particle annihilation operators with momentum
and helicity
will mix, but also the flavor antiparticle creation operators with momentum
and helicity
may enter. Thus, the most general linear mixing relations in quantum field theory are
where
stands for flavor-eigenstate (mass-eigenstate) particle annihilation operator and
stands for flavor-eigenstate (mass-eigenstate) antiparticle annihilation operator. In these relations, we explicitly imply that antiparticles and "particle-holes" are treated on an equal footing, as they enter covariant field
. Factor
with
S being the spin of the mixed fields (
is +1 for bosons and −1 for fermions) has the quantum-statistics related origin and is required to guarantee
. Consequently, we have introduced two mixing matrices,
and
, describing particle–particle mixing and particle–antiparticle cross-mixing. For brevity, we will suppress momentum notation implying that all quantities are taken at given momentum
and helicity
or
,
as indicated by the sign in front of the flavor/mass-eigenstate index (i.e.,
stands for
and
for
).
A few simple properties of quantum mixing transformation are in place. For Equation (
39) to preserve commutation/anticommutation relationships,
so that one can relate
and
to cos and sin for fermions or cosh and sinh for bosons, respectively;
Equation (
39) must also furnish a representation of the classical mixing transformation (
35) in the linear space of quantum fields. Then, it follows that the following relation should hold
independent of
. We will consider a specific form of
and
quantum mixing matrices in
Section 4. Let us continue with this general form for the moment.
The possibility of antiparticle admixture in (
39) has important consequences. In particular, it is clear that the vacuum
, associated with the free fields
, is not annihilated by
. In fact, the use of
to define flavor quantum states introduces various normalization problems. For example, if one proceeds to define the flavor quantum state as
then obviously, as follows from Equation (
40),
This forces one to introduce much artificial normalization factors as in, for example, [
27,
30]. In a more general form, a similar normalization problem had been demonstrated for a regular perturbation theory in the mixing of two fermions of spin 1/2 [
25,
26].
Quantum field theory of 2-flavor mixing is a theory simple enough for the perturbation series to be summed exactly. For example, one may obtain for a time-ordered two-point function
where the “bare” propagators are
. The transition amplitude for a fermion
A created at time
to go into the same particle at time
t then is given by
where
is forward (
) propagation function,
are bi-spinors used to expand mass-eigenstate fields and
Upon computing
one can explicitly observe that
and thus
[
25,
26]. These results indicate that special care needs to be taken in quantum field theory to properly define flavor quantum states. In particular, the flavor vacuum state
shall be properly defined as a state annihilated by all flavor particle/antiparticle annihilation operators and flavor quantum states shall be built on top of
.
The explicit structure of a flavor vacuum state can be obtained by solving the set of equations
Let us express the flavor vacuum state as a linear combination of the mass eigenstates, i.e., in the most general form,
with
. After applying Equation (
45) to Equation (
46) we get an infinite set of coupled equations
where
. The solution of this problem is presented in
Appendix A. For the flavor vacuum state we find explicitly
where
is an
element of the matrix
. The normalization constant
is fixed by
;
for fermions and
for bosons. We see that the flavor vacuum state has a rich coherent structure. This situation is different from the perturbative quantum field theory, where the adiabatic enabling of interaction ensures that
. A rich flavor vacuum renders additional effects in the flavor dynamics. In particular, the normalization constant
is always greater than 1 so that in the infinite volume limit, when the density of states is going to infinity, we have
Thus, any possible state for the flavor vacuum shall have an infinite norm in the free-field Fock space and therefore the flavor vacuum state cannot be found in the original Fock space. The unitary inequivalence of the flavor Fock space and the original Fock space is therefore established as as .
The time dynamics of quantum field theory can be entirely described in terms of its non-equal time commutation/anticommutation relationships. We note that in quantum field theory of mixing only
and
operators and their conjugates can mix together. We denote the set of quantum fields formed by all linear combinations of these operators and their products (algebra on
,
and h.c.) as a cluster
with a particular momentum
and helicity
. It follows then that
’s are invariant under mixing transformation (
39) and we can analyze each cluster independently from each other.
Then, the non-equal time commutators/anticommutators for flavor fields with given
and
can be derived from (
39) using the standard commutation/anticommutation relationships for the mass-eigenstate ladder operators;
where ± in
corresponds to commutation/anticommutation. The two matrices
F and
G represent the only nontrivial commutators/anticommutators in the sense that all others are either zero or can be written in terms of elements of these matrices. It is useful to note that, for
, Equation (
50) shall be reduced to
and
. We also note that
Equation (
50) allows one to compute many quantities in the quantum field theory of flavor mixing directly. For example, the time dynamics of the flavor-field ladder operators can be written as
. Then, one can directly get
and
, while all other coefficients are zeros:
We can now consider various quantities in quantum field theory of mixing, such as the condensate densities of the mass-eigenstate particles in the flavor vacuum (), the number of flavor-eigenstate particles in the flavor vacuum () and the particle number expectation for a single flavor-particle initial state, which is related in the Heisenberg picture to , .
The free-field particle condensates in the flavor vacuum state can be computed directly from Equation (
39);
In the following, the particle–antiparticle symmetry should be taken into account, so that a corresponding antiparticle quantity can be found from the particle expression after a necessary substitution (particles→antiparticles and vice versa). It means that the antiparticle condensate is also given by Equation (
53). Similarly, the flavor particle condensates in the free-field vacuum are given by Equation (
53) as well. Using Equation (
52), we can also get the flavor-particle condensates in the flavor vacuum;
It is remarkable that this number is not zero but oscillates with time.
The evolution of the particle (
) and antiparticle (
) expectation number with flavor
can be found using the standard technique of normal ordering, i.e., moving annihilation operators to the right side and creation operators to the left side of the expression. With this technique, in general, we obtain
For a specific case of the number evolution in the beam with a fixed 3-momentum, we find:
We emphasize that no general reason can be found in the above theory for
to be equal to
. In fact, explicit computation shows that if the mixing matrix
is not real, for example, CP violating CKM matrix,
will have nonzero contribution proportional to the imaginary part of
U. We must understand this quite unexpected result as a manifestation of T-violation in the presence of CP-violating mixing, which should be required by the CPT-theorem. Interestingly, Equation (
51) demonstrates then that T-violation can manifest itself only in “particle–antiparticle” cross-mixing, described by
and not in “particle–particle” mixing described by
.
The flavor charge
can be defined by [
33,
34,
35]
Taking into account Equation (
42), we can write, for example, for fermions (S=1/2),
Thus, we find
where
and
, and finally
These formulas are valid for bosons with the substitution , .
Finally, we note that Equation (
56) may be viewed as a superposition of two terms:
propagation and background vacuum contribution
. Thus, one may introduce the particle–particle and particle–antiparticle propagation amplitudes, respectively,
Indeed, such propagation amplitudes also appear from the flavor-field Green functions defined on the flavor vacuum : .
4. Mixing Matrices and Mass Parameters Issue
As we have seen in the previous section, the possibility of antiparticle admixture in quantum field-theoretical flavor mixing is capable of introducing new effects in flavor dynamics. The flavor vacuum state requires rich coherent structure and oscillation formulas change with high-frequency antiparticle terms.
If the description of classical (or quantum-mechanical) mixing involves one unitary matrix
, in quantum field theory in general one needs two matrices
and
to describe a representation of the mixing transformation. In field theory, these matrices should be related to their classical counter-part. A way to establish such a relationship was proposed in the quantum field theory of flavor mixing by Blasone and Vitiello [
30].
Blasone and Vitiello based their theory on the observation that an explicit quantum transformation
in the linear space of quantum fields can be constructed out of fields
and their canonical momenta
that provides a representation of the classical relation (
35);
In the simplest case of two-scalar mixing such a transformation is explicitly
This indeed generates the mixing transformation for 2-flavors
and diagonalizes the quantum Hamiltonian, which can be checked by a straightforward computation. In the associate Fock-space
acts similarly;
and the ladder operators are transformed in the same way;
These definitions are consistent in the sense that
and the flavor vacuum state satisfies
The two-flavor mixing and the three-flavor mixing (where explicit building of
is a much more difficult task) for both spin-1/2 fermions and scalar bosons had been analyzed along this line in the literature [
30,
33,
34,
35,
38].
To develop our general formulation, we shall pursue a slightly different strategy. We shall avoid the step of explicitly building the quantum representation
by employing Equation (
35) and an observation, pioneered in [
28], that the ladder operators can be directly extracted from the covariant fields with a linear operation. For example, for spin-1/2 fermions
where
H and
h parameters are defined as follows
Since the Fourier component
is obviously a linear combination of
, one can express the ladder operators as linear combinations of the original covariant fields. Using linearity of the mixing transformation (
62), we can find the structure of
without explicitly building
;
For the bosons, however, the ladder operators do not separate as in the fermion case, for example,
and in general
. Equation (
70) implies that particles and antiparticles in boson case can not be distinguished unless time dynamics is considered. Therefore, we define the ladder operators for bosons by
With Equations (67) and (71), we then derive for fermions
and for bosons:
By defining
where
this can be put in the form given by Equation (
39).
Using the formulas presented in
Appendix B, we can explicitly verify for
that the conditions (
40) and (
42) are satisfied and Equations (
72) and (73) provide a representation of the classical mixing transformation. As an example, consider
for spin-1/2 fermions. One notices that
can be reduced to
, for example, for fermions
so that
, where
and
. Thus, Equation (
42) is trivially satisfied. One can use then the general results from the previous sections to analyze the flavor dynamics in this case.
We note that
’s as well as
’s are in general dependent on the choice of mass parameters
used to define
H and
h in (
72) and (73) via amplitudes
and
. One may explicitly see this in the example of the charge operator. As can be seen from Equation (
60),
will not depend on the choice of mass parameters
only for real mixing matrix
. Otherwise, as in the example of CP-violating CKM matrix, there will be a nontrivial mass dependence from the imaginary part of
U. Interestingly, even in the latter case, there is no dependence on the mass of the flavor field
(
) but only on the mass associated with the initial flavor state
. Blasone and Vitiello maintained that the set of the free-field masses
shall be chosen as
.
An alternative view considers Equation (
69) as a result of expanding the flavor fields
in the basis parametrized by masses
, where Blasone and Vitiello’s choice corresponds to setting
to the free-field masses observed in the experiment. Still, one may as well expand the flavor fields on a basis with other flavor mass parameters
, corresponding to choosing
as different free-field amplitudes with flavor mass parameters
in Equations (
72) and (73) [
28].
This ambiguity can be understood in terms of the original Blasone and Vitiello’s idea as follows. For any
that is a valid representation of the classical mixing transformation,
, which can be obtained by means of a similarity transformation that mixes
and
but leaves their covariant combination
unchanged (i.e.,
), is also a valid representation of the mixing transformation. The ladder operators, defined by Equations (
72) and (73), therefore depend on the choice of
or, equivalently, the choice of “bare" mass
associated with the flavor states.
Although there are different opinions about whether or not the measurable quantities in the theory shall depend on the mass parameters [
28,
34,
35,
36,
82], we note that the mass parametrization problem indeed is not specific to the quantum mixing but exists in almost any instance of quantum field theory. For example, consider a free theory with Hamiltonian
one may change the mass parametrization
[
35];
where
and
. The number operator in such a transformation is not conserved [
36], for example, for fermions we can get
That may lead to an obviously incorrect assertion that the number of particles in the free field case is not an observable quantity. Mathematically, this is understood once we note that the above transformation is equivalent to the splitting of the initial Hamiltonian into
The additional self-interaction term there is responsible for oscillations in .
Physically, the transformation given by Equation (77) shall be viewed as a redefinition of one-particle state. The tilde quantities correspond to a new quasi-particle, so that the tilde number operator describes a different type of particles and thus it does not have to be invariant under such a transformation. Still, the particle–antiparticle difference is conserved in the transformation (77). The situation here may be analogous to the dependence of physical observables upon the choice of the coordinate system. Although the Casimir operator (e.g., in the spin observables) must be independent from the coordinate system, other physical quantities (e.g., , and ) do depend on the choice of the coordinate system. To compare the value of , say, between theory and experiment, one should first fix the coordinate system. Similarly, specific mass parameters may need to be fixed in a quantum field theory from physical reasons to compare theoretical results (e.g., the occupation numbers) with the experiment.
From the above example it is clear that the mass parametrization problem is present in any regular perturbation theory if one attempts to redefine the physical one-particle states as in Equation (77). In the free theory and the perturbation theory this issue is resolved by the presence of the mass scale of well defined asymptotic physical states, which therefore fix the mass parameters. In this sense, one can agree with Blasone and Vitiello in that the mass scale of the energy-eigenstates is the most natural choice for fixing
. Still, at least one other possible mass scale exists for the flavor mixing problem, which corresponds to the free-field part of the Hamiltonian (
36).
One may find the above analogy with coordinate systems crude and deem the mass parameters in Blasone and Vitiello theory “unphysical" or “arbitrary" and, thus, conclude that measured quantities cannot depend on such arbitrary parameters. However, we emphasize again that, as we demonstrated above, the same mass parameterization problem is present and well-tested by experiment perturbative quantum field theory. Many observables that are considered well defined in perturbative quantum field theory depend on the choice of such parameter. In perturbative theory, the asymptotic free fields provide a physical ground to fix those masses. In flavor mixing, where there is no option to asymptotically enable the mixing interaction, we are not yet aware of such an unambiguous way.