Since
interaction can change the muonium state into the anti-muonium one, the possibility to study muonium–anti-muonium oscillations arises. Theoretical analyses of conversion probability for muonium into antimuonium have been performed, both in particular new physics models [
11,
12,
13,
14,
15,
16], and using the framework of effective theory [
7], where all possible BSM models are encoded in a few Wilson coefficients of effective operators. Observation of muonium converting into anti-muonium provides clean probes of new physics in the leptonic sector [
17,
18].
3.1. Phenomenology of Muonium Oscillations
In order to determine experimental observables related to
oscillations, we recall that the treatment of the two-level system that represents muonium and antimuonium is similar to that of meson-antimeson oscillations [
1,
19,
20]. There are, however, several important differences. First, both ortho- and para-muonium can oscillate. Second, the SM oscillation probability is tiny, as it is related to a function of neutrino masses, so any experimental indication of oscillation would represent a sign of new physics.
In the presence of the interactions coupling
and
, the time development of a muonium and anti-muonium states would be coupled, so it would be appropriate to consider their combined evolution,
The time evolution of
evolution is governed by a Schrödinger-like equation,
where
is a
Hamiltonian (mass matrix) with non-zero off-diagonal terms originating from the
interactions. CPT-invariance dictates that the masses and widths of the muonium and anti-muonium are the same, so
,
. In what follows, we assume CP-invariance of the
interaction
1. Then,
The off-diagonal matrix elements in Equation (
11) can be related to the matrix elements of the effective operators introduced in
Section 1, as discussed in [
1,
19],
To find the propagating states, the mass matrix needs to be diagonalized. The basis in which the mass matrix is diagonal is represented by the mass eigenstates
, which are related to the flavor eigenstates
and
as
where we employed a convention where
. The mass and the width differences of the mass eigenstates are
Here, () are the masses (widths) of the physical mass eigenstates .
It is interesting to see how the Equation (
12) defines the mass and the lifetime differences. Since the first term in Equation (
12) is defined by a local operator, its matrix element does not develop an absorptive part, so it contributes to
, i.e., the mass difference. The second term contains bi-local contributions connected by physical intermediate states. This term has both real and imaginary parts and thus contributes to both
and
.
It is often convenient to introduce dimensionless quantities,
where the average lifetime
, and
is the muonium mass. Noting that
is defined by the standard model decay rate of the muon, and
x and
y are driven by the lepton-flavor violating interactions, we should expect that both
.
The time evolution of flavor eigenstates follows from Equation (
10) [
7,
19,
20],
where the coefficients
are defined as
As
we can expand Equation (
17) in power series in
x and
y to obtain
The most natural way to detect
oscillations experimentally is by producing
state and looking for the decay products of the CP-conjugated state
. Denoting an amplitude for the
decay into a final state
f as
and an amplitude for its decay into a CP-conjugated final state
as
, we can write the time-dependent decay rate of
into the
,
where
is a phase-space factor and we defined the oscillation rate
as
Integrating over time and normalizing to
we get the probability of
decaying as
at some time
,
The equation Equation (
21) [
7] generalizes oscillation probability found in the papers [
12,
14] by allowing for a non-zero lifetime difference in
oscillations. We will review how
x and
y are related to the fundamental parameters of the Lagrangian below [
7].
3.2. The Mass Difference x
The physical mixing parameters
x and
y can be obtained from Equation (
15). The mass difference
x comes from the dispersive part of the correlator.
Neglecting the SM contribution to the local
Hamiltonian,
, so the dominant contribution is only suppressed by
. We can neglect the contribution of the second term in Equation (
22), as it is suppressed by either
or by
. In the former case this suppression comes from the double insertion of the
term, each of which is suppressed by
, as follows from Equation (
4). In the later, the suppression comes from the insertion of the SM
and
terms with the operators given in Equation (
7).
Computation of the mass and the lifetime differences involves evaluating the matrix elements between the muonium states for both the spin-0 singlet and the spin-1 triplet configurations. Since is a QED bound state, such matrix elements can be written in terms of the value of the muonium wave function at the origin.
In the non-relativistic approximation, which is applicable for the muonium, it is given by a Coulombic bound state the wave function of the ground state,
where
is the muonium Bohr radius,
is the fine structure constant, and
is the reduced mass. The absolute value
at the origin can be written as
where we substituted the value of the Bohr radius. The applicability of a non-relativistic approximation to muonium can be established from a simple scaling argument. The typical momentum in the muonium state is
where, for a moment, we reinstated
c and
ℏ. We can see from Equation (
25) that
, justifying the non-relativistic approximation.
However, it might be easier to apply a factorization approach familiar from the description of meson flavor oscillations. In this approach the matrix elements in Equation (
22) are obtained by inserting a vacuum state to turn matrix elements of four-fermion operators
into products
of matrix elements of current operators. Such matrix elements can be further parameterized as
where
is the muonium decay constant [
6,
7],
is muonium’s four-momentum, and
is the ortho-muonium’s polarization vector. Note that
in the non-relativistic limit. The decay constant
can be expressed in terms of the bound-state wave function using the QED version of Van Royen-Weisskopf formula,
This factorization gives the exact result for the QED matrix elements of the six-fermion operators in the non-relativistic limit, as can be explicitly verified [
7]. Note that muonium mass and lifetime differences, and decay probabilities are thus suppressed by
.
Para-muonium. The matrix elements of the spin-singlet states can be obtained from Equation (
6) using the definitions of Equation (
26),
Combining the contributions from the different operators and using the definitions from Equations (
24) and (
27), we obtain an expression for
for the para-muonium state,
Ortho-muonium. Computing the relevant matrix elements for the vector ortho-muonium state, we obtain the matrix elements
Again, combining the contributions from the different operators, we obtain an expression for
for the ortho-muonium state,
The results in Equations (
29) and (
31) are universal and hold true for any new physics model that can be matched into a set of local
interactions.
3.3. The Lifetime Difference y
The lifetime difference in the muonium system, defined in Equation (
15), is obtained from the absorptive part of Equation (
12) and comes from the on-shell intermediate states common to both
and
[
21],
where
is a phase space function for a given intermediate state. There are many possible intermediate states composed of electrons, photons, and neutrinos. However, only the intermediate state containing neutrinos gives the largest contribution. This follows from the following argument. Noting that the contributions of multibody intermediate states are suppressed by the phase space factors
for
, only two body intermediate states need to be considered. All possible SM two-body intermediate states that can contribute to
y are
,
, and
.
The
intermediate state corresponds to a
decay
, which implies that
in Equation (
32). According to Equation (
4), it appears that, quite generally, this contribution is suppressed by
, i.e., will be much smaller than
x. The decays of muonia to the
intermediate states are generated by higher-dimensional operators and therefore are suppressed by even higher powers of
or the QED coupling
than the contributions considered here.
The only other possible contribution to
y comes from the on-shell
intermediate state. This intermediate state can be reached by the standard model tree level decay
and the
decay
, i.e., it is common for both
and
. This contribution is only suppressed by
and represents the parametrically leading contribution to
y [
7].
Writing
y in terms of the absorptive part of the correlation function,
where the
is given by the ordinary standard model Lagrangian of Equation (
3), and
only contributes through the operators
and
of Equation (
7).
Since the decaying muon injects a large momentum into the two-neutrino intermediate state, the integral in Equation (
33) is dominated by small distance contributions, compared to the scale set by
. We can compute the correlation function in Equation (
33) by employing a short distance operator product expansion, systematically expanding it in powers of
[
7].
Using Cutkoski rules to compute the discontinuity (imaginary part) of the transition amplitude (see
Figure 1), calculating the relevant phase space integrals, and taking the matrix elements for the spin-singlet and the spin-triplet states of the muonium we arrive at the lifetime differences for the two spin states [
7].
Para-muonium. The relevant matrix elements of the spin-singlet state can be read off the Equation (
28). Recalling the definitions in Equations (
24) and (
27), we obtain an expression for the lifetime difference
for the para-muonium state,
One should note that if current conservation assures that no lifetime difference is generated at this order in for the para-muonium.
Ortho-muonium. Employing the matrix elements for the spin-triplet state computed in Equation (
30), the lifetime difference for the vector muonia is
We emphasize that Equations (
34) and (
35) represent parametrically leading contributions to muonium lifetime difference, as they are only suppressed by two powers of
. Nevertheless, additional suppression by
makes the observation of the lifetime difference in the muonium system extremely challenging.
3.4. Experimental Studies of Muonium Oscillations
Both x and y are the observable parameters, so experiments studying oscillations could probe them by producing the state and looking for the decay products of the state. Such experiments must overcome several considerable challenges.
First, muonium states must be produced inside the target and moved to the decay volume of the detector. The efficiency of such a process strongly depends on the target material. For example, the Muonium–Antimuonium Conversion Spectrometer (MACS) at PSI [
18] used SiO
powder target with efficiency of 4–5%. A recently proposed Muonium-to-Antimuonium Conversion Experiment (MACE) [
22] at the China Spallation Neutron Source (CSNS) will use an aerogel target with laser-drilled channels. The efficiency of producing muonia on such target could reach 40%.
Second, the strategy of looking for the “wrong-sign” final state, while working well in the studies of or oscillations, faces new challenges when applied to muonium oscillation searchers. The main reason is that the final state in the muonium decay and that in the antimuonium decay , differ only by the flavor combination of the neutrino states. Since the neutrinos are not detected, a method based on the kinematics of the decay must be employed: the decay products of the involve fast electron with the momentum of about 53 MeV and slow positron with the momentum of about 13.5 eV. This follows from the fact that the main decay channel of the state is the decay of a muon.
Third, the experiments usually involve a setup where produced muonia propagate in the magnetic field
. This magnetic field suppresses oscillations by removing degeneracy between
and
[
15,
23]. It also has a different effect on different spin configurations of the muonium state and the Lorentz structure of the operators that generate mixing [
24,
25]. Fortunately, these effects can be taken into account. MACS experiment corrected the oscillation probability by introducing a factor
[
18],
The values of
, presented in Table II of [
18], are different for different values of magnetic field and chiral structure of the operators governing the
oscillations.
We can now use the derived expressions for
x and
y to place constraints on the BSM scale
(or the Wilson coefficients
) from the experimental constraints on muonium–anti-muoium oscillation parameters. Since both spin-0 and spin-1 muonium states were produced in the experiment [
18], we should average the oscillation probability over the number of polarization degrees of freedom,
where
is the experimental oscillation probability from Equation (
36). We shall use the values of
for
T from the Table II of [
18], as it will provide us the best experimental constraints on the BSM scale
. We set the corresponding Wilson coefficient
. We report those constraints in
Table 1.
As one can see from Equations (
29), (
31), (
34) and (
35), each observable depends on the combination of the operators. Assuming that only one operator at a time gives a dominant contribution, i.e., employing the single operator dominance hypothesis, it is possible to constrain the Wilson coefficients of each operator. While this ansatz is not necessarily realized in many particular UV completions of the LFV EFTs, as cancellations among contributions of different operators are possible, it is, however, a useful tool in constraining parameters of
.
We must emphasize that the constraints presented in
Table 1 use the data obtained more than twenty years ago! New results from new experiments are therefore highly desired. The MACE experiment [
22] is expected to improve the sensitivity to
by at least two orders of magnitude. A new experiment at J-PARC is also expected to improve the constraints on the muonium–antimuonium oscillation parameters [
26].
3.5. Constraints on Explicit Models of New Physics
It might be instructive to consider explicit models of new physics which could be probed by oscillations. This approach lacks the universality of the EFT approach described above. However, what it lacks in the universality it compensates for in the applicability: new degrees of freedom introduced in explicit models can contribute to other processes, some of which might not even include FCNCs or muons. Even restricting our attention to the sector of those frameworks containing and/or interactions reveals such a multitude of models that reviewing each and every one of them here would be impractical. We will take another approach.
We will review two specific models, one with heavy NP particles (a doubly-charged Higgs model), and one with light NP states (a model with flavor-violating axion-like particle (ALPs)) as examples. Then we discuss classes of NP interactions that can be probed by muonium oscillations, and refer the readers to further examples.
One way to classify the models of NP that contribute to
mixing is by the masses of NP particles
m. The models with heavy (
) new physics degrees of freedom can be matched to the effective Lagrangian of Equation (
5). With that, experimental constraints on the parameters of this Lagrangian discussed in
Section 3.4 and
Table 1 lead to constraints on model parameters. The constraints on models with light (
) new physics degrees of freedom can be implemented within the framework of either concrete or simplified models.
Heavy new physics. Let us consider a model which contains a doubly-charged Higgs boson [
27,
28,
29]. Such states often appear in the context of left-right models [
30,
31], where an additional Higgs triplet is introduced to introduce neutrino masses
A coupling of the doubly charged Higgs field
to the lepton fields can be written as
where
is the charge-conjugated lepton state. Integrating out the
field, this Lagrangian leads to the following effective Hamiltonian [
27,
31]
below the scales associated with the doubly-charged Higgs field’s mass
. Examining Equation (
40) we see that this Hamiltonian matches onto our operator
(see Equation (
6)) with the scale
and the corresponding Wilson coefficient
.
The constraints on the masses and coupling constants depend on a particular model and other assumptions, such as whether the hierarchy of the neutrino masses is direct or inverse. It is however claimed that future experiments, such as MACE, could provide constraints on the doubly-charged Higgs state mass of
TeV [
29].
It is interesting to point out that such a doubly-charged scalar state would also contribute to the anomalous magnetic moment of the muon, both at one-loop [
32] and at two-loops via the Barr-Zee type of mechanism [
33]. It was shown that
with the mass of a few hundred GeV could explain the discrepancy between the theoretical prediction and experimental measurement of
of the muon [
34], particularly due to the enhancement from the two units of electric charge of the
.
Light new physics. Let us consider a model with light axion-like particles that couple derivatively to the lepton current [
16,
35]. For the flavor off-diagonal interactions of the ALP
a, the Lagrangian would contain a term
where
are Hermitian matrices of coupling constants, and
is a decay constant related to the scale of the symmetry breaking associated with the ALP.
Since the mass of the ALP,
, is a free parameter, it is possible that, while
a is light, to have
. In this case the ALP can be integrated out and the constraints from the
Table 1 imply that [
35]
In the opposite case
the constraint can be obtained by taking a limit
[
35]. Note that in both cases one can only constrain the combination
. These results have direct implications for ALP contributions to muon
[
16] (c.f., [
36]), the cross-section of
[
35], and other quantities.
A useful classification of lepton-flavor-violating interactions probed by the muonium oscillations was given in [
37]. Such interactions can be classified by the way they break the lepton flavor.
. In such models, the interactions do not violate lepton flavor quantum numbers. The muonium oscillations are induced at one-loop order by the mass terms of the fields that transform as singlets under the SM gauge group and have
. Examples of such models include constructions with Majorana neutrinos [
13,
37].
and
. In such models, the lepton flavor quantum numbers are separately broken. The mediators of such interactions, sometimes called dilepton bosons, have electric charge
and lepton number equal to two. The mediators could be scalar or vector bosons. Examples of such models include the doubly-charged Higgs model considered above [
24,
25,
27,
28,
29,
32,
33] and many other constructions [
37]. Muonium oscillations can be generated at tree level in such models.
. In such models, interaction terms violate both lepton flavor numbers. The mediators of such interactions are electrically neutral and could be both scalar and vector bosons. Examples of such models include models with flavor-violating Higgs boson [
38], and many others [
37]. Muonium oscillations can also be generated at the tree level in such models. These models can also be probed in muon conversion experiments unless the mediator is introduced such that it only interacts with the leptons.
and . In such models, effective operators mediating muonium oscillations are generated at one-loop order. These models can also be probed in other muon transitions, such as .
As was pointed out in [
37] if a
discrete symmetry is imposed to suppress the
interactions, while allowing for
terms, the oscillation rates are not well-probed by other experiments and can be as large as allowed by the current experimental bound. Clearly,
oscillations probe a wide variety of NP models and serve as effective tools that are complementary to other searches.