1. Introduction
Since the development of quantum field theory, it has been appreciated that the quantum vacuum is a fundamental type of physical reality which potentially contains all varieties of elementary particles and their interactions. Although an infinitely large energy density of zero-point oscillations of quantum fields (the so-called
virtual particles) could be considered to be catastrophic [
1], convenient self-consistent procedures have been elaborated on how to makeit equal to zero in the empty Minkowski space and take it into account when calculating the probabilities of arbitrary processes in the framework of the Standard Model. In doing so, the quantum vacuum is usually responsible for some part of the measured quantity (for instance, the Lamb shift), whereas the rest of it is determined by the ordinary (real) particles.
There is, however, one physical phenomenon, where the measured quantity is determined entirely by the quantum vacuum. This is the Casimir effect arising in the quantization volumes restricted by some material boundaries or in cosmological models with nontrivial topology [
2,
3,
4,
5]. An essence of this effect is that although the vacuum energy in restricted or topologically nontrivial volumes remains infinitely large, it becomes finite when subtracting the vacuum energy of empty topologically trivial Minkowski space. The negative derivative of the obtained finite vacuum energy with respect to the length parameter (either a separation between the boundary surfaces or a scale of the topologically nontrivial manifold) results in the Casimir force, which generalizes the familiar van der Waals force in the case of larger separations when one should take into consideration the effects of relativistic retardation [
5].
Recently it was understood that the quantum vacuum may be responsible for an impressive phenomenon in nature, i.e., for acceleration of expansion of the Universe [
6]. This can be explained by an impact of the energy density of quantum vacuum (which is often referred to as
dark energy) corresponding to a nonzero renormalized value of the cosmological constant in the Einstein equations of general relativity theory [
7].
Another subject is that at sufficiently short separations between the boundary surfaces, vacuum forces are stronger than Newtonian gravitation. In this case, they form a background for testing new physics, such as Yukawa-type corrections to Newton’s law of gravitation arising due to exchange of light hypothetical scalar particles [
8] or due to spontaneous compactification of extra spatial dimensions at the low-energy compactification scale [
9]. Forces of this kind would alter the energy eigenstates of a neutron in the gravity potential of the Earth, and are searched for by a technique called Gravity Resonance Spectroscopy [
10,
11,
12] by the
qBounce collaboration. On the background of Casimir forces, one could also search for axions and other axion-like particles [
13,
14,
15,
16,
17,
18,
19,
20] which are considered as hypothetical constituents of dark matter. It is remarkable that taken together, dark energy and dark matter contribute for more than 95% of the energy of the Universe, leaving less than 5% to the forms of energy we are presently capable to observe directly [
6].
Precise measurements of the Casimir force revealed a problem that the experimental data agree with theoretical predictions of the fundamental Lifshitz theory only under the condition that in calculations one disregards the relaxation properties of conduction electrons and the conductivity at a constant current for metallic and dielectric boundary surfaces, respectively (see review in [
5,
21] and more modern experiments [
22,
23,
24,
25,
26,
27]). Theoretically, it was shown that an inclusion of the relaxation properties of conduction electrons and the conductivity at a constant current in computations results in a violation of the Nernst heat theorem for the Casimir entropy (see review in [
5,
21] and further results [
28,
29,
30,
31]). Taking into account that both the relaxation properties of conduction electrons for metals and the conductivity at a constant current for dielectrics are well studied really existing phenomena, there must be profound physical reasons for disregarding them in calculations of the Casimir force caused by the zero-point and thermal fluctuations of the electromagnetic field.
All precise experiments on measuring the Casimir interaction mentioned above have been performed in the sphere-plate geometry at surface separations below a micrometer. In this paper, we consider the
Cannex (Casimir And Non-Newtonian force EXperiment) that was designed to test the quantum vacuum in the configuration of two parallel plates at separations up to 15
[
32,
33]. In addition to the already discussed possibility of testing the nature of dark energy [
32], we consider here the potentialities of this experiment for searching thermal effects in the Casimir force, stronger constraints on Yukawa-type corrections to Newton’s gravitational law, and on the coupling constants of axion-like particles. For this purpose, a modification in the measurement scheme is proposed, which allows simultaneous measurement of both the Casimir pressure and its gradient. It is shown that after making several improvements to the setup, the
Cannex test would be capable of performing the first observation of thermal effects in the Casimir interaction and to strengthen the presently available limits on Yukawa-type corrections to Newtonian gravity by up to a factor of 10. Stronger limits could also be obtained on the coupling constants of axion-like particles to nucleons within a wide range of axion masses.
The paper is organized as follows. In
Section 2 we briefly describe the experimental setup with two parallel plates and possible improvements.
Section 3 is devoted to the calculation of thermal effects in the Casimir pressure and its gradient at separations relevant for
Cannex. In
Section 4 the prospective constraints on non-Newtonian gravity and axion-like particles, which could be obtained from the improved setup, are found.
Section 6 contains our conclusions and discussion.
2. Experimental Setup with Improved Precision
We recently demonstrated the feasibility of Casimir pressure gradient measurements with the parallel plate
Cannex setup [
33]. In the present article, we propose several improvements to the experiment that will lead to a significant increase in sensitivity and reduce the influence of systematic effects.
In our setup shown in
Figure 1, the parallel plate geometry is implemented by a rigid vertical SiO
cylinder (lower plate) of radius
mm separated by a vacuum gap of width
a from a movable (upper) sensor plate that can be described as a lumped mass-spring system [
32]. This sensor with an interacting area
A, elastic constant
k, and effective mass
has a free resonance frequency
. In [
33] an interferometric detection scheme was used to measure the gradient of the Casimir pressure. Here, we propose to implement the same scheme to measure both pressure gradients and the pressure acting between the two flat plates. Gradients
are detected using a phase-locked loop (PLL) that senses shifts
of the sensor’s resonance frequency—a technique widely used in the literature [
34,
35,
36,
37,
38].
In the absence of thermal drifts, the pressure
P between our flat surfaces can be measured by monitoring the extension
of the spring-mass system using the same interferometer. For the determination and maintenance of parallelism, however, we recently used a feedback mechanism based on the capacitance between the two plates. While this scheme has been demonstrated to work in principle, practice has shown that vibrations in combination with the high
Q-factor of our sensor lead to unacceptably long integration times, and associated susceptibility to thermal drifts [
33]. To improve the performance and reach the full potential of the setup, we propose to replace the capacitive scheme by an interferometric one. As shown in
Figure 1, three Fabry Pérot interferometers below the sensor plate (created by the end faces of optical fibers and the reflective surface of the sensor plate) monitor the distance
a at different positions around the lower plate. (Please note that in the two-dimensional scheme in the figure, only two of these cavities are depicted.) The three fiber ends can be polished together with the lower plate to exactly match the surface of the latter. In such a system, the three lower interferometers could measure the tilt and frequency shift at all times. Synchronously, the upper (fourth) interferometer, being mechanically connected to the sensor frame, can be used to measure the extension
of the sensor and, hence, the pressure acting on it.
Besides this conceptual change several technical improvements are planned, which will result in the following: First, we aim to improve the seismic attenuation by a factor of 10 around
with respect to the present performance by means of active control techniques [
39]. This would not only reduce the direct influence on measurements via non-linear effects and rms noise, but would also improve the stability of various feedback circuits in a nontrivial way. Second, a newly designed distributed thermal control concept will guarantee mK stability throughout the setup, thereby eliminating drift and uncertainty in the sensor characteristics. Third, electrostatic patch effects that create a systematic pressure background could be reduced using in situ Ar ion bombardment [
38]. In the following, we analyze the major sources of experimental uncertainty and show the predicted effect of the mentioned improvements.
2.1. Sources of Error
Here, we consider experimental errors in the pressure and its gradient due to vibrations, the determination of displacement, and frequency shift of our sensor, variations in the temperature, tilt of the plates, and discuss the role of electrostatic patch effects.
Previously, we identified vibrations as a major source of error and gave respective tolerable limits for pressure measurements [
33]. Using a five-axis seismic attenuation system, we achieved a damping factor of 60 in vertical direction around the sensor resonance
Hz. The residual vibrations caused at times a peak displacement noise
nm of the sensor relative to the lower plate, which proved to be a severe nuisance during the measurements. For pressure gradients, the main influence is via non-linear effects [
34,
40]. For a function
representing either the pressure (
) or its gradient (
), we expand
for small sensor movements
. Recognizing that
, as usual for stochastic fluctuations, the main contribution to the error comes from the second order term. Thus, for the non-linear shifts in the pressure and its gradient, we therefore have
respectively. Similarly, the rms pressure and pressure gradient (entering measurements mainly as a nuisance within the mechanical bandwidth of the sensor
mHz around
) are computed from the spectral sensor movement
via
We note that vibrational noise also hampers the convergence of various feedback circuits and thereby influences the achievable sensitivity in a nontrivial way. Experience has shown that such effects are negligible below a peak amplitude of nm.
Another fundamental source of error is the uncertainty in various parameters involved in the evaluation of the frequency shift. We consider the following calibration procedure. At large separation
we apply the AC electrostatic excitation voltages
and
driving the sensor resonance and the surface potential compensation circuit, respectively [
33]. At this separation, the Casimir pressure
P is negligibly small with respect to the electrostatic pressure
. The free resonance frequency
is determined from a measurement of the resonance frequency
under the influence of the well-known electrostatic and Casimir forces. Here,
is the dielectric permittivity of vacuum. The effective mass
is determined during a separate sweep recording
as a function of an applied DC electrostatic potential. Eventually, we evaluate the resonance frequency shift measured at different separations to obtain the Casimir pressure gradient from Equation (
1) after subtraction of the electrostatic pressure gradient
. For pressure measurements at the same separations, we evaluate the detected extension
of the sensor that is related to the total pressure
. The achievable sensitivity for both types of measurement is limited by the uncertainties in all experimental quantities entering the respective evaluation. Numerical values for these and some other uncertainties considered below are given in
Table 1.
Variations in the temperature contribute in two ways to the experimental error. First, different material expansion coefficients influence the separation between the two interacting surfaces by roughly nm/K—an effect most influential at smaller a. Second, the temperature influences the Youngs modulus of our sensor, which leads to an additional error in a, but also offsets the resonance frequency and, thereby, mimics a pressure gradient.
We also consider errors due to tilt of the plates with respect to each other. For small angles
between the plates, we may estimate the influence on the pressure and its gradient by averaging
over the sensor area
A with the local separation
deviating from its nominal value
a due to the tilt. Numerical calculations show that the respective relative corrections to the pressure and its gradient are both of the order
, in agreement with the literature [
5]. For the achievable values of
given in
Table 1, these corrections are negligible.
As has been discussed in Ref. [
33], electrostatic effects can have a significant influence. While we recently compared our measurements with the model [
41], we now use the model [
42] that has been shown to describe the observed forces more realistically. The average patch size
and the value for
mentioned in
Table 1 were derived from the auto-correlation of actual Kelvin probe data for our surfaces. As
is much smaller than the plate separation, we can use the approximation
Here, mV, and is the Riemann zeta function. Please note that represents a systematic effect that can be characterized and removed from experimental data.
2.2. Sensitivity Estimation
Based on the models described in
Section 2.1, we have calculated the expected level of experimental uncertainty in measurements of the pressure and its gradient for both the present situation and presuming a successful implementation of the proposed improvements. For these calculations, we have assumed the parameters listed in
Table 1. As can be seen in
Figure 2a, the largest uncertainty in force measurements comes from the determination of the sensor extension, which includes contributions from the calibration and interferometry. Vibrations play, under the assumption that the proposed measures work as expected, a minor role. For force gradients (
Figure 2b) the determination of the frequency shift is the limiting factor at separations larger than
, while at smaller separations sensitivity is limited by vibrations. Temperature variations are influential on force gradient measurements for all separations, as they modify the Youngs modulus of our sensor. For comparison, we also plot the achievable uncertainty for the present version of the setup (dashed upper red lines), which are determined by the same factors as for the improved version. Please note that the patch pressure can be characterized separately and removed from the data.
The results for the improved uncertainties in
Figure 2 are based on very conservative estimates. In the experiment, especially at larger separations, the uncertainty in the pressure could be reduced statistically by longer measurements. For further calculations in
Section 4, we therefore assume a pressure sensitivity of 1
/
2 (corresponding to
) at
, which is slightly higher than the value corresponding to the bottom red line in
Figure 2a. The uncertainty in the pressure gradient is mainly determined by the resolution of the PLL’s frequency measurement. Here, the possibility for a statistical reduction of the uncertainty may be more problematic and therefore we use the pressure gradient sensitivity given by the bottom red line in
Figure 2b for calculations in
Section 4.
3. Possibilities to Measure Thermal Effects in the Casimir Force
As mentioned in
Section 1, most of the already performed experiments on measuring the Casimir interaction exploited the sphere-plate geometry. In doing so, the measured quantity was either the Casimir force acting between a sphere and a plate (in the static measurement scheme) or its gradient (in the dynamic measurement scheme). Due to the proximity force approximation, the latter quantity can be recalculated into the effective Casimir pressure between two parallel plates [
5,
21]. There is only one modern experiment on the direct measurement of the Casimir pressure between two parallel plates [
43], but it is not of sufficient precision to observe the thermal effects (an attempt to measure the Casimir effect between two parallel Al-coated plates at separations larger than a few micrometers was unsuccessful due to the presence of large background forces [
44]).
The distinctive feature of the
Cannex test of the quantum vacuum is that it can be adapted for simultaneous measurements of the Casimir pressure between two parallel plates and its gradient (see
Section 2).
The Lifshitz formula for the Casimir pressure between two material plates spaced at a separation
a at temperature
T is given by [
5,
21]
Here, it is assumed that the parallel plates made of a nonmagnetic material described by the dielectric permittivity are in thermal equilibrium with the environment at temperature T and the following notations are introduced. The Boltzmann constant is , the prime on the summation sign divides the term with by 2, is the magnitude of the projection of the wave vector on the plane of the plates, with are the Matsubara frequencies, and .
The reflection coefficients
are defined for two independent polarizations of the electromagnetic field, transverse magnetic (
) and transverse electric (
). Explicitly they are given by
where
By differentiating Equation (
8) with respect to separation between the plates
a, one obtains the gradient of the Casimir pressure
Equations (
8) and (
11) take an exact account of the effects of finite conductivity of the plate metal. As to corrections due to surface roughness, at separations exceeding
m they are much smaller than an error in the pressure measurements [
5].
For numerical computations it is convenient to introduce the dimensionless variables
In terms of these variables Equations (
8) and (
11) take the form
and
respectively, and the reflection coefficients from Equation (
9) take the form
For application to the experimental setup of
Cannex described in
Section 2, computations should be made for Au plates. The dielectric permittivity of Au along the imaginary frequency axis is obtained by means of the Kramers-Kronig relations using the available tabulated optical data for the complex index of refraction extrapolated down to zero frequency [
5,
21]. According to
Section 1, this extrapolation can be made either by means of the Drude model taking into account the relaxation properties of conduction electrons or the plasma model disregarding these relaxation properties. In dimensionless variables, the dielectric permittivity of the Drude model along the imaginary frequency axis is given by
where the dimensionless plasma frequency
and relaxation parameter
are connected with the dimensional ones by
and
. For Au the standard values
eV and
meV are used here. The dielectric permittivity of the plasma model
is obtained from Equation (
16) by putting
. Please note that for computations at separations
m performed below the optical data contribute negligibly small, so that the obtained gradients of the Casimir pressure are mostly determined by the extrapolations to lower frequencies.
Before presenting the computational results, we note that in the high-temperature limit
all the terms in Equations (
13) and (
14) with
are exponentially small and both the Casimir pressure and its gradient are given predominantly by the terms with
. The zero-frequency term of the Lifshitz formula takes different forms depending on the extrapolation used. If the plasma model
is used for extrapolation, one obtains
and
where in accordance to Equation (
15)
Calculating the first integrals on the right-hand side of Equations (
17) and (
18) and expanding the second ones in the powers of a small parameter
, we find [
5]
If, however, the Drude model
is used for extrapolation, one arrives at the result
and
Please note that at
K Equations (
20)–(
22) are applicable under a condition
Taking into account, however, that the powers in all exponentially small terms with
depend on
, one can see that at
K Equations (
20)–(
22) lead to rather precise results at all separations exceeding 6 or
m.
The computational results for the magnitude of the Casimir pressure and its gradient as functions of separation between the plates are presented in
Figure 3 by the red and blue solid lines computed at
K using extrapolations of the optical data of Au by means of the plasma and Drude models, respectively. For comparison purposes, the computational results at
are presented by the dashed lines. They are obtained by Equations (
13) and (
14) where summation over the discrete Matsubara frequencies is replaced with a continuous integration. It is taken into account that with decreasing temperature the relaxation parameter
quickly decreases towards a very small residual value at
that is determined by the defects of the crystal lattice. As a result, the values of
and
computed at
using the plasma and Drude models coincide at high accuracy.
For a better visualization, in the insets to
Figure 3a,b the computational results in the separation region from 5 to
m are presented using a uniform scale on the vertical axis. It is clearly seen that the theoretical predictions from using the plasma and Drude model extrapolations of the optical data can be discriminated if to take into account the errors in measuring
and
in the improved version of
Cannex discussed in
Section 2. Please note that if the Drude model extrapolation is used the thermal effect in the Casimir pressure and its gradient vanishes at approximately 6.4 and
separations, respectively, where the blue and dashed lines intersect.
To determine the specific role of thermal effects in the Casimir pressure and its gradient, we have also computed the relative thermal corrections defined as
where on the right-hand sides we indicated the dependence on temperature explicitly. Computational results for
and
are presented in
Figure 4a,b as functions of separation by the red and blue lines for the cases when extrapolation of the optical data for Au to lower frequencies is made by means of the plasma and Drude models, respectively. As is seen in these figures, within the separation range from 3 to
m the thermal effects make a considerable contribution to the Casimir pressure and its gradient. If the plasma model extrapolation is used, it reaches 440% and 240% of the zero-temperature pressure and its gradient, respectively, at
m. When using the extrapolation by means of the Drude model, the relative thermal effect in the Casimir pressure varies from approximately −34% at
m to 130% at
m and in the pressure gradient from approximately −40% at
m to 60% at
m. In this case, the thermal effect in the Casimir pressure vanishes at
m and in its gradient at
m in agreement with
Figure 3a,b, respectively.
4. Prospective Constraints on Non-Newtonian Gravity and Axion-Like Particles
As mentioned in
Section 1, the Casimir forces originating from the quantum vacuum form a background for testing the Yukawa-type corrections to Newton’s gravitational law and for searching the axion-like particles. It is convenient to parametrize the Yukawa-type potential between two point masses
and
situated at the points
and
as [
8]
Here, and are the interaction constant and the range of Yukawa interaction, and is the Newtonian gravitational constant (we note that in the experimental configuration under consideration one can neglect by the Newtonian gravitational pressure because it is less than an error in measurements of the Casimir pressure).
The Yukawa-type pressure between the top and bottom plates in the experimental setup of
Section 2 should be calculated taking into account the layer structure of both plates shown in
Figure 1. The top plate is made of high-resistivity Si of density
coated with a layer of Cr of density
and thickness
nm followed by a layer of Au of density
and thickness
. The thickness of Si substrate (
m) is sufficiently large to treat it as a semispace. The bottom plate is made of SiO
quartz crystal with the density
coated with a layer of Cr of thickness
nm followed by a layer of Au of thickness
. The thickness of SiO
substrate (
mm) again allows to consider it as a semispace.
Now we assume that one mass
belongs to the top plate and another one
to the bottom one and integrate Equation (
25) over the volumes of both parallel plates separated by a distance
a taking into account their layer structure. Calculating the negative derivative of the obtained interacting energy with respect to
a, we find the Yukawa force and finally the pressure [
45]
where the function
is defined as
The gradient of the Yukawa pressure is obtained by differentiating Equation (
26) with respect to
aNow the constraints on the parameters
and
of Yukawa-type interactions can be obtained from the inequalities
where
and
are the total experimental errors in the measured Casimir pressure and its gradient estimated in
Section 2. The meaning of Equation (
29) is that the experimental data are found in agreement with theoretical predictions for the Casimir pressure and its gradient and no extra contribution of unknown origin was observed.
To estimate the strength of prospective constraints, which can be obtained from the improved version of
Cannex, we use the thickness of the top and bottom Au layers
nm and
m and the total experimental errors
and
corresponding to the bottom red line in
Figure 2b. The computational results for
and
obtained by using the first and second inequalities in Equation (
29) are shown by the red lines in
Figure 5a,b, respectively. In so doing, the regions of
-planes above each line are excluded and below each line are allowed.
For comparison purposes, we present in
Figure 5a,b the strongest constraints obtained in the same interaction range from other experiments. The top and bottom blue lines demonstrate the constraints found from the results of the Casimir-less experiment [
46] and its improved version [
47], respectively. Both experiments were performed by means of a micromechanical torsional oscillator. The green line shows the constraints obtained from measuring the difference in lateral forces [
48]. The constraints indicated by the black line are found in [
49] from the torsion pendulum experiment. Finally, the constraints of the gray line follow from the Cavendish-type experiments performed at short separations [
50,
51,
52].
As is seen in
Figure 5a,b, the strongest constraints obtained up to date follow from the improved Casimir-less experiment [
47] (the bottom blue line). It is seen also that the largest strengthening of these constraints, which could be reached from the
Cannex test of quantum vacuum, follows from measurements of the Casimir pressure (this is because the pressure gradient is linear in
whereas the pressure is quadratic in
). The prospective constraints are stronger by up to a factor of 10 over a wide interaction range.
Next we consider the prospective constraints on the axion-to-nucleon coupling constants which could be obtained from the
Cannex test of the quantum vacuum. Taking into account that the exchange of one axion between two nucleons results in the spin-dependent interaction potential [
53] and the test bodies in this experiment are not polarized, any additional force of the axion origin could arise due to two-axion exchange.
Below we deal with axion-like particles coupled to nucleons by means of pseudo-scalar interaction Lagrangian [
54]. In this case, the effective potential between two nucleons, spaced at the points
and
of the top and bottom plates, arising due to exchange of two axions, takes the form [
53,
55]
Here,
is the dimensionless coupling constant of an axion to a nucleon (we assume that the coupling constants to a neutron and a proton are equal [
53]), the mean of the proton and neutron masses is denoted as
m, the axion mass is
,
is the modified Bessel function of the second kind and it is assumed that
.
Similar to the case of a Yukawa-type potential, the additional pressure between the test bodies due to two-axion exchange, is obtained by integrating Equation (
30) over the volumes of both plates with account of their layer structure, calculate the negative derivative of the obtained result with respect to
a and finally find the pressure (see [
15] for details)
Here,
is the mass of atomic hydrogen and the function
is defined as
where the coefficients
C for each material are given by
In Equation (
33),
is the density of the respective material already indicated above,
Z and
N are the number of protons and the mean number of neutrons in the atoms of materials with mean mass
M and
. For materials under consideration
, 0.46518, 0.50238, and 0.503205 for Au, Cr, Si, and SiO
, respectively [
8]. The values of
are 0.60378, 0.54379, 0.50628, and 0.505179 for the same respective materials [
8].
The gradient of the pressure due to two-axion exchange is obtained from Equation (
31) by the differentiation with respect to
aThe constraints on the parameters of axion-nucleon interaction,
,
, are obtained from the inequalities
similar to Equation (
29).
To find the strongest prospective constraints on the axion-to-nucleon interaction, we have used the same parameters of the improved setup, as listed above when considering the Yukawa interaction. The computational results for
as a function of
obtained by using the first and second inequalities in Equation (
35) are shown by the red lines in
Figure 6a,b, respectively. Again, the space above each line is excluded (will be excluded, in the case of the red line) by the results of the respective experiment.
The strongest laboratory constraints obtained up to date [
17] at the same region of axion masses are shown by the blue line. They follow from the improved Casimir-less experiment [
47]. For comparison purposes, the constraints following from the Cavendish-type experiment [
56,
57], measurements of the effective Casimir pressure [
15,
58,
59], and the lateral Casimir force between corrugated surfaces [
16,
60,
61] are shown by the gray, black, and green lines, respectively. As can be seen in
Figure 6a,b, the stronger constraints again follow from measuring the Casimir pressure. According to
Figure 6a, the
Cannex test of the quantum vacuum could strengthen the presently known constraints by up to a factor of 3.