1. Introduction
The general framework of perturbation theory consists of the construction of some distribution-valued operators called chronological products [
1]. We prefer the framework from [
2]: for every set of Wick monomials
acting in some Fock space
one associates the distribution-valued operator
such that a set of axioms, essentially proposed by Bogoliubov, are verified. The modern construction of the chronological products can be done recursively according to the Epstein–Glaser prescription [
3,
4,
5,
6,
7] (which reduces the induction procedure to a distribution splitting of some distributions with causal support) or according to the Stora prescription [
8] (which reduces the renormalization procedure to the process of extension of distributions). These products are not uniquely defined but there are some natural limitations on the arbitrariness. If the arbitrariness does not grow with
n (the order of perturbation theory) we have a renormalizable theory. An equivalent point of view uses retarded products [
9].
In this paper we will use the causal approach to perturbative quantum field theory: this in fact the Epstein–Glaser prescription adapted to gauge models by G. Scharf and collaborators. The literature is quite extensive: a number of books had appeared [
5,
6,
10,
11] and also some review papers [
12,
13,
14,
15].
The description of higher spins in perturbation theory can be problematic. If we describe them by fields carrying only physical degrees of freedom, then the theories are usually not renormalizable. However, one can save renormalizability using ghost fields. Such theories are defined in a Fock space
with indefinite metric, generated by physical and un-physical fields (called ghost fields). One selects the physical states assuming the existence of an operator
Q called gauge charge which verifies
and such that the physical Hilbert space is by definition
The fact that two distinct mathematical states from
can be associated to the same physical context is called gauge freedom and the corresponding theories are called gauge theories. The graded commutator
of the gauge charge with any operator
A of fixed ghost number
(where
denotes the graded commutator) verifies
so
is a co-chain operator in the space of Wick polynomials.
A gauge theory assumes also that there exists a Wick polynomial of null ghost number
called the interaction Lagrangian such that
for some other Wick polynomials
This relation means that the expression
T leaves invariant the physical states, at least in the adiabatic limit. Indeed, we have:
up to terms which can be made as small as desired (making the test function
f flatter and flatter, i.e.,
f is approaching 1 in the distributional sense). In all known models one finds out that there exists a chain of Wick polynomials
such that:
where the brackets express completely antisymmetric in all indexes; it follows that the chain of relations stops after a finite number of steps. We can also use a compact notation
where
I is a collection of indexes
and one can write compactly the relations (
5) as follows:
All these polynomials have the same canonical dimension
and the ghost number:
If the interaction Lagrangian T is Lorentz invariant, then one can prove that the expressions can be taken Lorentz covariant.
Now we can construct the chronological products
according to the recursive procedure. We say that the theory is gauge invariant in all orders of perturbation theory if the following set of identities generalizing (
6):
are true for all
and all
Here we have defined
Such identities can be usually broken by anomalies, i.e., expressions of the type
which are quasi-local and might appear in the right-hand side of the relation (
10). These anomalies are constrained by some identities (the Wess–Zumino relations). It is still an unsolved problem to prove that the anomalies can be eliminated by convenient redefinitions of the chronological products. This problem is not resolved satisfactorly in other approaches also, at least in our opinion.
One idea to eliminate the anomalies is to try to find new restrictions verified by them beside the Wess–Zumino relations. One possibility could be the anti-BRST transform introduced in [
16]. We will prove in this paper a negative result, namely that up to the second order of perturbation theory, the anti-BRST transform does not produce new constraints on the model; that is if we impose BRST symmetry up to the second order, we automatically have anti-BRST up to the second order.
We should mention that the literature is rather extensive on the BRST transform in the functional formalism, based on the master equation. There are some differences between the two approaches and there is no proof that they are equivalent. We only mention [
17] where a result of the same nature as ours is proved, namely that BRST and anti-BRST transforms are connected by the Hodge map. For the treatment of anomalies in the functional formalism we cite [
18,
19].
In the next section, we will briefly present the Yang–Mills model in our preferred compact notations. In
Section 3, we prove our main result.
2. Yang–Mills Models in the Causal Formalism
We give some results from [
20].
2.1. Massless Particles of Spin 1 (Photons)
We consider a vector space
of Fock type generated (in the sense of Borchers theorem) by the vector field
(with Bose statistics) and the scalar fields
(with Fermi statistics). The Fermi fields are usually called ghost fields. We suppose that all these (quantum) fields are of null mass. Let
be the vacuum state in
In this vector space we can define a sesquilinear form
in the following way: The (non-zero) 2-point functions are by definition:
and the
n-point functions are generated according to Wick theorem. Here
is the Minkowski metrics (with diagonal
) and
is the positive frequency part of the Pauli–Jordan distribution
of null mass. To extend the sesquilinear form to
we define the conjugation by
Now we can define in
the operator
Q according to the following formulas:
where by
we mean the graded commutator. One can prove that
Q is well defined: basically it leaves invariant the causal commutation relations. The usefulness of this construction follows from:
Theorem 1. The operator Q verifies The factor space is isomorphic to the Fock space of particles of zero mass and helicity 1 (photons).
2.2. Massive Particles of Spin 1 (Heavy Bosons)
We repeat the whole argument for the case of massive photons, i.e., particles of spin 1 and positive mass.
We consider a vector space
of Fock type generated by the vector field
the scalar field
(with Bose statistics) and the scalar fields
(with Fermi statistics). We suppose that all these (quantum) fields are of mass
In this vector space we can define a sesquilinear form
in the following way: The (non-zero) 2-point functions are by definition:
and the
n-point functions are generated according to Wick theorem. Here
is the positive frequency part of the Pauli–Jordan distribution
of mass
m. To extend the sesquilinear form to
we define the conjugation by
Now we can define in
the operator
Q according to the following formulas:
One can prove that Q is well defined. We have a result similar to the first theorem of this section:
Theorem 2. The operator Q verifies The factor space is isomorphic to the Fock space of particles of mass m and spin 1 (massive photons).
2.3. The Generic Yang–Mills Case
The situations described above (of massless and massive photons) are susceptible of the following generalizations. We can consider a system of particles of null mass and helicity 1 using the idea of the first subsection for triplets of massless fields; here is a set of indexes. All the relations have to be modified by appending an index a to all these fields.
In the massive case we have to consider quadruples of fields of mass and use the second subsection; here is another set of indexes.
We can consider now the most general case involving fields of spin not greater that 1. We take a set of indexes and for any index we take a quadruple of fields with the following conventions: (a) For we impose and we take the masses to be null (b) For we take the all the masses strictly positive: (c) For we take to be null and the fields of mass The fields are called Higgs fields.
If we define
then we can define in
the operator
Q according to the following formulas for all indexes
If we consider matter fields also, i.e., some set of Dirac fields with Fermi statistics:
then we impose
2.4. The Yang–Mills Interaction
In the framework and notations from the end of the preceding subsection we have the following result which describes the most general form of the Yang–Mills interaction [
21,
22,
23]. Summation over the dummy indexes is used everywhere.
Theorem 3. Let T be a relative cocycle for , i.e., verifies (3) and also: (1) is tri-linear in the fields; (2) is of canonical dimension (3) has ghost number Then: (i) T is (relatively) cohomologous to a non-trivial co-cycle of the form:where we can take the constants if one of the indexes is in also if or one of the indexes a and b are from and if if Moreover we have: (a) The constants are completely antisymmetric (b) The expressions are antisymmetric in the indexes a and b:and are connected to by: (c) The (completely symmetric) expressions verify (d) the expressions and are bilinear in the Fermi matter fields: in tensor notations;where for every we have defined the chiral projectors of the algebra of Dirac matrices and are matrices. If M is the mass matrix then we must have (ii) The relation is verified by: (iii) The relation is verified by: Now if we impose gauge invariance in the second order of perturbation theory, i.e., (
10) for
we get new constraints on the constants from the interaction Lagrangian (for an extensive treatment see [
20,
21,
22,
23]):
(Jacobi identity)
(the representation property for the Bose sector)
(the representation property for the Fermi sector)
(a tensor representation property for the expressions
).
This result was obtained independently in [
22,
23] and [
24,
25], respectively. It gives a new perspective on Higgs mechanism; some new references on this subject are [
26,
27,
28].
2.5. Causal Perturbation Theory
We give the idea of the construction of the chronological products only in the second order of perturbation theory, relevant for our result. The basic construction of Epstein and Glaser is the construction of the causal commutator. More details can be found in [
20]. We mention the basic procedure used in the causal approach. If we want to construct the chronological product
for arbitrary Wick monomials
A and
B, the idea of Epstein and Glaser is to consider the (graded) commutator
It is easy to see that the tree contribution to this commutator is of the form:
where
are polynomials in the partial derivatives and
are Wick polynomials. Then one can obtain the associated chronological products if one makes in the preceding formula the substitution
i.e., if we replace the Pauli–Villars causal distribution with the Feynman propagator:
In this way we fulfill Bogoliubov axioms in the second order for the tree contributions. However, in this way we can produce anomalies in the relation (
10). The reason is that for the gauge invariance of the causal commutators—e.g., relation (
55) from the next section - we need the Klein–Gordon equation for the Pauli–Villars distribution
but for the gauge invariance of the chronological products we must use
In some cases it is possible to eliminate these anomalies if we use finite renormalizations
where
N are finite renormalizations; they must be quasi-local expressions i. e they are of the form
with
Wick polynomials. This program can be extended, in principle to loop contributions and to higher orders of perturbation theory, but a complete analysis is not available now.
3. Anti-BRST Transform
In
Section 2, we have introduced the BRST transform; we prefer to rewrite it using the (graded) commutator
:
These expressions are the linear part of the non-linear BRST transform from the classical field theory. We proceed in the same spirit with the anti-BRST transform from [
16] and obtain the operator
(associated to the anti-BRST transform
and given by:
and
As in the case of the BRST transform (
37), we immediately check that
Now we want to investigate if a theorem similar to 3 is valid for the anti-BRST transform. Indeed we have:
Theorem 4. Suppose that the conditions from theorem 3 are fulfilled. Then:
(i) The relation is verified for (ii) The relation is verified by: Proof. It is elementary, by direct computations. We must cleverly use all the linear relations between the constants of T derived in Theorem 3. □
We remark that the expression
can be obtained from
if we make the transformation
which also preserves the canonical anti-commutation relations and maps
Q in
however this is not true for
The preceding result shows that, as in [
16], we get nothing new from anti-BRST in the first order of perturbation theory. We investigate now if the same is true for the second order of perturbation theory. We are interested to impose gauge invariance for the anti-BRST transform, i.e., a relation of the same type as (
10) for the chronological products
Such relations should be of the form:
The hope would be to obtain new constraints on the possible anomalies. We have succeeded to study the tree contribution in the second order of perturbation theory, i.e., the preceding relation for and tree contributions. We have obtained the following result.
Theorem 5. We can impose the relationfor the tree components of the chronological products if we perform the following finite renormalizations:with the explicit expressions:withandwithOnly the linear and bilinear constraints from the end of the Section 2.4 are needed for this result. The finite renormalization (53) is identical to the finite renormalization needed for the usual gauge invariance of the tree contributions in the second order of perturbation theory. Proof. (i) We start with the case
of identity (
46). We start from the identity
and do the substitution (
34) → (
35). We need to collect all the terms from the commutator
containing the factor
The identity (
55) holds if we use the Klein–Gordon equation for the Pauli–Villars distribution
However, when we make the substitution (
34) → (
35) we must use
and an anomaly appears. For instance, if we consider the first term of
—see the expression (
42)—we have
and after the substitution (
34) → (
35) we obtain the anomaly
We have another term from the fourth term of
and it is convenient to exhibit the end result in the form:
After some work, using Jacobi identity, we obtain that
where
is the expression from the statement. This anomaly can be eliminated by performing the finite renormalization (
49).
(ii) Now we consider the identity
Now we need to collect all the terms from the two commutators
and
containing the factor
afterwards we make the substitution (
34) → (
35) and obtain the corresponding anomaly. The computation leads, after using Jacobi identity, to the anomaly
where
with
the expression from the statement. This anomaly can be eliminated by performing the finite renormalization (
50).
(iii) The next step is the identity
There are a lot of terms with the factor
contributing to the anomaly. There is another subtlety: in the right hand side we have to use in fact the chronological products renormalized according to (
49) and (
50). This brings additional contributions to the anomaly:
We must use all the bilinear identities (
29)–(
32) from the end of
Section 2 and end up with an anomaly of the type
where the expression for
is rather complicated. We try to equate it with an expression of the type
where
generic form (
51). We end up with a system of four equations: from three of them the expressions of
from the statement are derived and the fourth is an identity if we use the bilinear identities (
29)–(
32). So, this anomaly can, again, be eliminated by a finite renormalization (
51).
(iv) This repetitive process ends with the consideration of the identity:
As before, we select the terms from the commutators with the factor
They will produce a piece of the anomaly after the substitution (
34) → (
35). There is another piece coming from the finite renormalization (
51):
If we use the bilinear identities (
29)–(
32) from the end of
Section 2 and end up with an anomaly of the type
where the expression for
d is complicated. We try to equate it with an expression of the type
where
N generic form (
53). We end up with a system of four equations: from three of them the expressions of
from the statement are derived and the fourth is an identity if we use the bilinear identities (
29)–(
32). So, this anomaly can, again, be eliminated by a finite renormalization (
53). □
We end with an explanation of the fact that we obtain nothing new from the anti-BRST transform. In fact, anti-BRST is equivalent to BRST transform. One can see this if replaces the interaction Lagrangian
T from (
20) by an equivalent Lagrangian, i.e., an expression
differing from
T by a co-boundary. In fact, we can rewrite the second term from (
20) in the following way:
where we omit the explicit expressions
B and
It is known that we can discard the co-boundary
without modifying the values of scattering matrix taken in the subspace of the physical states [
29]. We can prove that the new interaction Lagrangian obtained in this way is anti-symmetric with respect to the transform (
44). This anti-symmetry property can be extended to higher order chronological products, so if we have gauge invariance with respect to the BRST transform, then we must have gauge invariance with respect to the anti-BRST transform also.