1. Introduction
In 1865, Maxwell presented his treatise on the theory of electromagnetism (EM) [
1]. However, what we generally denote as Maxwell’s equations (the four equations with divergence and curl) have never been written by Maxwell [
1]. In fact, by using Hamiliton’s quaternions, in his treatise, Maxwell summarized the theory in a set of ten equations. At the beginning of 1900, by using symbolic vector calculus, Heaviside [
2], along with Gibbs [
3] and Helmholtz, reduced the system of ten equations in the well-known system of four equations as reported in modern books dealing with (EM) theory.
However, the scientific progress in the last century has proposed some considerable novelties. Around 1928, Wolfgang Pauli has introduced the representation of a vector in terms of a two-by-two matrix, thus allowing novel possible operations as, e.g., taking the inverse of a vector. In the same years, completely independently, Paul A. M. Dirac has introduced the Dirac matrices in order to express the Dirac equation [
4,
5]. In addition, in the last century, the contributions of Hamilton, Grassmann, Clifford and others have been clarified and better understood [
6].
In fact, it has been recognized that both Pauli and Dirac algebras can be related to the Clifford algebra [
7], often referred to also as Geometric Algebra (GA). The description of the space in terms of GA is considerably more satisfying than traditional vector analysis. In fact, in vector analysis, there are only scalars (points) and vectors. In addition, vector analysis does not represent an algebra and therefore a plethora of different rules have to be followed for performing various tasks. Conventional vector analysis only holds true in the three-dimensional space, while GA can be used in whatever number of dimensions. In particular, according to GA, elements in a three–dimensional space can be represented by using a multivector, which is the sum of a scalar (i.e., a point), a vector (i.e., an oriented line), a bivector (i.e., an oriented surface) and a trivector or pseudoscalar (i.e., a volume element). In three dimensions, it is possible to multiply multivectors as simply as we can multiply two-by-two matrices. Such 3D multivectors can be represented by two-by-two Pauli matrices. According to these considerations, in recent years, the interest on the application of geometric algebra has grown significantly and many excellent books have appeared [
8,
9,
10,
11,
12,
13,
14].
Recently, a plethora of GA applications has arisen [
15,
16], spreading over different disciplines, such as signal and image processing [
17], artificial intelligence [
18], robotics [
19] and molecular geometry [
20]. Its application potential is not surprising. Indeed, whenever Euclidean transformations are present, the issue can be expressed in terms of GA. In particular, with regard to EM problems, an excellent introduction to GA and the related advantages has been provided in [
21]. It is shown that Maxwell equations can be grouped into a single equation [
6,
22] or can be expressed in a compact form similar to the Dirac equation for null mass [
23].
In this paper, assuming that the reader is familiar with GA, the advantages related to the use of Pauli matrices are highlighted for engineering purposes. While Clifford algebra or GA are very nice for a theoretical framework, in practical engineering computations, the use of Pauli matrices provides a significant advantage. In fact, they are very suitable for introduction in a Computer Algebra System (CAS). As a matter of fact, modern CAS allow for developing the required vector algebra entirely at the computer level, so that tedious computations can be avoided. In this paper, the Pauli matrices in cylindrical and spherical coordinates are reported for the first time and their use for representing a three-dimensional vector is discussed. They provide a representation which is independent of the coordinate system, which does not exist in the conventional vector approach. Additionally, the Pauli matrix representation of the nabla operator ∇ is introduced and discussed.
The paper is structured as follows: first, the multivector representation via Pauli vectors is introduced (
Section 2). Next, the Pauli matrices in rectangular coordinates are converted to cylindrical and spherical coordinates in
Section 3. In addition, the ∇ operator can be expressed in rectangular, cylindrical and spherical coordinates by using Pauli matrices as demonstrated in
Section 4. Finally, the results are applied on a dipole as example (
Section 5).
2. Multivector Representation via Pauli Matrices
In this section, first, the conventional representation of vectors is recalled and some drawbacks are highlighted. Next, a Clifford algebra vector representation is recollected to overcome these disadvantages. Finally, the well known Pauli matrices in rectangular coordinates and Pauli vectors are introduced and discussed.
2.1. Conventional Representation of Vectors
Three–dimensional vectors are usually represented as three ordered numbers, e.g.,
. Vectors are generally represented in the rectangular, cylindrical and spherical coordinate systems as
Different conventions exist for representing the coordinates of a spherical coordinate system. In this work, the convention usually applied in physics is used, specified by the ISO standard 80000-2:2019. The right-handed coordinate system
is applied, with
the inclination unit vector (corresponding to the angle with respect to the polar axis) and
referring to the azimuthal angle (
Figure 1).
The representation in (
1), which is based on the use of unit vectors, has several drawbacks:
vectors cannot be multiplied;
it is not possible to find the inverse;
given two vectors and , it is not possible to find the transformation which leads from to ,
the cross product of vectors is misleading; instead a bivector, which is an oriented surface, should be used,
no volume elements are present,
it depends on the coordinate system.
As demonstrated in [
21], the use of a Clifford algebra
, recalled in the next section, allows for overcoming these issues.
2.2. Clifford Algebra of the Three-Dimensional Space
In a Clifford algebra
, a vector is represented as follows:
The basis elements
,
and
form an orthonormal basis for the vector space
. In order to establish a Clifford algebra
,
,
and
have to satisfy the following conditions for
:
By using the correspondence shown in
Table 1, Equation (
2) can be used to refer to all the three coordinate systems.
However, there is also another possible representation in terms of 2 × 2 matrices. Naturally, a 2 × 2 real matrix is defined by four numbers, while a complex one requires eight numbers. It is therefore fairly natural that it is possible to represent a vector via a matrix. However, there are many possible representations, but, among them, the representation introduced by Wolfgang Pauli has several advantages. In the next section, the Pauli matrices in rectangular coordinates will be introduced, and their properties will be discussed. They are related to the Clifford’s algebra, and they satisfy (
3) and (
4). Accordingly, the representation of a vector in terms of Pauli matrices provides the same advantages of a Clifford’s algebra. Some instructive relationships between conventional vector analysis and GA are given in
Appendix A.
2.3. Pauli Matrices in Rectangular Coordinates
In rectangular coordinates, the Pauli matrices have the following form:
Notice that the trace of the Pauli matrices is zero, and their determinant is −1. The square of the Pauli matrices equals the identity matrix
I:
By multiplying, e.g.,
with
, the result is
and similar for the other cases:
The above relations are very important. In fact, they show that, in the three-dimensional case, it is always possible to replace the quantities
with the orthogonal vector
(with the appropriate combination given in (
7)). An equivalent property is also present in the Clifford algebra. Let us first note that, for the trivector
, it applies that:
and therefore
. In fact, if we consider the bivectors
and multiply them by
, we have
From the above properties, it is seen that, similarly for the Clifford basis, we have
i.e.,
The three Pauli matrices and the identity matrix
constitute a basis in the space of two-by-two Hermitian matrices. A matrix
A can be written as:
It is worthwhile to note that, when the coefficients () are complex, non Hermitian matrices can also be described by the basis of ().
2.4. The Pauli Vector
2.4.1. Definition
The Pauli vector is a vector constructed by using the three matrices (
) as follows:
with
,
and
orthonormal basis unit vectors.
For a vector
in the three-dimensional space, the matrix
is defined as the following product:
The vector can be expressed as a two-by-two matrix . The standard vector representation and the matrix represent the same quantity, and it is always possible to pass from one to the other.
The inner product of two matrices
, not necessarily representing vectors, is defined as
From the properties of the Pauli matrices, it can be easily verified that:
Referring to (
12), this provides a simple way to retrieve the coefficients of the Pauli matrices for a given matrix
A:
and similarly for the other components.
2.4.2. Space Description
It is noted that elements in three-dimensional space are described by eight numbers (i.e., a complex 2 × 2 matrix). In particular, they are
one scalar (): grade 0;
3 basis vectors () corresponding to three directions: grade 1;
3 basis bivectors (): grade 2;
one pseudoscalar (): grade 3.
All these elements are contained in a matrix and, similarly to what we do for complex numbers, they can be written together in a
multivector as
where
is a scalar,
is a vector,
is a bivector and
is a pseudoscalar.
We have started this section showing that a vector can be represented by a Pauli matrix. It is now possible to conclude that, in the three–dimensional space, a Pauli matrix not only can represent a vector, but it can encode all the information of the eight-dimensional base of a multivector! In other words, a multivector can be represented as a Pauli matrix, which will be illustrated next. Note that, since matrix algebra is well-known, we can also multiply, take the inverse, etc. of multivectors with ease.
2.4.3. Pauli Matrix Representation of a Multivector
Let us see with more details the Pauli matrix representation of a multivector. The corresponding matrices of the multivector in (
17) are given next:
The matrices in (
18) can be summed together giving, for the multivector
,
For a given Pauli matrix, it is possible to retrieve the elements of the different grades as described next.
2.4.4. Retrieving the Elements of a Multivector
Let us assume that the matrix in (
19) is given, and we want to retrieve the various elements. It is convenient to extract the real and imaginary part of
as
By inspection, it is seen that we have the following identities that express the different elements of the multivector:
3. Pauli Matrices in Different Coordinate Systems
The Pauli matrices in rectangular coordinates and their properties have been introduced in
Section 2. In this section, their expressions in cylindrical and spherical coordinates are derived by two different methods: first by applying the transformation equations, and next by rotations.
3.1. Pauli Matrices in Cylindrical Coordinates
For the rectangular coordinate system, the Pauli matrices are , with . When expressing a vector, it is possible to identify with , with and with .
Consider the unit vectors
of a cylindrical coordinate system. By using the equations for transforming the rectangular coordinates into the cylindrical ones, it is possible to write for the radial unit vector:
resulting in the corresponding Pauli matrix:
Similarly, it applies that:
from which the following Pauli matrix follows:
Naturally, for the vertical z component, the relation is that .
While in rectangular coordinates the Pauli matrices are not dependent on the coordinate, for the cylindrical coordinates, it is noted that
and
are dependent on
. For the derivatives with respect to
, it applies that, with the abbreviated notation
,
It is readily proved that the matrices
when multiplied by themselves give the identity matrix
, their trace is null, their determinant is always
and their dot product is zero if they are not the same. In addition, we have
Therefore, the generic vector
can be expressed in cylindrical coordinates in terms of Pauli matrices as
By performing the inner product of the above expression with a selected sigma matrix, it is possible to recover the desired component in terms of the other basis. Thus, all the transformation between vectors in different coordinate systems can be simply obtained by matrix multiplication and trace operation. As an example, the expression of
in terms of the rectangular components can be calculated by performing the inner product of both sides of (
28) times
:
where the inner product corresponds to making the matrix product and taking half of the trace.
3.2. Pauli Matrices in Spherical Coordinates
The procedure to derive the Pauli matrices in spherical coordinates is the same adopted for the cylindrical coordinate system. Consider the unit vectors
of a spherical coordinate system. By using the equations for transforming the rectangular coordinates into the spherical one, it is possible to write for the radial unit vector:
resulting in the corresponding Pauli matrix:
Similarly, for the unit vector along
:
resulting in the Pauli matrix:
Not surprisingly for the coordinate, the result is the same as in cylindrical coordinates.
It is readily proved that the matrices
, when multiplied by themselves, give the identity matrix
, their trace is null, their determinant is always
and their dot product is zero if they are not the same. In addition, the following relations can be derived:
Therefore, the generic vector
can be expressed in spherical coordinates in terms of Pauli matrices as
As for cylindrical coordinates, by performing the inner product of the above expression with a selected sigma matrix, it is possible to recover the desired field component in terms of the other basis. The Pauli matrices in rectangular, cylindrical and spherical coordinates are summarized in
Table 2.
3.3. Obtaining Pauli Matrix in Cylindrical and Spherical Coordinates by Rotations
The Pauli matrices in cylindrical and spherical coordinates can be obtained using a more efficient method. It can be noted that the right-handed triplet of coordinate unit vectors
,
,
of a cylindrical coordinate system can be obtained by rotating the triplet of coordinate unit vectors of a Cartesian system
,
,
by an angle
about the
z-axis. This rotation is represented by the matrix
Hence, the Pauli matrices that represent the coordinate unit vectors of the cylindrical system can be expressed as
where † denotes conjugate transpose.
In a similar way, it can be observed that the right-handed triplet of coordinate unit vectors
,
,
of a spherical system can be obtained from coordinate unit vectors of a Cartesian system by combining two rotations: a first rotation by an angle
about the
y-axis and a second rotation by an angle
about the
z-axis. The overall rotation is represented by the matrix
Hence, the Pauli matrices for a spherical system can be derived as
3.4. Transformation of a Vector from One Coordinate System to Another
Let us consider a vector
and its expressions in terms of its Cartesian, cylindrical, and spherical components
In terms of Pauli matrices, the vector
is represented by a matrix
given by
An overview of the vector
expressed in terms of rectangular, cylindrical and spherical Pauli matrices can be found in
Table 3.
By means of (
37) and (
39),
can also be expressed as
where the matrices
and
which can be obtained from
by means of the inverse rotations of
and
, respectively, representing two vectors whose Cartesian components coincide with the cylindrical or spherical components of
.
3.5. An Example
Let us consider the vector
with components
evaluated at the position
. In rectangular coordinates, vector
represented as a Pauli matrices is
The position
P corresponds the following angles in degrees:
At this point, the Pauli matrices are in cylindrical coordinates:
and in spherical coordinates:
The components in cylindrical coordinates are:
When we perform
we obtain the same matrix as before, i.e.,
, showing that the vector is independent from the coordinate system. Similarly, the components in spherical coordinates are:
and again the vector is given by
It is noted that such representation of a vector, independent from a coordinate system, does not exist in the conventional approach.
5. Dipole Example
As an example, the electric and magnetic field of a Hertzian dipole in spherical coordinates is calculated from the magnetic vector potential in rectangular coordinates using Pauli matrices.
In this section, the symbol j with is used as is appropriate when applying conventional time-harmonic analysis. In this way, any possible geometric interpretation is avoided, since the symbol i has a geometric meaning in GA.
Consider an ideal or Hertzian dipole symmetrically along the
z-axis of
Figure 1, with length
(much smaller than the wavelength) and a uniformly distributed current
I along its length. Since the current only has a
z-component, the magnetic vector potential
in the point
will only have a
z-component as well, given by:
with
the permeability of the medium and
k the wavenumber.
We have seen that, by using Pauli matrices, the representation of a vector is independent of the coordinate system. As a result, we can apply (
85), which is given in rectangular coordinates, to calculate the field in spherical coordinates. Referring to (
14), the magnetic vector potential in Pauli representation is given by:
The Pauli matrix of the magnetic field
can be calculated by [
21]
with
the medium impedance and
v the light velocity in the medium. The quantity
is readily evaluated as
Since the Pauli representation is independent of the coordinate system, the magnetic field can be extracted easily in any coordinate system. Grade extraction in spherical coordinates provides the following expression for the magnetic field
The Pauli matrices of the magnetic and the electric field are connected via [
21]:
Evaluating the latter equation and performing grade extraction in spherical coordinates results in the components of the electric field
:
By using the multivector concept, the field can be retrieved in a more direct way. Start from (
86) and evaluate its ∇ operator obtaining
From the latter expression, the divergence is readily evaluated and therefore also the scalar potential
It is now possible to form the multivector potential as
and apply the operator
on the multivector potential
to obtain the field
thus recovering the field in (
89) and (
91) simply from grade extraction.
6. Conclusions
EM computational calculations in practical engineering problems are still based on Gibbs’ vector algebra. Applying GA presents an alternative which has several advantages, among others the ability to use simple matrix multiplications. In three dimensions, GA is isomorphic to Pauli algebra, allowing for expressing the necessary vector operations in EM calculations with Pauli matrices.
To date, GA has been used by physicists to solve EM problems and not by engineers. Indeed, in engineering, time-harmonic analysis in EM is widely applied, whereas no relevant publications discuss the benefits of GA in the time-harmonic regime. For example, in network circuit theory, two-ports are fully described by two-by-two matrices. In an analogous way, EM problems could be expressed by two-by-two matrices.
In this work, a unified representation of 3D multivectors with Pauli algebra was presented: by use of the correspondences in
Table 1 and
Table 2, the expression (
2) can be used to refer to a vector in rectangular, cylindrical and spherical coordinate systems. Applying Pauli matrices results in a representation that is independent of the coordinate system. An overview of the Pauli matrices and the ∇ expressions in different coordinate systems can be found in
Table 2,
Table 3 and
Table 9.
By using Pauli matrices, we have a set of three matrices satisfying the rules of GA. When a higher number of dimensions is required (4, 5), one can use Dirac matrices, which can be obtained by the Kronecker product between Pauli matrices. The introduction of Pauli matrices in different coordinates systems also allows for introducing different sets of Dirac matrices. The investigation for such composite Dirac matrices and relative quadrivectors is left for future work.