Hermitian and Unitary Almost-Companion Matrices of Polynomials on Demand

Markovich, Liubov A.; Migliore, Agostino; Messina, Antonino

doi:10.3390/e25020309

by Liubov A. Markovich ^{1,2,3,4,*}, Agostino Migliore ^{5} and Antonino Messina ^{6}

^{1} Instituut-Lorentz, Universiteit Leiden, P.O. Box 9506, 2300 RA Leiden, The Netherlands

^{2} QuTech and Kavli Institute of Nanoscience, Delft University of Technology, 2628 CJ Delft, The Netherlands

^{3} Institute for Information Transmission Problems, Bol. Karetny Per. 19, 127051 Moscow, Russia

^{4} Russian Quantum Center, Skolkovo, 143025 Moscow, Russia

^{5} Department of Chemical Sciences, University of Padova, Via Marzolo 1, 35131 Padova, Italy

^{6} Dipartimento di Matematica ed Informatica dell’Università di Palermo, Via Archirafi 34, 90123 Palermo, Italy

^{*} Author to whom correspondence should be addressed.

Entropy 2023, 25(2), 309; https://doi.org/10.3390/e25020309

Submission received: 17 January 2023 / Revised: 31 January 2023 / Accepted: 1 February 2023 / Published: 8 February 2023

(This article belongs to the Special Issue Quantum Mechanics and Its Foundations III)

Abstract:

We introduce the concept of the almost-companion matrix (ACM) by relaxing the non-derogatory property of the standard companion matrix (CM). That is, we define an ACM as a matrix whose characteristic polynomial coincides with a given monic and generally complex polynomial. The greater flexibility inherent in the ACM concept, compared to CM, allows the construction of ACMs that have convenient matrix structures satisfying desired additional conditions, compatibly with specific properties of the polynomial coefficients. We demonstrate the construction of Hermitian and unitary ACMs starting from appropriate third-degree polynomials, with implications for their use in physical-mathematical problems, such as the parameterization of the Hamiltonian, density, or evolution matrix of a qutrit. We show that the ACM provides a means of identifying the properties of a given polynomial and finding its roots. For example, we describe the ACM-based solution of cubic complex algebraic equations without resorting to the use of the Cardano-Dal Ferro formulas. We also show the necessary and sufficient conditions on the coefficients of a polynomial for it to represent the characteristic polynomial of a unitary ACM. The presented approach can be generalized to complex polynomials of higher degrees.

Keywords:

companion matrix; almost-companion matrix; hermitian matrix; unitary matrix; complex polynomial; density matrix; sub-parameterization

1. Introduction

Given a complex and monic polynomial

P_{n} (z)

(see Comment A1), it is always possible to define a matrix with a specified arrangement of the polynomial coefficients as its entries, such that

P_{n} (z)

coincides with the characteristic polynomial of the matrix. The set

S (P_{n} (z))

of all these

n \times n

matrices with complex entries, sharing the same characteristic polynomial

P_{n} (z)

, is infinite and can include both derogatory and non-derogatory matrices (see Comment A2). In fact, we observe that, by definition, the Frobenius Matrix [1] of the shared characteristic polynomial always belongs to

S (P_{n} (z))

. A remarkable property of this matrix, which stems directly from its construction, is the coincidence between its characteristic and minimal polynomials, whatever

P_{n} (z)

. Therefore this matrix is classified as non-derogatory and, following Horn and Johnson [2], it is known as the Companion Matrix (CM) of its characteristic or minimal polynomial (see Comment A3).

Henceforth, we refer to the Frobenius Matrix as the Frobenius Companion matrix (

F C M

) of

P_{n} (z)

. When the algebraic multiplicity of each of the n eigenvalues of the

F C M

is 1, any matrix in

S (P_{n} (z))

is non-derogatory, since these n distinct eigenvalues are all necessarily roots of its minimal polynomial, which therefore has degree n [2]. We also remark that this condition guarantees that each matrix in the

S (P_{n} (z))

set is diagonalizable [2] and that, consequently, all matrices

\in S (P_{n} (z))

can be generated from the

F C M

\in S (P_{n} (z))

by means of a similarity transformation. In this way, by definition and under the conditions established for the spectrum

σ (F C M)

of

F C M

,

S (P_{n} (z))

includes all and only the companion matrices of

P_{n} (z)

.

When, instead, the distinct roots of the common characteristic polynomial are

p < n

,

S (P_{n} (z))

includes infinitely many derogatory matrices, which cannot be structurally similar to the Frobenius matrix (see Comment A4) or to any non-derogatory matrix belonging to

S (P_{n} (z))

. For example, consider that the set

S (P_{n} (z))

contains the diagonal matrices whose n entries are nothing but the p distinct roots of the characteristic polynomial repeated as many times as their multiplicities, whose sum is n. The degree of their characteristic polynomial is n, while the degree of their minimal polynomial is

p < n

[2]. Therefore, these matrices are derogatory, as are the infinitely many matrices generated from them by similarity.
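
The distinction drawn above can be checked numerically. The following is a minimal sketch, assuming NumPy and a hypothetical example polynomial (z - 1)^2 (z - 2) that does not appear in the paper: a diagonal matrix and a companion-form matrix share this characteristic polynomial, but only the diagonal one is annihilated by the lower-degree polynomial (z - 1)(z - 2).

```python
import numpy as np

# Hypothetical example: P(z) = (z - 1)^2 (z - 2) = z^3 - 4 z^2 + 5 z - 2
coeffs = np.array([1.0, -4.0, 5.0, -2.0])

# Diagonal matrix: the distinct roots repeated according to their multiplicities
D = np.diag([1.0, 1.0, 2.0])

# Companion-form matrix of the same polynomial (last column: 2, -5, 4)
F = np.array([[0.0, 0.0, 2.0],
              [1.0, 0.0, -5.0],
              [0.0, 1.0, 4.0]])

def reduced_poly(M):
    """Evaluate (M - 1 I)(M - 2 I), the product over the distinct roots only."""
    I = np.eye(3)
    return (M - I) @ (M - 2.0 * I)

# Both matrices have the same characteristic polynomial ...
same_char_poly = (np.allclose(np.poly(D), coeffs, atol=1e-6)
                  and np.allclose(np.poly(F), coeffs, atol=1e-6))

# ... but only D is annihilated by the degree-2 polynomial (derogatory case)
D_is_derogatory = np.allclose(reduced_poly(D), 0.0)
F_is_derogatory = np.allclose(reduced_poly(F), 0.0)
```

Since D and F have minimal polynomials of different degrees, no similarity transformation can map one into the other, although both belong to the same set of matrices sharing the characteristic polynomial.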

The Frobenius matrix dates back to 1879 and was given in the form [1]:

\begin{matrix} C_{n} = (\begin{matrix} 0 & 0 & \dots & 0 & - c_{n} \\ 1 & 0 & \dots & 0 & - c_{n - 1} \\ 0 & 1 & \dots & 0 & - c_{n - 2} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & 1 & - c_{1} \end{matrix}) \end{matrix}

(1)

by the German mathematician Ferdinand Georg Frobenius [3]. Sometimes, in the literature it is presented in three other unitarily transformed forms, all parameterized in terms of the n coefficients of the polynomial

P_{n} (z)

and with the same number of entries equal to 0 or 1 as in (1) [4]. When the n eigenvalues of the Frobenius matrix are distinct, the matrix that brings (1) to diagonal form by similarity is the Vandermonde matrix of its n eigenvalues. This property, as well as other properties and some applications of the Frobenius matrix, may be found in [4].
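
As an illustration, the sketch below builds a Frobenius-type matrix with NumPy and checks two of the properties mentioned above. It assumes the coefficient convention z^n + c_1 z^(n-1) + ... + c_n, so that the last column reads -c_n, ..., -c_1 from top to bottom; the function name and the sample coefficients are hypothetical.

```python
import numpy as np

def frobenius_companion(c):
    """Companion-form matrix for z^n + c[0] z^(n-1) + ... + c[n-1].

    Assumed convention: ones on the subdiagonal, last column -c_n, ..., -c_1.
    """
    c = np.asarray(c, dtype=complex)
    n = c.size
    C = np.zeros((n, n), dtype=complex)
    C[1:, :-1] = np.eye(n - 1)   # ones on the subdiagonal
    C[:, -1] = -c[::-1]          # last column, top to bottom: -c_n, ..., -c_1
    return C

coeffs = [2.0 - 1.0j, -3.0, 0.5j, 4.0]   # arbitrary illustrative c_1, ..., c_4
C = frobenius_companion(coeffs)

# Characteristic polynomial coefficients [1, c_1, ..., c_n] recovered from C
char_poly = np.poly(C)

# With distinct eigenvalues, the Vandermonde matrix V of the eigenvalues
# satisfies V C = diag(eigs) V, i.e. V C V^{-1} is diagonal.
eigs = np.linalg.eigvals(C)
V = np.vander(eigs, increasing=True)     # rows: (1, lambda, ..., lambda^(n-1))
vandermonde_diagonalizes = np.allclose(V @ C, np.diag(eigs) @ V)
```

The rows of V are left eigenvectors of C, which is why the check is written as V C = diag(eigs) V rather than with an explicit inverse.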

Despite the fact that the first CM was proposed more than 140 years ago, the generation of different CMs (with further properties) has attracted a great deal of applied research activity. CMs emerge naturally in mathematical methods for finding and characterizing the roots of polynomials [5,6,7,8,9,10] and can be applied as well in the determination of solutions of high-order scalar linear differential and difference equations [11]. CMs can give matrix representations of some fields [12] and are widely used in control theory, for example in writing the controllable canonical form associated with the transfer function of a system [13]. The product of CMs is also used in the study of random walks and Markov chains [14].

The original structure of the FCM shows a modest level of flexibility and, in fact, has stimulated the search and the emergence of generalizations leading to new proposals of CMs that pave the way to applications beyond the FCM. To this end, a successful strategy is based on a radical change of the basis in which the characteristic polynomial

P_{n} (z)

of the FCM is represented. By definition,

P_{n} (z)

is monic and is written as the sum of

z^{n}

and a linear combination with coefficients in

C

of

1, z, z^{2}, \dots, z^{n - 1}

. All these

n + 1

powers of z constitute the monomial basis, which can be replaced by other polynomial bases. In this way, one can introduce new (still non-derogatory) CMs with nonvanishing elements on the main, sub, and super diagonals. For example, the Chebyshev basis, independently adopted by Specht [15,16] and Good [17], led to a new CM called the colleague matrix. This approach has been further generalized by Barnett [4], who considered a basis of orthogonal polynomials and named the newly emerged CMs comrade matrices. Subsequently, Barnett proposed to call confederate matrices the CMs arising from the use of a general polynomial basis [18].

The applications dedicated to the classic problem of finding the real zeros of a real coefficient polynomial of arbitrary degree deserve a special mention, because in this context the CMs have inspired a different approach, alongside exquisitely mathematical and computational investigations [19,20,21,22,23,24]. In the last decade, new quantum theory-based root-finding algorithms exploiting the construction of Hermitian companion matrices [25,26,27,28,29,30] have also been proposed, thus increasing the interest in the study of CMs and related matrices in the rapidly growing research field of quantum computing. Furthermore, recent studies [31] have shown the opportunity of using CMs in mathematical constructions useful to the investigation of quantum entanglement, quantum state tomography, and quantum information in general.

The above is the general context for the main question addressed by this paper: is it possible to find a CM of a real or complex polynomial that is also Hermitian or unitary, for example, or that possesses some other prescribed special matrix structure? This question has so far been answered only partially.

In [32] the CM for a

P_{n} (z)

with real coefficients and real zeros is constructed as a real symmetric tridiagonal Hermitian matrix. This provides a complete solution to a problem raised and partly solved by M. Fiedler in [33], which has rekindled the interest in the general structure of CMs. Note that the Frobenius matrix is itself a Fiedler matrix after a reverse permutation matrix similarity. In [34] all CMs are characterized in terms of combinatorial structure to generate new CMs. It is interesting to note that both the Frobenius and Fiedler CMs are sparse matrices, as they have

2 n - 1

nonzero elements [35]. A new class of sparse CMs, also known as intercyclic CMs, was introduced in [34] and includes the Fiedler matrices as a special case. In [35] the non-sparse CMs are introduced noting that they are not connected with the sparse ones by a reverse permutation matrix similarity.

To the best of our knowledge, the question of whether the CM of a complex polynomial can be sought as unitary or Hermitian is still open.

1.1. Purpose and Contribution of this Study

CMs generally show relatively limited versatility due to their combinatorial structure. For example, the FCM is never Hermitian or unitary. Searching for CMs that satisfy additional constraints of this kind is important when a given class of parametric polynomials is designed to reach a reliable theoretical control of a quantum physical scenario. Finding exact flexible solutions to well-defined inverse problems of this kind is a target of the present study. We stress that, for application purposes, we often do not need to associate non-derogatory matrices to a given polynomial. To emphasize this particular aspect of our matrix construction, we introduce the term almost-companion matrix (ACM) to refer to matrices that have a given polynomial as their characteristic polynomial but can be derogatory. Clearly, every CM is also an ACM, i.e., the set of all ACMs is a superset of the set of all CMs of a given polynomial.

In short, in this study, we address the following inverse problem: given a real or complex polynomial of any kind (for example, it may belong to a special class of polynomials, which is reflected in some special condition satisfied by its coefficients), we find a parametric ACM (see Comment A5) that satisfies preassigned conditions (e.g., those required to be Hermitian or unitary) and whose characteristic polynomial coincides with the given one. For definiteness, our investigation is here limited to complex polynomials of the third degree (

n = 3

). The extension of the analysis to higher-degree polynomials is discussed.

The solution of the inverse problem outlined above for polynomials of degree

n = 3

is accompanied by some useful applications. The relaxed constraints that characterize an ACM, as compared with a CM, allow the freedom to search, from the outset, for a trial ACM of a given

P_{n} (z)

that is a Hermitian, unitary, or positive matrix, for example. If our inverse problem can be solved systematically through an approach that finds such ACMs whatever the given

P_{n} (z)

, then we readily have at our disposal a good platform for successful applications to problems such as those mentioned below.

1.2. Physical Applications

Constructing an ACM of a given generally complex polynomial, in addition to being interesting in itself, also has considerable applications. In elementary algebra, for example, it could be a solution tool for counting the number of the real roots (and consequently that of the complex roots in the case of a complex polynomial) of a real or complex polynomial. In addition, it may help determine a (or the only) real root of a real polynomial of odd degree. In quantum mechanics or quantum information, it could provide new parametric representations of the density operator or the evolution operator of a physical system living in a finite-dimensional Hilbert space.

The results found for a generic complex polynomial can be applied to the important particular case of a real polynomial. Investigating such a link is certainly of interest in physics. For example, in classical physics, cubic real polynomials appear when looking for the principal axes of symmetric Cartesian tensors of rank two, such as inertia or magnetic/electric dipolar tensors [36]. In quantum mechanics, they enter the scene as characteristic polynomials of any observable of a physical system that lives in a three-dimensional Hilbert space, such as a three-level atom or a qutrit. Recipes for constructing an ACM of a cubic polynomial possessing real roots after the appropriate assignment of parametric real coefficients could provide an easy way to build, e.g., Hamiltonian qutrit models on demand for control purposes, or even the density operator describing a mixed state of a three-level atom.

The paper is organized as follows. In Section 2, we formulate the inverse problem consisting of the search of the ACM for a generic complex polynomial. In Section 3, we construct the ACMs for a generic cubic complex polynomial. Through these ACMs, we introduce a way to find the roots of the given polynomial without using the Cardano-Dal Ferro formulas. The case of the polynomial with real coefficients is also discussed in detail. In Section 4 we present an application in quantum mechanics, constructing on demand the density matrix of a qutrit system as an ACM. Section 5 shows the construction on demand of the unitary ACM of a qutrit. Possible extensions to higher-degree polynomials, as well as further possible applications, are discussed in Section 6.

2. Formulation of the Inverse Problem

Consider a matrix

A \in M_{n}

, where

M_{n}

is the set of all

n \times n

matrices over the complex field

C

. Denoting

I_{n} \in M_{n}

the identity matrix, the monic polynomial in the complex variable z

\begin{matrix} det (z I_{n} - A) & = & z^{n} + c_{1} z^{n - 1} + \dots + c_{n - 1} z + c_{n} \end{matrix}

(2)

is, by definition, the characteristic polynomial of A and belongs to the set

P_{n} [C]

of all complex monic polynomials of degree n.

It is well known that its n coefficients

c_{k}

,

k = 1, 2, \dots, n

contain information about the elements of A that is invariant under arbitrary similarity transformations (see Comment A6). In fact,

{(- 1)}^{k} c_{k}

is the sum of all the principal minors of order k of A. In particular,

c_{1} = - Tr A

and

c_{n} = {(- 1)}^{n} det A

. The profound interrelationship between a matrix and its characteristic polynomial becomes even more surprising considering the Cayley–Hamilton theorem (see Comment A7) [2] and/or Newton’s identities [37], which reveal the existence of finite algebraic expressions for the coefficients of the characteristic polynomial of a matrix in terms of traces of powers (up to n) of the matrix [38,39].
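
The trace-based expressions mentioned above can be sketched with Newton's identities. In the convention of Equation (2), and with t_i denoting the trace of the i-th power of A, the recursion k c_k = -(c_{k-1} t_1 + c_{k-2} t_2 + ... + c_0 t_k) with c_0 = 1 yields all coefficients. The code below is a minimal NumPy illustration; the function name and the random test matrix are assumptions made for the example.

```python
import numpy as np

def char_coeffs_from_traces(A):
    """Return [1, c_1, ..., c_n] of det(z I - A) using only traces of powers of A."""
    A = np.asarray(A, dtype=complex)
    n = A.shape[0]
    # t[i-1] = Tr(A^i) for i = 1, ..., n
    t = []
    M = np.eye(n, dtype=complex)
    for _ in range(n):
        M = M @ A
        t.append(np.trace(M))
    # Newton's identities: k c_k = -sum_{i=1}^{k} c_{k-i} t_i, with c_0 = 1
    c = [1.0 + 0.0j]
    for k in range(1, n + 1):
        s = sum(c[k - i] * t[i - 1] for i in range(1, k + 1))
        c.append(-s / k)
    return np.array(c)

rng = np.random.default_rng(0)
A = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
coeffs_from_traces = char_coeffs_from_traces(A)
coeffs_from_eigs = np.poly(A)   # reference values computed from the eigenvalues
```

For k = 1 and k = n the recursion reproduces the trace and determinant relations quoted above.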

The function

C : M_{n} \to P_{n} [C]

is surjective but not injective and, hence, it cannot be inverted. In fact, it is easy to convince oneself that, for any element

P_{n} (z) \in P_{n} [C]

,

C^{- 1} (P_{n} (z))

is indeed an infinite subset of

M_{n}

, since by definition it consists of all and only the matrices belonging to

M_{n}

whose characteristic polynomial is

P_{n} (z)

.

Thus, while the direct or forward problem of finding the characteristic polynomial of a given

n \times n

matrix is certainly well-posed according to Hadamard [40], conversely, the problem of finding a matrix

A \in M_{n}

generating a given complex polynomial

P_{n} (z)

is an ill-posed inverse problem [41,42], as it manifestly violates Hadamard’s uniqueness requirement, considering that every element

\in C^{- 1} (P_{n} (z))

is a solution to the problem.

It is possible to overcome such an ill-posedness by introducing a restriction

C |_{{[C]}_{n}}

of the function C to a subset

{[C]}_{n}

of

M_{n}

, which is injectively and surjectively valued on

P_{n} [C]

. To this end, let us first observe that the function C is surjective and, by definition,

P_{n} (z)

is the characteristic polynomial of all and only the matrices belonging to

C^{- 1} (P_{n} (z)) \subset M_{n}

. Moreover,

C^{- 1} (P_{n} (z)) \cap C^{- 1} ({P^{'}}_{n} (z)) = \emptyset

when

P_{n} (z) \neq {P^{'}}_{n} (z)

. Therefore, the infinite subsets

C^{- 1} (P_{n} (z))

of

M_{n}

corresponding to the infinitely many polynomials of degree n

P_{n} (z)

represent a partition of

M_{n}

. We can say equivalently that we are introducing in

M_{n}

the equivalence relation

A \sim B

consisting in the condition that

A \in M_{n}

and

B \in M_{n}

share the characteristic polynomial and thus belong to a given equivalence class

C^{- 1} (P_{n} (z))

. At this point, we define the subset

{[C]}_{n}

by choosing one element from each equivalence class. According to Zermelo’s postulate,

{[C]}_{n} \neq \emptyset

can always be constructed (in infinitely many ways in the present case, since each equivalence class is infinite), and the cardinality of its intersection with

C^{- 1} (P_{n} (z))

is precisely one for any

P_{n} (z)

by construction. Therefore, every one-to-one function

C |_{{[C]}_{n}} : {[C]}_{n} \to P_{n} [C]

obtained by applying the axiom of choice to the quotient set

M_{n} / \sim

to generate

{[C]}_{n}

is invertible. Hence, function

C |_{{[C]}_{n}}^{- 1} : P_{n} [C] \to {[C]}_{n}

defines a

{[C]}_{n}

-dependent Hadamard well-posed inverse problem, whose solution, by construction, can be given in terms of

{[C]}_{n}

in the form

\begin{matrix} C |_{{[C]}_{n}}^{- 1} (P_{n} (z)) = {[C]}_{n} \cap C^{- 1} (P_{n} (z)) . \end{matrix}

(3)

We remark that different legitimate choices of the subset

{[C]}_{n}

lead to different inverse problems, all well-posed in the fixed domain

P_{n} [C]

, and the corresponding solutions (3) differ in the generally non-similar images of one or more polynomials

P_{n} (z)

.

We also point out that any derogatory matrix D cannot be classified as a companion matrix of its characteristic polynomial

P_{D} (z) \equiv det (z I_{n} - D)

, since D annihilates a polynomial having a degree lower than that of

P_{D}

[2].

In this paper, a matrix whose characteristic polynomial coincides with a given polynomial

P_{n} (z)

is called an almost-companion matrix of

P_{n} (z)

. Clearly, any CM of

P_{n} (z)

is an ACM too. A derogatory matrix D such that

P_{D} (z) = P_{n} (z)

is an ACM. In addition, a matrix similar to an ACM is still an ACM. The converse of this statement is generally false: two ACMs of the same given polynomial are not necessarily similar [43]. The set of all the ACMs of

P_{n} (z)

cannot be generated by similarity transformations starting from an assigned ACM, since this set always includes both derogatory and non-derogatory matrices.

3. Almost-Companion Matrices of a Cubic Complex Polynomial

In this section, we focus on the search for an ACM of the third-degree polynomial

\begin{matrix} P_{3 c} (η) = η^{3} + p η + q, \end{matrix}

(4)

which is the canonical form of

\begin{matrix} P_{3} (z) & = & z^{3} + c_{1} z^{2} + c_{2} z + c_{3}, \end{matrix}

(5)

obtained by the translation

\begin{matrix} η = z + \frac{c_{1}}{3} . \end{matrix}

(6)

The generally complex numbers p and q in (4) are related to the coefficients of

P_{3} (z)

as follows:

\begin{matrix} p = - \frac{c_{1}^{2}}{3} + c_{2}, q = \frac{2 c_{1}^{3}}{27} - \frac{c_{1} c_{2}}{3} + c_{3} . \end{matrix}

(7)
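
A quick numerical check of the reduction (4)-(7) may be helpful. The sketch below, which assumes NumPy and arbitrary illustrative coefficients, computes p and q from Equation (7) and verifies that the roots of (5), shifted according to (6), are roots of the canonical form (4).

```python
import numpy as np

def depress(c1, c2, c3):
    """Coefficients p, q of the canonical form (4), following Equation (7)."""
    p = c2 - c1**2 / 3
    q = 2 * c1**3 / 27 - c1 * c2 / 3 + c3
    return p, q

# Arbitrary illustrative complex coefficients (not taken from the paper)
c1, c2, c3 = 1.0 + 2.0j, -0.5j, 3.0 - 1.0j
p, q = depress(c1, c2, c3)

z_roots = np.roots([1.0, c1, c2, c3])   # roots of P_3(z)
eta = z_roots + c1 / 3                  # translation (6)
residual = eta**3 + p * eta + q         # should vanish if eta solves (4)
```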

We denote

Q_{3 c}

the ACM of

P_{3 c} (η)

defined by

\begin{matrix} P_{3 c} (η) & \equiv & det (η I_{3} - Q_{3 c}) = det ((η - \frac{c_{1}}{3}) I_{3} - (Q_{3 c} - \frac{c_{1}}{3} I_{3})) \\ = & det (z I_{3} - (Q_{3 c} - \frac{c_{1}}{3} I_{3})) \equiv P_{3} (z), \end{matrix}

(8)

which means that

\begin{matrix} Q_{3} = Q_{3 c} - \frac{c_{1}}{3} I_{3}, \end{matrix}

(9)

is the simple recipe to obtain the corresponding ACM

Q_{3}

of

P_{3} (z)

from

Q_{3 c}

. This analysis sheds light on the advantage of first deriving

Q_{3 c}

for the simpler canonical form of a given polynomial and then finding

Q_{3}

from the straightforward relation (9).

Next, we formulate a trial ACM

Q_{3 c}

of (4). To this end, we observe that, in accordance with the Vieta-Girard formula for the sum of the roots of (4) [44], the absence of the quadratic term in

P_{3 c} (η)

implies that

Tr (Q_{3 c}) = 0

. Moreover, every matrix with elements in

C

is unitarily equivalent to a matrix with equal main diagonal elements [2]. Thus, it is legitimate to set the diagonal elements of our trial

Q_{3 c}

equal to zero. In constructing an ACM of (4), we aim to write its non-diagonal elements in such a way that, in the particular case of a real cubic

P_{3}

and, hence,

P_{3 c}

, the trial matrix

Q_{3 c}

becomes structurally Hermitian provided that p and q in (4) satisfy specific conditions, which will also be derived from our approach. The feasibility of this approach will highlight the greater flexibility of the ACMs compared to that of the CMs.

Following this strategy, we propose the following trial

Q_{3 c}

:

\begin{matrix} Q_{3 c} \equiv Q_{3 c} (ρ, φ, φ_{13}) = - (\begin{matrix} 0 & ρ e^{i \frac{φ}{2}} & ρ e^{i \frac{φ}{2}} e^{i φ_{13}} \\ ρ e^{i \frac{φ}{2}} & 0 & ρ e^{i \frac{φ}{2}} \\ ρ e^{i \frac{φ}{2}} e^{- i φ_{13}} & ρ e^{i \frac{φ}{2}} & 0 \end{matrix}), \end{matrix}

(10)

where the minus sign was introduced for convenience, considering the form of the characteristic polynomial. In Equation (10),

ρ

is real and positive,

φ

is real, whereas

φ_{13}

is, in general, a complex number. It is readily seen that, when

φ = 0

or

2 π

and

φ_{13}

is real, the matrix

Q_{3 c} (ρ, φ, φ_{13})

is Hermitian, consistent with our search strategy. It is useful to note that the complex conjugate of

e^{i φ_{13}}

is

e^{- i φ_{13}^{★}}

, where

φ_{13}^{★}

denotes the conjugate of

φ_{13}

.

The characteristic polynomial of

Q_{3 c} (ρ, φ, φ_{13})

is

\begin{matrix} det (η I_{3} - Q_{3 c} (ρ, φ, φ_{13})) = η^{3} - 3 ρ^{2} e^{i φ} η + 2 ρ^{3} e^{\frac{3}{2} i φ} cos φ_{13} . \end{matrix}

(11)
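
Equation (11) can be verified numerically for arbitrary parameter values, including a complex third parameter. The following sketch assumes NumPy; the function names and the sample values are hypothetical. It also checks the Hermitian case for a vanishing overall phase and a real third parameter.

```python
import numpy as np

def trial_Q3c(rho, phi, phi13):
    """Trial matrix of Equation (10); phi13 may be complex."""
    a = rho * np.exp(0.5j * phi)
    M = np.array([[0.0, 1.0, np.exp(1j * phi13)],
                  [1.0, 0.0, 1.0],
                  [np.exp(-1j * phi13), 1.0, 0.0]], dtype=complex)
    return -a * M

def expected_coeffs(rho, phi, phi13):
    """Coefficients [1, 0, p, q] read off from the right-hand side of (11)."""
    return np.array([1.0, 0.0,
                     -3 * rho**2 * np.exp(1j * phi),
                     2 * rho**3 * np.exp(1.5j * phi) * np.cos(phi13)])

# Illustrative values: rho > 0, phi real, phi13 complex
rho, phi, phi13 = 0.8, 0.7, 0.4 - 0.9j
Q = trial_Q3c(rho, phi, phi13)
matches_eq11 = np.allclose(np.poly(Q), expected_coeffs(rho, phi, phi13))

# Hermitian special case: phi = 0 and phi13 real
H = trial_Q3c(rho, 0.0, 1.1)
is_hermitian = np.allclose(H, H.conj().T)
```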

Then, identifying polynomial (4) with (11) yields:

\begin{matrix} \{\begin{matrix} p \equiv | p | e^{i Θ_{p}} = - 3 ρ^{2} e^{i φ} = 3 ρ^{2} e^{i (φ + π)}, \\ q = 2 ρ^{3} e^{\frac{3}{2} i φ} cos φ_{13} . \end{matrix} \end{matrix}

(12)

Given p, the first Equation (12) allows us to fix

ρ

and select

φ

(in an infinite set) as follows:

\begin{matrix} ρ = \sqrt{\frac{| p |}{3}}, φ = Θ_{p} - π . \end{matrix}

(13)

Defining, for

p \neq 0

, the complex parameter

\begin{matrix} χ = \frac{- i q e^{- \frac{3}{2} i Θ_{p}}}{2 \sqrt{\frac{{| p |}^{3}}{27}}}, \end{matrix}

(14)

the second Equation (12) becomes an elementary trigonometric equation in

C

:

\begin{matrix} cos φ_{13} = χ, \end{matrix}

(15)

which admits infinitely many solutions for any

χ

; in fact, similarly to the cosine function of a real variable, the complex cosine function is even and periodic with period

2 π

.

Using Euler’s formula, Equation (15) is easily transformed into a quadratic equation in the variable

e^{i φ_{13}}

, whose solution leads to

\begin{matrix} φ_{13} = - i ln (χ + i {| 1 - χ^{2} |}^{\frac{1}{2}} e^{\frac{i}{2} arg (1 - χ^{2})}) = arccos χ . \end{matrix}

(16)

Due to the presence of the multi-valued complex function

arg (1 - χ^{2})

, expression (16) represents the set of infinitely many images of

χ

generated by the inverse of the non-injective cosine function over

C

. Therefore, strictly speaking, the expression found for

φ_{13}

cannot be introduced as it is in the matrix

Q_{3 c} (ρ, φ, φ_{13})

. In fact, the three parameters appearing in the trial ACM (10) of (4) must be single-valued functions of the complex coefficients p and q. For our purposes, therefore, we now need to extract a specific single-valued complex function from the multi-valued function

φ_{13}

.

The single-valued complex function that we use here is the principal value

Φ_{13}

of

φ_{13}

, which is obtained from (16) by substituting the multi-valued functions arg and ln with their principal values, denoted Arg and Ln, respectively. This choice amounts, by definition, to constructing the principal value

Arccos (χ)

of function

arccos (χ)

, which is the one most commonly used in the literature [45,46]. It is worth noting that Equation (16) can also be written in terms of

χ^{2} - 1

, but then the use of the principal value in the resulting expression for

φ_{13}

would have some drawbacks, as is discussed in detail in [47].

The procedure described above gives

\begin{matrix} Φ_{13} & \equiv & Φ_{13} (χ) = - i Ln (χ + i {| 1 - χ^{2} |}^{\frac{1}{2}} e^{\frac{i}{2} Arg (1 - χ^{2})}) \equiv Arccos (χ) \\ & = & Arg (χ + i {| 1 - χ^{2} |}^{\frac{1}{2}} e^{\frac{i}{2} Arg (1 - χ^{2})}) - i ln (| χ + i {| 1 - χ^{2} |}^{\frac{1}{2}} e^{\frac{i}{2} Arg (1 - χ^{2})} |) \\ & \equiv & Re (Φ_{13}) + i Im (Φ_{13}), \end{matrix}

(17)

where ln denotes the ordinary real logarithm of its positive argument and the first term in (17) is the principal value of

arg (χ + i {| 1 - χ^{2} |}^{\frac{1}{2}} e^{\frac{i}{2} Arg (1 - χ^{2})})

, which, by definition, generates real images in

(- π, π]

. Equation (17) provides the algebraic representation of the complex single-valued function

Arccos (χ)

whatever

χ \in C

. Note that, for any real

χ

such that

| χ | \leq 1

(χ \geq 1)

, the imaginary (real) component of

Φ_{13}

identically vanishes.
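
The algebraic representation (17) can be transcribed almost literally into code. The sketch below assumes NumPy, whose `np.angle` and `np.log` return principal values, and compares the result with `np.arccos` at a few sample points chosen away from the branch cuts; the function name `Arccos` and the sample points are illustrative choices.

```python
import numpy as np

def Arccos(chi):
    """Principal value of arccos(chi) following the structure of Equation (17)."""
    chi = complex(chi)
    w = 1 - chi**2
    # |1 - chi^2|^(1/2) * exp((i/2) Arg(1 - chi^2)): principal square root of w
    root = np.sqrt(abs(w)) * np.exp(0.5j * np.angle(w))
    # -i Ln(...) = Arg(...) - i ln|...|
    return -1j * np.log(chi + 1j * root)

# Sample points away from the real half-lines |chi| > 1
samples = [0.3 + 0.4j, -1.7 + 0.2j, 2.0 - 3.0j, 0.5, -0.25]
from_eq17 = np.array([Arccos(s) for s in samples])
from_numpy = np.array([np.arccos(complex(s)) for s in samples])

# Whatever the branch, the cosine of the result must return chi
cos_roundtrip = np.cos(from_eq17)
```

For real arguments of modulus below one, the imaginary part of the result vanishes up to rounding, in line with the remark above.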

We determined all of the ingredients for constructing the trial ACM of (4) when

p \neq 0

. In accordance with (17), this ACM is a generally non-Hermitian matrix that can be written as follows:

\begin{matrix} {\tilde{Q}}_{3 c} (p, q) = \sqrt{\frac{| p |}{3}} e^{i \frac{φ_{p}}{2}} (\begin{matrix} 0 & 1 & e^{i Φ_{13} (χ)} \\ 1 & 0 & 1 \\ e^{- i Φ_{13} (χ)} & 1 & 0 \end{matrix}), \end{matrix}

(18)

where

\begin{matrix} φ_{p} = Θ_{p} + π . \end{matrix}

(19)

Thus, based on (9), the ACM of (5) has the form

\begin{matrix} {\tilde{Q}}_{3} (p, q) = (\begin{matrix} - \frac{c_{1}}{3} & \sqrt{\frac{| p |}{3}} e^{i \frac{φ_{p}}{2}} & \sqrt{\frac{| p |}{3}} e^{i [\frac{φ_{p}}{2} + Φ_{13} (χ)]} \\ \sqrt{\frac{| p |}{3}} e^{i \frac{φ_{p}}{2}} & - \frac{c_{1}}{3} & \sqrt{\frac{| p |}{3}} e^{i \frac{φ_{p}}{2}} \\ \sqrt{\frac{| p |}{3}} e^{i [\frac{φ_{p}}{2} - Φ_{13} (χ)]} & \sqrt{\frac{| p |}{3}} e^{i \frac{φ_{p}}{2}} & - \frac{c_{1}}{3} \end{matrix}) . \end{matrix}

(20)
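
The whole construction for p ≠ 0 can be sketched in a few lines. The code below is an illustration under stated assumptions rather than a reference implementation: it uses NumPy, takes `np.arccos` as the principal value of the complex inverse cosine, and the function name `acm_cubic` and the test polynomials are hypothetical. It follows Equations (7), (14), (18), (19), and (9), and checks that the characteristic polynomial of the result is the given cubic.

```python
import numpy as np

def acm_cubic(c1, c2, c3):
    """ACM of z^3 + c1 z^2 + c2 z + c3 built as in Equations (18)-(20), for p != 0."""
    c1, c2, c3 = complex(c1), complex(c2), complex(c3)
    p = c2 - c1**2 / 3                                   # Equation (7)
    q = 2 * c1**3 / 27 - c1 * c2 / 3 + c3
    if p == 0:
        raise ValueError("this construction assumes p != 0")
    theta_p = np.angle(p)                                # principal argument of p
    chi = -1j * q * np.exp(-1.5j * theta_p) / (2 * np.sqrt(abs(p)**3 / 27))  # Eq. (14)
    Phi13 = np.arccos(chi)                               # assumed principal branch
    prefactor = np.sqrt(abs(p) / 3) * np.exp(0.5j * (theta_p + np.pi))       # Eq. (19)
    M = np.array([[0.0, 1.0, np.exp(1j * Phi13)],
                  [1.0, 0.0, 1.0],
                  [np.exp(-1j * Phi13), 1.0, 0.0]], dtype=complex)
    return prefactor * M - (c1 / 3) * np.eye(3)          # Equation (9)

# Generic complex cubic (arbitrary illustrative coefficients)
coeffs = (1.0 + 2.0j, -0.5j, 3.0 - 1.0j)
Q3 = acm_cubic(*coeffs)
char_poly = np.poly(Q3)

# Real cubic z^3 - 7 z + 6 = (z - 1)(z - 2)(z + 3), with three real roots
H = acm_cubic(0.0, -7.0, 6.0)
H_eigs = np.sort(np.linalg.eigvals(H).real)
```

In the second example the resulting matrix is Hermitian up to rounding, as expected for a real cubic with three real roots. Only the cosine of the third parameter enters the characteristic polynomial, so the check does not depend on which branch of the inverse cosine is used.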

We can find an ACM of (4) for

p = 0

through the same kind of approach, beginning with a matrix different from (10). Our solution, denoted by

{\bar{Q}}_{3 c} (q)

, can be cast as follows:

\begin{matrix} {\bar{Q}}_{3 c} (q) = - {(\frac{| q |}{\sqrt{3}})}^{\frac{1}{3}} e^{\frac{i}{3} Arg (i q)} (\begin{matrix} 0 & 1 & 1 \\ - e^{- i \frac{4}{3} π} & 0 & - 1 \\ - e^{i \frac{4}{3} π} & 1 & 0 \end{matrix}) . \end{matrix}

(21)

For completeness, we also write

{\tilde{Q}}_{3 c} (p, 0)

(χ = 0)

:

\begin{matrix} {\tilde{Q}}_{3 c} (p, 0) = \sqrt{\frac{| p |}{3}} e^{i \frac{φ_{p}}{2}} (\begin{matrix} 0 & 1 & i \\ 1 & 0 & 1 \\ - i & 1 & 0 \end{matrix}), \end{matrix}

(22)

since

Φ_{13} (0) = π / 2

from (17). It is not difficult to see that the eigenvalues of (22) are 0 and

\pm {| p |}^{\frac{1}{2}} e^{\frac{i φ_{p}}{2}}

.

Above, we formulated and solved the inverse problem of finding ACMs of generic cubic complex polynomials. Next, by exploiting these ACMs, we will present a way to find the roots of the given polynomial without resorting to the Cardano-Dal Ferro formulas, and then we will delve into the implications of having a polynomial with real coefficients on the form of the ACM and its roots.

3.1. Roots Characterization

Next, we assume that

p \neq 0

. Then, the eigenvalues of

{\tilde{Q}}_{3 c} (p, q)

are the roots of (4) and, hence, the roots of the characteristic polynomial of the matrix appearing in the right-hand side of (18), each multiplied by the pre-factor

\sqrt{\frac{| p |}{3}} e^{i \frac{φ_{p}}{2}}

. This characteristic polynomial

{\tilde{P}}_{3 c} (\tilde{η})

in the unknown

\tilde{η}

has the form

\begin{matrix} {\tilde{P}}_{3 c} (\tilde{η}) = {\tilde{η}}^{3} - 3 \tilde{η} - 2 cos (Φ_{13} (χ)) . \end{matrix}

(23)

The cosine representation of the free term in polynomial (23) is remarkable because it allows one to guess, at first glance, one of its three roots and then to exactly construct the other two by simply reducing the cubic polynomial

{\tilde{P}}_{3 c} (\tilde{η})

to a quadratic polynomial. In fact, without resorting to the well-known Cardano-Dal Ferro formulas [48], and using instead the elementary triplication formula for the cosine function

cos (3 z) = 4 {cos}^{3} (z) - 3 cos (z)

, which also holds in the complex field, it is immediate to see that the generally complex expression (see Comment A8)

\begin{matrix} {\tilde{η}}_{1} = 2 cos (\frac{1}{3} Φ_{13} (χ)) \end{matrix}

(24)

is a root of (23), whatever the complex coefficients

p \neq 0

and q. The algebraic representation (17) of

Φ_{13} (χ)

is the key to explicitly write the p- and q-dependencies of the real and imaginary components of the algebraic expression for

{\tilde{η}}_{1}

, which are obtained using Euler’s formula as

\begin{matrix} Re ({\tilde{η}}_{1}) = 2 cos (\frac{1}{3} Re (Φ_{13})) cosh (\frac{1}{3} Im (Φ_{13})), \end{matrix}

(25)

\begin{matrix} Im ({\tilde{η}}_{1}) = - 2 sin (\frac{1}{3} Re (Φ_{13})) sinh (\frac{1}{3} Im (Φ_{13})), \end{matrix}

(26)

where

Re (Φ_{13})

and

Im (Φ_{13})

are defined in (17). The other two roots are easily found to be [49]

\begin{matrix} {\tilde{η}}_{k} = - \frac{1}{2} {\tilde{η}}_{1} + {(- 1)}^{k + 1} \sqrt{3} {| {sin}^{2} (\frac{1}{3} Φ_{13} (χ)) |}^{\frac{1}{2}} e^{\frac{i}{2} Arg {sin}^{2} (\frac{1}{3} Φ_{13} (χ))}, k = 2, 3 . \end{matrix}

(27)

Equations (24) and (27) express the roots of

{\tilde{P}}_{3 c} (\tilde{η})

as functions of parameter

Φ_{13}

, which appears in the first and last anti-diagonal terms of matrices (18) and (20). We, thus, conclude that our procedure to construct the ACM also yields the roots of the (generally complex) characteristic polynomial.
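
The root formulas (24) and (27) translate directly into a small routine. The sketch below assumes NumPy, uses `np.arccos` as the principal complex inverse cosine, and requires p ≠ 0; the function name `cubic_roots_acm` and the test polynomials are hypothetical. The roots of (23) are rescaled by the pre-factor of (18) and shifted back through (6).

```python
import numpy as np

def cubic_roots_acm(c1, c2, c3):
    """Roots of z^3 + c1 z^2 + c2 z + c3 via Equations (24) and (27), for p != 0."""
    c1, c2, c3 = complex(c1), complex(c2), complex(c3)
    p = c2 - c1**2 / 3
    q = 2 * c1**3 / 27 - c1 * c2 / 3 + c3
    if p == 0:
        raise ValueError("this sketch assumes p != 0")
    theta_p = np.angle(p)
    chi = -1j * q * np.exp(-1.5j * theta_p) / (2 * np.sqrt(abs(p)**3 / 27))
    Phi13 = np.arccos(chi)                       # assumed principal branch

    eta1 = 2 * np.cos(Phi13 / 3)                 # Equation (24)
    s2 = np.sin(Phi13 / 3)**2
    # |sin^2|^(1/2) * exp((i/2) Arg sin^2): principal square root, as in (27)
    root = np.sqrt(abs(s2)) * np.exp(0.5j * np.angle(s2))
    eta2 = -eta1 / 2 - np.sqrt(3) * root         # k = 2, sign (-1)^(k+1) = -1
    eta3 = -eta1 / 2 + np.sqrt(3) * root         # k = 3, sign (-1)^(k+1) = +1

    prefactor = np.sqrt(abs(p) / 3) * np.exp(0.5j * (theta_p + np.pi))
    # Rescale to roots of (4), then undo the translation (6)
    return prefactor * np.array([eta1, eta2, eta3]) - c1 / 3

coeffs = (1.0 + 2.0j, -0.5j, 3.0 - 1.0j)         # arbitrary illustrative cubic
roots = cubic_roots_acm(*coeffs)
rebuilt = np.poly(roots)                         # coefficients rebuilt from the roots

real_case = np.sort(cubic_roots_acm(0.0, -7.0, 6.0).real)   # (z - 1)(z - 2)(z + 3)
```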

It is interesting to highlight the conditions for a complex polynomial to admit real roots (note that the roots can never be all real, however, if the imaginary part of at least one of the polynomial coefficients is nonzero). To this end, it is convenient to start from the polynomial form (5). We write the three coefficients of (5) as

c_{j} \equiv x_{j} + i y_{j}

, with

j = 1, 2, 3

. If a real root r of (5) exists, it must satisfy the equation

y_{1} r^{2} + y_{2} r + y_{3} = 0

, which results from equating to zero the imaginary part of the polynomial.

δ = {(y_{2})}^{2} - 4 y_{1} y_{3} \geq 0

is clearly a necessary condition for the existence of the root r, and the only two possible expressions for r are

\frac{- y_{2} \pm \sqrt{δ}}{2 y_{1}}

. Either of these two expressions is indeed a root of (5) only if it satisfies the additional condition

r^{3} + x_{1} r^{2} + x_{2} r = - x_{3}

, which results from the real part of the polynomial. We can thus state that the cubic polynomial (5) has at least one real root if and only if the inequality

δ \geq 0

and the last condition are both met. In particular, when

δ = 0

, r is a double root. Finally, we examine the case

y_{1} = 0

(and

y_{2} \neq 0

, since otherwise no real root exists for a complex polynomial). Applying the same procedure, one finds that a real root

r = - \frac{y_{3}}{y_{2}}

of (5) exists if and only if the condition

r^{3} + x_{1} r^{2} + x_{2} r = - x_{3}

holds. It is easy to convince oneself that this real root has multiplicity two, being a real root of the first derivative of (5).
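The test for real roots described above can be sketched in a few lines. The function below is a minimal illustration, not part of the original procedure: it assumes that (5) has the form z^3 + c_1 z^2 + c_2 z + c_3, and its name, tolerance, and sample polynomials are hypothetical. Candidates are obtained from the imaginary-part equation and then filtered with the real-part condition.

```python
import math


def real_roots_of_complex_cubic(c1, c2, c3, tol=1e-9):
    """Real roots of z^3 + c1 z^2 + c2 z + c3 with c_j = x_j + i y_j.

    Assumes at least one coefficient has a nonzero imaginary part.
    """
    x1, x2, x3 = c1.real, c2.real, c3.real
    y1, y2, y3 = c1.imag, c2.imag, c3.imag
    if abs(y1) > tol:
        delta = y2 * y2 - 4 * y1 * y3
        if delta < 0:
            return []                       # necessary condition violated
        s = math.sqrt(delta)
        candidates = {(-y2 + s) / (2 * y1), (-y2 - s) / (2 * y1)}
    elif abs(y2) > tol:
        candidates = {-y3 / y2}             # case y1 = 0, y2 != 0
    else:
        return []                           # y1 = y2 = 0: no real root
    # Keep only the candidates that also cancel the real part.
    return sorted(r for r in candidates
                  if abs(r ** 3 + x1 * r * r + x2 * r + x3) < tol)


# (z - 2)(z - i)(z - 1 - i) = z^3 - (3 + 2i) z^2 + (1 + 5i) z + (2 - 2i)
example = real_roots_of_complex_cubic(complex(-3, -2), complex(1, 5), complex(2, -2))
```

For this sample polynomial the imaginary-part equation gives the candidates 1/2 and 2, and only r = 2 also satisfies the real-part condition.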

3.2. Real Polynomial Case

We now investigate the special form of

{\tilde{Q}}_{3 c} (p, q)

in the case in which the three coefficients of (5) are real, with the aim of establishing properties of the polynomial roots based on its almost-companion representation built above.

In this case, (7) implies that p and q are real, and thus (4) is also a real polynomial over

C

. Moreover, since

Θ_{p}

can only be 0 (

p > 0

) or

π

(

p < 0

), the parameter

χ

defined in (14) is purely imaginary or real, respectively, and, therefore, its square

\begin{matrix} χ^{2} = - \frac{27 q^{2} e^{- 3 i Θ_{p}}}{{4 | p |}^{3}} = - \frac{27 q^{2}}{4 p^{3}}, \end{matrix}

(28)

is real for any p. As a consequence, using (17) it is not difficult to prove that, if and only if (see also Comment A9)

\begin{matrix} Δ (p, q) \equiv \frac{p^{3}}{27} + \frac{q^{2}}{4} \leq 0, \end{matrix}

(29)

(which implies

p < 0

, i.e.,

Θ_{p} = π

, and

| χ | \leq 1

), the imaginary part of

Φ_{13} (χ)

given in Equation (17) vanishes, while its real part assumes the simple expression

\begin{matrix} Φ_{13} (p, q) = Arg (χ + i | 1 - χ^{2} |^{\frac{1}{2}}) = π - Arccos (- \frac{3 q}{2 p} \sqrt{\frac{- 3}{p}}), \end{matrix}

(30)

where we used the identity

Arccos (x) = π - Arccos (- x)

valid for any real x such that

| x | \leq 1

. It is worth noting that in the present case (27) takes the simpler form

{\tilde{η}}_{k} = - \frac{1}{2} {\tilde{η}}_{1} + {(- 1)}^{k + 1} \sqrt{3} sin (\frac{1}{3} Φ_{13} (χ))

since (30) shows that

Φ_{13} \in [0, π]

.

Under condition (29) and considering that

φ_{p} = Θ_{p} + π = 2 π

, the specialization of (18) to the case under scrutiny produces the Hermitian ACM of (4) as

\begin{matrix} {\tilde{Q}}_{3 c} (p, q) = - \sqrt{\frac{| p |}{3}} (\begin{matrix} 0 & 1 & e^{i Φ_{13} (p, q)} \\ 1 & 0 & 1 \\ e^{- i Φ_{13} (p, q)} & 1 & 0 \end{matrix}), \end{matrix}

(31)

where the real angle

Φ_{13}

is given by Equation (30). The Hermitian nature of the ACM built assures that (4), as well as (5), has three real roots. These roots are distinct when

Δ (p, q) < 0

, while two of them are coincident if

Δ (p, q) = 0

[50].

We can write the real roots

x_{k}

(k = 1, 2, 3)

of (5) as follows:

\begin{matrix} x_{k} = - \sqrt{\frac{| p |}{3}} {\tilde{η}}_{k} - \frac{c_{1}}{3}, \end{matrix}

(32)

where

{\tilde{η}}_{k}

are the three (real) roots of (23). Equations (24) and (27), together with (30), yield

\begin{matrix} x_{k} = 2 \sqrt{\frac{| p |}{3}} cos (\frac{Φ_{13} (p, q) + (2 k + 1) π}{3}) - \frac{c_{1}}{3}, k = 1, 2, 3 . \end{matrix}

(33)

It is possible to check that this formula gives the well-known trigonometric and translated forms of the three roots of (5) when they are real (see [51]).

When

Δ (p, q) > 0

, only one of the three roots of Equation (4) or (5) (with real polynomial coefficients) is real, while the other two roots are complex conjugates. In particular, the real root corresponds to

{\tilde{η}}_{3}

for

p > 0

and to

{\tilde{η}}_{1}

for

p < 0

. In more detail, the three roots are

\begin{matrix} z_{1} & = \frac{\sqrt{p}}{2} (Y + i X) - \frac{c_{1}}{3} \end{matrix}

(34)

\begin{matrix} z_{2} & = \frac{\sqrt{p}}{2} (Y - i X) - \frac{c_{1}}{3} \end{matrix}

(35)

\begin{matrix} z_{3} & = - \sqrt{p} Y - \frac{c_{1}}{3} \end{matrix}

(36)

where

X = \sqrt[3]{\sqrt{1 - χ^{2}} + i χ} + \sqrt[3]{\sqrt{1 - χ^{2}} - i χ}

(37)

and

Y = \frac{\sqrt[3]{\sqrt{1 - χ^{2}} + i χ} - \sqrt[3]{\sqrt{1 - χ^{2}} - i χ}}{\sqrt{3}}

(38)

with

χ = - i \frac{3 q}{2 p} \sqrt{\frac{3}{p}}

(39)

for

p > 0

, and

\begin{matrix} z_{1} & = - \sqrt{- \frac{p}{3}} C - \frac{c_{1}}{3} \end{matrix}

(40)

\begin{matrix} z_{k} & = \sqrt{- \frac{p}{3}} [\frac{C}{2} + i {(- 1)}^{k} \sqrt{3 (\frac{C^{2}}{4} - 1)}] - \frac{c_{1}}{3}, k = 2, 3, \end{matrix}

(41)

where

C = \sqrt[3]{χ + \sqrt{χ^{2} - 1}} + \sqrt[3]{χ - \sqrt{χ^{2} - 1}}

(42)

with

χ = - \frac{3 q}{2 p} \sqrt{- \frac{3}{p}}

(43)

for

p < 0

(see derivation in Appendix A.1). A more elaborate derivation of the roots leading to their formulation in terms of trigonometric functions can be found in [50].
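The closed forms (34)–(43) can be checked numerically. The sketch below is restricted to the canonical cubic η^3 + p η + q with real p ≠ 0, real q, and Δ > 0 (that is, c_1 = 0), and evaluates the formulas with real cube roots. The helper names are illustrative, and the real quantity kappa = iχ is introduced here only to keep the arithmetic real for p > 0.

```python
import math


def cbrt(x):
    """Real cube root, valid for negative arguments as well."""
    return math.copysign(abs(x) ** (1.0 / 3.0), x)


def roots_delta_positive(p, q):
    """Roots of eta^3 + p eta + q from (34)-(43); requires Delta(p, q) > 0."""
    if p > 0:
        kappa = (3 * q / (2 * p)) * math.sqrt(3 / p)   # i * chi, chi from (39)
        a = cbrt(math.sqrt(1 + kappa ** 2) + kappa)
        b = cbrt(math.sqrt(1 + kappa ** 2) - kappa)
        big_x, big_y = a + b, (a - b) / math.sqrt(3)   # X and Y of (37), (38)
        sp = math.sqrt(p)
        return [sp / 2 * complex(big_y, big_x),        # (34)
                sp / 2 * complex(big_y, -big_x),       # (35)
                complex(-sp * big_y, 0.0)]             # (36), the real root
    chi = -(3 * q / (2 * p)) * math.sqrt(-3 / p)       # (43)
    d = math.sqrt(chi ** 2 - 1)
    c = cbrt(chi + d) + cbrt(chi - d)                  # C of (42)
    s = math.sqrt(-p / 3)
    w = math.sqrt(3 * (c * c / 4 - 1))
    return ([complex(-s * c, 0.0)]                     # (40), the real root
            + [s * complex(c / 2, (-1) ** k * w) for k in (2, 3)])  # (41)


def max_residual(p, q):
    """Largest |eta^3 + p eta + q| over the three computed roots."""
    return max(abs(z ** 3 + p * z + q) for z in roots_delta_positive(p, q))
```

For example, `roots_delta_positive(3.0, -4.0)` returns the real root 1 of η^3 + 3η - 4 as its third entry, in agreement with the statement that the real root corresponds to the third root for p > 0.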

4. Almost-Companion Density Matrices of a Qutrit on Demand

A quantum system living in the Hilbert space

H

spanned by three orthonormal states

| 1 〉

,

| 2 〉

, and

| 3 〉

is called a qutrit [52]. A pure state of the qutrit can always be represented as a normalized linear combination of these three states. To describe an arbitrary pure or mixed state of the qutrit with the same formalism, one uses instead a linear operator

\hat{ρ}

called the density operator [53]. It acts on

H

and, by definition, is positive semi-definite with trace 1:

tr (ρ) = 1

. It is well known that any positive semi-definite operator is Hermitian since its skew-Hermitian part vanishes [2,54]. As a consequence, any positive semi-definite operator is diagonalizable, and it is possible to show that its eigenvalues are real non-negative numbers. In particular, any density operator

\hat{ρ}

is Hermitian. The three eigenvalues of the operator

\hat{ρ}

describing the state of a qutrit are the populations of the three eigenstates of

\hat{ρ}

. The

3 \times 3

basis-dependent matrix representation of

\hat{ρ}

, called density matrix and denoted by

ρ

, is also positive semi-definite and, hence, Hermitian. We observe incidentally that, conversely, any Hermitian matrix with non-negative eigenvalues is positive semi-definite and, if its trace is 1, it is a density matrix.

The purpose of this section is to demonstrate that our recipe for constructing the Hermitian ACM of a generic third-degree polynomial admitting three real roots provides an effective tool for writing density matrices of a qutrit on demand. It is worth noting that our approach itself does not require the support of a vector space, while the relationship to a basis of physical states appears in this application to a qutrit.

The aforementioned definition of density matrix results in unambiguous properties of the real coefficients of its characteristic polynomial. First of all, writing this polynomial in the canonical form

\begin{matrix} p_{3 c} (η) = η^{3} + p η + q, \end{matrix}

(44)

the condition

\begin{matrix} p \leq - \frac{3}{2} \sqrt[3]{2 q^{2}} \end{matrix}

(45)

stemming from Equation (29) ensures that

Φ_{13} (p, q)

is real, so that the ACM (31) of (44) is Hermitian and, hence, has real eigenvalues. Then, turning to form (5) of the monic polynomial through the translation (6), the Vieta–Girard formula for the sum of the roots [55] implies that the coefficient of the quadratic term is

- 1

. Moreover, Descartes’s sign rule [56] requires that the four coefficients of polynomial (5) have alternate signs in order to have three positive roots. Therefore, the characteristic polynomial of an arbitrary density matrix of a qutrit is necessarily a third-degree real and monic polynomial of the form

\begin{matrix} p_{3} (x) = x^{3} - x^{2} + a^{2} x - b^{2}, \end{matrix}

(46)

where a and b are real numbers that satisfy condition (45) after translation (6). One or two roots are zero if

a \neq 0, b = 0

or

a = 0, b = 0

, respectively, while the inequality (45) is never satisfied for

a = 0

and

b \neq 0

.
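The conditions on a and b can be explored with exact rational arithmetic. In the sketch below, the expressions for p and q are the standard coefficients of the depressed cubic obtained through the shift x = η - c_1/3; they are assumed here to coincide with the translation (6) and the definitions (7). The function names are hypothetical.

```python
from fractions import Fraction


def canonical_coefficients(c1, c2, c3):
    """(p, q) of eta^3 + p eta + q after the shift x = eta - c1/3."""
    p = c2 - c1 ** 2 / 3
    q = 2 * c1 ** 3 / 27 - c1 * c2 / 3 + c3
    return p, q


def delta(p, q):
    """Discriminant-like quantity of Equation (29)."""
    return p ** 3 / 27 + q ** 2 / 4


def satisfies_45(p, q):
    """Inequality (45), evaluated in floating point."""
    return float(p) <= -1.5 * (2 * float(q) ** 2) ** (1 / 3)


def canonical_from_ab(a_sq, b_sq):
    """Canonical coefficients of x^3 - x^2 + a^2 x - b^2, Equation (46)."""
    return canonical_coefficients(Fraction(-1), a_sq, -b_sq)


p_ok, q_ok = canonical_from_ab(Fraction(11, 36), Fraction(1, 36))
p_bad, q_bad = canonical_from_ab(Fraction(0), Fraction(1, 4))   # a = 0, b != 0
```

The first pair gives p = -1/36 and q = 0, for which Δ < 0 and (45) holds; the second pair, with a = 0 and b ≠ 0, gives Δ > 0 and violates (45).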

In conclusion, under conditions (45) and (46),

{\tilde{Q}}_{3} (p, q) = {\tilde{Q}}_{3 c} (p, q) - \frac{c_{1}}{3} I_{3}

, with

{\tilde{Q}}_{3 c} (p, q)

given by Equation (31), is a density matrix. Incidentally, we point out that our inverse problem admits infinitely many non-Hermitian solutions, i.e., non-Hermitian ACMs of (44) or (46), such as, for example, the corresponding Frobenius companion matrix. Therefore, the explicit construction of the Hermitian ACM of (46) and, more generally, of any real third-degree polynomial with only real roots is a successful outcome of our search strategy (10). This recipe, in turn, forms the basis of the application presented below.

Let us introduce the set

D

of all density matrices of a qutrit in a given basis

{| n 〉, n = 1, 2, 3}

of

H

. Let

E

be the binary relation in

D

defined as follows:

ρ_{1} \in D

and

ρ_{2} \in D

are in the relation

E

if they are ACMs of the same polynomial

p_{3} (x)

defined in (46). This relation, expressed by writing

ρ_{1} E ρ_{2}

, is an equivalence relation as it is manifestly reflexive, symmetric, and transitive.

D

is thus partitioned by

E

. The quotient set

D / E

consists of all equivalence classes of

D

with respect to

E

. Each equivalence class, which comprises all density matrices with the same characteristic polynomial, is represented by one (arbitrarily chosen) of its elements,

\bar{ρ}

, and is commonly denoted by

[\bar{ρ}]

. This is where our result (31) enters the scene, providing an easy way to parameterize the quotient set of

D

.

It is always possible to use the matrix

{\tilde{Q}}_{3} (p, q) = {\tilde{Q}}_{3 c} (p, q) - \frac{c_{1}}{3} I_{3}

as the representative element of the equivalence class consisting of all elements of

D

sharing the characteristic polynomial (46), which, in turn, is uniquely associated with its canonical form (44). In this way, we establish a one-to-one correspondence between

D / E

and the set

P

of polynomials (44). This correspondence amounts to parameterizing the quotient set of

D

in terms of p and q. The more ambitious goal of parameterizing the whole set

D

is discussed in a recent topical issue [57]. It is worth emphasizing that a density matrix

ρ

belongs to the equivalence class

[{\tilde{Q}}_{3 c} (p, q) - \frac{c_{1}}{3} I_{3}]

if and only if it can be unitarily generated from

{\tilde{Q}}_{3 c} (p, q) - \frac{c_{1}}{3} I_{3}

, since its characteristic polynomial, trace, and Hermiticity are unitarily invariant, thus implying the invariance of the positive semi-definiteness. Therefore, while two similar matrices are ACMs of the same polynomial, a similarity transformation of a density matrix does not generate, in general, a density matrix [2].

Our parameterization of

D / E

in terms of the coefficients of its characteristic polynomial written in canonical form provides the theoretical basis for constructing, on demand and in a prefixed basis of

H

, almost-companion density matrices of any assigned polynomial

p_{3} (x)

fulfilling the condition

Δ \leq 0

. We illustrate the concrete applicability of our recipe by constructing an almost-companion density matrix starting from the polynomial

\begin{matrix} p_{3} (x) = x^{3} - x^{2} + \frac{11}{36} x - \frac{1}{36} . \end{matrix}

(47)

The translation

η = x - \frac{1}{3}

yields

\begin{matrix} p_{3 c} (η) = η^{3} - \frac{1}{36} η, \end{matrix}

(48)

so that, in this case,

p = - \frac{1}{36}

while q vanishes. Equation (30) then gives

\begin{matrix} Φ_{13} (- \frac{1}{36}, 0) = π - Arccos (0) = \frac{π}{2} . \end{matrix}

(49)

We have, thus, obtained the few ingredients necessary to build an almost-companion density matrix of the given polynomial (47) as the sum

{\tilde{Q}}_{3}

of the representative element of the corresponding equivalence class and the matrix

\frac{1}{3} I_{3}

, in accordance with the realization (20) of (9). That is, the density matrix has the form

\begin{matrix} {\tilde{Q}}_{3} = \frac{1}{6 \sqrt{3}} (\begin{matrix} 2 \sqrt{3} & - 1 & - i \\ - 1 & 2 \sqrt{3} & - 1 \\ i & - 1 & 2 \sqrt{3} \end{matrix}) . \end{matrix}

(50)

Note that, while in this case it is easy to find the roots of (48) directly, and then those of (47), in general the roots of the polynomial

p_{3 c} (η)

corresponding to a given

p_{3} (x)

can be found using (33).
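The worked example can be verified numerically. The following sketch builds the matrix (31) shifted by -(c_1/3) I_3 for p = -1/36, Φ_{13} = π/2 and c_1 = -1, computes the coefficients of its characteristic polynomial from the trace, the principal 2 × 2 minors, and the determinant, and evaluates the roots through (33). The function names are hypothetical.

```python
import cmath
import math


def hermitian_acm(p, phi, c1):
    """Matrix (31) shifted by -(c1/3) I_3, for real p < 0 and real angle phi."""
    s = math.sqrt(abs(p) / 3)
    e = cmath.exp(1j * phi)
    core = [[0, 1, e], [1, 0, 1], [e.conjugate(), 1, 0]]
    return [[-s * core[i][j] - (c1 / 3 if i == j else 0) for j in range(3)]
            for i in range(3)]


def char_poly_3x3(m):
    """Return (c1, c2, c3) such that det(z I - m) = z^3 + c1 z^2 + c2 z + c3."""
    trace = m[0][0] + m[1][1] + m[2][2]
    minors = (m[0][0] * m[1][1] - m[0][1] * m[1][0]
              + m[0][0] * m[2][2] - m[0][2] * m[2][0]
              + m[1][1] * m[2][2] - m[1][2] * m[2][1])
    det = (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
           - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
           + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))
    return -trace, minors, -det


def roots_33(p, phi, c1):
    """Real roots from Equation (33), k = 1, 2, 3."""
    s = math.sqrt(abs(p) / 3)
    return [2 * s * math.cos((phi + (2 * k + 1) * math.pi) / 3) - c1 / 3
            for k in (1, 2, 3)]


rho = hermitian_acm(-1 / 36, math.pi / 2, -1.0)       # reproduces matrix (50)
coeffs = char_poly_3x3(rho)                           # compare with (47)
populations = sorted(roots_33(-1 / 36, math.pi / 2, -1.0))
is_hermitian = all(abs(rho[i][j] - complex(rho[j][i]).conjugate()) < 1e-12
                   for i in range(3) for j in range(3))
```

The computed coefficients reproduce -1, 11/36, and -1/36, and the three eigenvalues are 1/6, 1/3, and 1/2: they are non-negative and sum to one, as required for a density matrix.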

We stress that any matrix equivalent to (50) through a unitary transformation

\hat{V}

is an almost-companion density matrix of (47) and vice versa. For a given basis, each density matrix thus obtained describes a different (generally mixed) state of the qutrit. If, instead, the unitary transformation is interpreted as the generator of a change in the basis

{| n 〉, n = 1, 2, 3}

, the matrix obtained represents the same density operator in the new basis

{\hat{V} | n 〉, n = 1, 2, 3}

.
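The unitary invariance invoked above can be illustrated with a small numerical experiment. In the sketch below, the density matrix (50) is conjugated by a sample unitary matrix, here a real rotation mixing the first two basis states (an arbitrary choice made only for illustration), and the characteristic polynomial, the trace, and the Hermiticity are compared before and after. All names are hypothetical.

```python
import math

S = 1 / (6 * math.sqrt(3))
D = 2 * math.sqrt(3) * S                       # diagonal entry, equal to 1/3
RHO = [[D, -S, -1j * S], [-S, D, -S], [1j * S, -S, D]]   # matrix (50)


def matmul(a, b):
    """Product of two 3 x 3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def dagger(a):
    """Conjugate transpose."""
    return [[complex(a[j][i]).conjugate() for j in range(3)] for i in range(3)]


def char_poly_3x3(m):
    """Return (c1, c2, c3) such that det(z I - m) = z^3 + c1 z^2 + c2 z + c3."""
    trace = m[0][0] + m[1][1] + m[2][2]
    minors = (m[0][0] * m[1][1] - m[0][1] * m[1][0]
              + m[0][0] * m[2][2] - m[0][2] * m[2][0]
              + m[1][1] * m[2][2] - m[1][2] * m[2][1])
    det = (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
           - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
           + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))
    return -trace, minors, -det


def rotation_12(t):
    """A sample real orthogonal (hence unitary) matrix mixing states 1 and 2."""
    return [[math.cos(t), -math.sin(t), 0.0],
            [math.sin(t), math.cos(t), 0.0],
            [0.0, 0.0, 1.0]]


V = rotation_12(0.3)
rho_new = matmul(matmul(V, RHO), dagger(V))    # V rho V^dagger

gap = max(abs(x - y) for x, y in zip(char_poly_3x3(RHO), char_poly_3x3(rho_new)))
trace_new = sum(rho_new[i][i] for i in range(3))
hermitian = all(abs(rho_new[i][j] - complex(rho_new[j][i]).conjugate()) < 1e-12
                for i in range(3) for j in range(3))
```

The transformed matrix differs from (50) entry by entry, yet it remains Hermitian with unit trace and has the same characteristic polynomial (47).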

5. Unitary Matrices (Operators) on Demand

The effective construction of density matrices on demand in Section 4 results from the application of our procedure for constructing ACMs to third-degree polynomials that belong to the set

P

and satisfy, a priori, necessary and sufficient conditions for the existence of positive semi-definite ACMs of trace 1. By comparison, the construction of almost-companion unitary matrices on demand (that is, starting from a given appropriate third-degree polynomial) requires addressing two hurdles. The first one is to establish with certainty whether the given polynomial can be the characteristic polynomial of a unitary matrix without knowing its zeros a priori. The second difficulty lies in the fact that the trial ACM of a complex arbitrary polynomial, as given by the main Equations (9) and (10) of our procedure, is never unitary by construction. In this regard, it is important to note that the possibility of finding a non-unitary ACM of a given polynomial is not incompatible with the existence of a unitary almost-companion matrix for the given polynomial. In fact, different ACMs of a given polynomial are generally not unitarily equivalent.

In light of these considerations, we want to first identify possible structural properties shared by the coefficients of all the characteristic polynomials of a unitary matrix. Then, according to our general procedure, we will introduce a class of trial unitary matrices sufficiently representative to allow us to find a unique ACM for an assigned polynomial whose three roots are unknown but certainly have modulus 1.

5.1. Properties of the Characteristic Polynomial of a Unitary Matrix

It is easy to prove the following necessary and sufficient conditions concerning the characteristic polynomial of a unitary ACM:

Theorem 1.

Let

D_{m} (z)

be any complex polynomial of degree m, with

1 \leq m \leq n

, dividing an arbitrarily given complex polynomial

P_{n} (z)

. Then

P_{n} (z)

admits a unitary ACM if and only if every

D_{m} (z)

does.

Proof.

Necessity: if

P_{n} (z)

admits a unitary ACM, then all its roots have modulus one. This property is obviously transferred to each

D_{m} (z)

dividing

P_{n} (z)

, which, in turn, implies the existence of a diagonal unitary ACM of

D_{m} (z)

.

Sufficiency: Since

P_{n} (z)

can be represented as the product of n monic binomials whose free terms are complex numbers of modulus one by hypothesis, then a diagonal ACM of

P_{n} (z)

with entries having modulus one exists. This ACM is unitary [2]. □

When

n = 2

, the following result is readily verified:

Theorem 2.

A monic complex, second-degree polynomial is the characteristic polynomial of a unitary matrix of order 2, if and only if it has the structure

\begin{matrix} P_{2} (z) = z^{2} - r_{2} e^{i ϑ} z + e^{2 i ϑ}, \end{matrix}

(51)

with

r_{2} \in [0, 2]

and

ϑ \in (- π, π]

.

Proof.

To demonstrate this double statement it is sufficient to explicitly find the two roots of (51) for the necessity and to use simple geometric arguments (or exploit Theorem 1) for the sufficiency. □

We additionally remark that, for

r_{2} > 2

, the principal arguments of the two roots of (51) coincide with

ϑ

, and the product of their moduli, both different from unity, is still one.
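The behavior of the roots of (51) inside and outside the interval [0, 2] can be checked with the quadratic formula. The sketch below uses two arbitrary sample values of r_2; the function name is hypothetical.

```python
import cmath


def roots_p2(r2, vartheta):
    """Roots of P_2(z) = z^2 - r2 e^{i vartheta} z + e^{2 i vartheta}, Equation (51)."""
    b = -r2 * cmath.exp(1j * vartheta)
    c = cmath.exp(2j * vartheta)
    d = cmath.sqrt(b * b - 4 * c)
    return (-b + d) / 2, (-b - d) / 2


inside = roots_p2(1.2, 0.8)     # r2 in [0, 2]: both moduli equal 1
outside = roots_p2(3.0, 0.8)    # r2 > 2: moduli differ from 1, product stays 1

moduli_inside = [abs(z) for z in inside]
moduli_outside = [abs(z) for z in outside]
args_outside = [cmath.phase(z) for z in outside]
```

For r_2 = 3 both roots lie on the ray of direction e^{iϑ}, with moduli (3 ± √5)/2, so that their principal arguments are equal and the product of their moduli is one.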

When the order of the unitary matrix is greater than 2, it is still possible to find peculiar properties possessed by the coefficients of the corresponding characteristic polynomial. However, there are polynomials of degrees higher than 2 and structures similar to (51), which also have roots with moduli different from 1. We prove here the following useful necessary condition on the structure of the characteristic polynomial of a

3 \times 3

unitary matrix:

Theorem 3.

The complex third-degree characteristic polynomial of any unitary matrix of order 3 has necessarily the structure:

\begin{matrix} P_{3} (z) = z^{3} - r e^{i θ_{1}} z^{2} + r e^{i (θ - θ_{1})} z - e^{i θ}, \end{matrix}

(52)

where

r \in [0, 3]

,

θ_{1} \in (- π, π]

and

θ \in (- π, π]

.

Proof.

Given any three real numbers

α

,

β

, and

γ

, it is always possible to find three real numbers r,

θ_{1}

, and

θ

that satisfy the following relations:

\begin{matrix} e^{i α} + e^{i β} + e^{i γ} = r e^{i θ_{1}}, \end{matrix}

(53)

\begin{matrix} e^{i α} e^{i β} e^{i γ} = e^{i θ} . \end{matrix}

(54)

The product of (54) and the complex conjugate of (53) gives

\begin{matrix} r e^{i (θ - θ_{1})} = (e^{- i α} + e^{- i β} + e^{- i γ}) e^{i α} e^{i β} e^{i γ} = e^{i (α + β)} + e^{i (β + γ)} + e^{i (α + γ)}, \end{matrix}

(55)

where

r \in [0, 3]

,

θ_{1} \in (- π, π]

and

θ = Arg e^{i (α + β + γ)} \in (- π, π]

. Equations (53)–(55) represent the Vieta–Girard formulas for the three roots

e^{i α}

,

e^{i β}

, and

e^{i γ}

of polynomial (52). Since these roots have modulus 1, as is required for

P_{3} (z)

to be the characteristic polynomial of a unitary matrix, the Vieta–Girard formulas (53) and (54) clearly show that the complex coefficients of the characteristic polynomial of any

3 \times 3

unitary matrix are not independent. In fact, the free term and the coefficient of

z^{2}

, which are involved in Equations (54) and (53), respectively, uniquely determine the coefficient of z through (55), in accordance with (52). □

Similar to Theorem 1, Theorem 3 can be extended to a generic degree n. We emphasize that the polynomial form (52) and its roots have some remarkable properties. For example, the passage from z to the auxiliary variable

u = z e^{i ψ}

leads, up to a global phase factor, to a polynomial with the same structure (52) after the angle shifts

θ_{1}^{'} = θ_{1} + ψ

and

θ^{'} = θ + 3 ψ

(these shifts are unimportant for what concerns the polynomial structure since the angles

θ_{1}^{'}

and

θ^{'}

can take the same range of values as

θ_{1}

and

θ

). Therefore, the three roots of the new polynomial have the same moduli and relative principal arguments as the roots of the original polynomial (52). Another interesting property resulting from Equation (54) is that, if (52) admits one root with modulus one, the other two roots must have reciprocal moduli (including the case in which they also have modulus one).

Note that Theorem 3 only expresses a necessary condition and, therefore, there exist polynomials with structure (52) that do not admit a unitary ACM. Algebraic relations among the three parameters in the expression of

P_{3} (z)

not implied by the structure of the polynomial itself can ensure that

P_{3} (z)

admits a unitary ACM (vide infra).

Consider, for example, the case

r = 3

. The polynomial

z^{3} - 3 z^{2} + 3 e^{i π} z - e^{i π}

has the form (52), from which it is obtained by (arbitrarily) choosing

θ_{1} = 0

and

θ = π

. This is not the characteristic polynomial of a unitary

3 \times 3

matrix, since its roots are

- 1

and

2 \pm \sqrt{3}

; accordingly,

θ = π \neq Arg e^{i (3 θ_{1})} = 0

. The relation

θ = 3 θ_{1}

guarantees, instead, the existence of three roots of modulus 1 when

r = 3

, as is easily seen geometrically or from the fact that in this case

P_{3} (z) = {(z - e^{i θ_{1}})}^{3}

. As another example, for

r = 1

, (52) admits a unitary ACM for any

θ_{1} \in (- π, π]

and

θ \in (- π, π]

, as the roots of the polynomial are

e^{i θ_{1}}

and

e^{i \frac{θ - θ_{1} \pm π}{2}}

.
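Both examples are easy to confirm numerically by evaluating (52) at the stated roots. In the sketch below, the angles used for the case r = 1 are arbitrary sample values, and the function name is hypothetical.

```python
import cmath
import math


def p3(z, r, theta1, theta):
    """Polynomial (52) evaluated at z."""
    return (z ** 3 - r * cmath.exp(1j * theta1) * z ** 2
            + r * cmath.exp(1j * (theta - theta1)) * z - cmath.exp(1j * theta))


# r = 3, theta_1 = 0, theta = pi: roots -1 and 2 +/- sqrt(3).
roots_r3 = [-1.0, 2 + math.sqrt(3), 2 - math.sqrt(3)]
residuals_r3 = [abs(p3(z, 3.0, 0.0, math.pi)) for z in roots_r3]

# r = 1 with sample angles: roots e^{i theta_1} and e^{i (theta - theta_1 +/- pi)/2}.
theta1, theta = 0.9, -2.1
roots_r1 = [cmath.exp(1j * theta1),
            cmath.exp(1j * (theta - theta1 + math.pi) / 2),
            cmath.exp(1j * (theta - theta1 - math.pi) / 2)]
residuals_r1 = [abs(p3(z, 1.0, theta1, theta)) for z in roots_r1]
```

In the first case two of the three roots lie off the unit circle, whereas in the second case all three roots have modulus one for any choice of the two angles.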

The analysis in the next section will provide expressions for the coefficients of polynomial (52) that make it the characteristic polynomial of a unitary ACM.

5.2. Construction of a Trial Unitary ACM

In Section 3, it was convenient to search for an ACM of a generic monic complex polynomial (5) of the third degree in the unknown z by resorting to the canonical form (4) of the polynomial through a translation of the complex variable z. There are two advantages to using polynomial (4) in the translated variable

η

: the number of parameters appearing in the polynomial expression is reduced from 3 to 2 (namely, p and q instead of

c_{1}

,

c_{2}

, and

c_{3}

), and the very simple recipe (9) allows one to obtain the ACM of the given polynomial from the ACM of its canonical form (4).

It is clearly possible to pass from

P_{3} (z)

to its canonical form through the appropriate translation of z. Unfortunately, such a strategy is not convenient in this case, since the canonical polynomial generally does not admit a unitary ACM, and thus the further mathematical step complicates the achievement of our goal. Therefore, we propose a different approach that combines geometrical and analytical considerations.

Exploiting Theorems 1 and 2, we can represent each element

{\tilde{P}}_{3} (z)

of the set

[{\tilde{P}}_{3} (z)]

of all and only the third-degree polynomials that admit a unitary ACM and share the root 1 as follows:

\begin{matrix} {\tilde{P}}_{3} (z) = (z^{2} - r_{2} e^{i ϑ} z + e^{2 i ϑ}) (z - 1) = z^{3} - (1 + r_{2} e^{i ϑ}) z^{2} + (1 + r_{2} e^{- i ϑ}) e^{2 i ϑ} z - e^{2 i ϑ}, \end{matrix}

(56)

where

r_{2} \in [0, 2]

and

ϑ \in (- π, π]

. Each polynomial (56) possesses, by construction, a unitary ACM and, vice versa, the characteristic polynomial of any

3 \times 3

unitary matrix with a unit eigenvalue is a particular realization of (56).

[{\tilde{P}}_{3} (z)]

is a subset of the set

[P_{3} (z)]

of the polynomials of the form (52). This point is appreciated by noting that the coefficient of

z^{2}

in Equation (52) can always be represented as

\begin{matrix} r e^{i θ_{1}} = (1 + r_{2} e^{i ϑ}) \end{matrix}

(57)

with

\begin{matrix} {r_{2}}^{2} = 1 + r^{2} - 2 r cos θ_{1} \end{matrix}

(58)

and

\begin{matrix} ϑ = Arg (- 1 + r e^{i θ_{1}}) = 2 Arctan (\frac{r sin θ_{1}}{- 1 + r cos θ_{1} + \sqrt{1 + r^{2} - 2 r cos θ_{1}}}) . \end{matrix}

(59)

The last equality is based on the following identity [58], which gives the principal argument of a generic complex number

(x + i y) \in Ω

, where

Ω

coincides with the complex plane cut along the negative x-axis:

\begin{matrix} Arg (x + i y) = 2 Arctan (\frac{y}{x + \sqrt{x^{2} + y^{2}}}) . \end{matrix}

(60)

As expected, this formula leaves the argument of a complex number of null modulus undefined and, for any fixed negative x, implies

\begin{matrix} lim_{y ⟶ 0^{\pm}} Arg (x + i y) = \pm π . \end{matrix}

(61)
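Identity (60) and the decomposition (57)–(59) can be compared with a library implementation of the principal argument. The sketch below uses `cmath.phase` from the Python standard library as the reference; the sample points and function names are hypothetical.

```python
import cmath
import math


def arg_half_angle(x, y):
    """Principal argument via identity (60); undefined on the cut y = 0, x <= 0."""
    return 2 * math.atan(y / (x + math.hypot(x, y)))


def r2_and_vartheta(r, theta1):
    """Solve r e^{i theta1} = 1 + r2 e^{i vartheta} using (58) and (59)."""
    r2 = math.sqrt(1 + r ** 2 - 2 * r * math.cos(theta1))
    vartheta = arg_half_angle(-1 + r * math.cos(theta1), r * math.sin(theta1))
    return r2, vartheta


samples = [(1.0, 1.0), (-1.0, 0.5), (-2.0, -3.0), (0.0, 1.0)]
errors = [abs(arg_half_angle(x, y) - cmath.phase(complex(x, y)))
          for x, y in samples]

r, theta1 = 1.5, 0.9
r2, vartheta = r2_and_vartheta(r, theta1)
mismatch = abs(1 + r2 * cmath.exp(1j * vartheta) - r * cmath.exp(1j * theta1))
```

The half-angle form avoids the quadrant bookkeeping of a plain arctangent of y/x, at the price of being undefined on the cut, where the denominator vanishes.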

The above equations clearly show that

[{\tilde{P}}_{3} (z)]

is obtained as a subset of

[P_{3} (z)]

by introducing the relations (57)–(59) among the parameters r,

θ_{1}

, and

θ

, which are arbitrary in their ranges of definition in the polynomial expression (52). In particular,

θ

is compatible with (56) only if

\begin{matrix} θ = 2 ϑ, \end{matrix}

(62)

thus leading to the following:

Theorem 4.

A monic third-degree polynomial (52) belongs to the set

[{\tilde{P}}_{3} (z)]

if and only if it can be written in the form

\begin{matrix} {\tilde{P}}_{3} (z) = z^{3} - (1 + r_{2} e^{i \frac{θ}{2}}) z^{2} + (1 + r_{2} e^{- i \frac{θ}{2}}) e^{i θ} z - e^{i θ}, \end{matrix}

(63)

where

r_{2} \in [0, 2]

and

θ \in (- π, π]

.

Note that the range of

r_{2}

values is dictated by Theorem 2, and the corresponding range of r values resulting from Equation (58) for any given

θ_{1}

is a subset of the interval

[0, 3]

in Equation (52), in accordance with the fact that

[{\tilde{P}}_{3} (z)]

is a subset of

[P_{3} (z)]

. On the other hand, since

ϑ \in (- π, π]

, Equation (62) implies that

θ \in (- 2 π, 2 π]

, which can be clearly reduced to the principal interval

(- π, π]

. We emphasize that requiring (52) to be the characteristic polynomial of a unitary matrix with a real positive eigenvalue (that is, 1) entailed relations between the three parameters in Equation (52), thus leading to the dependence of polynomial (63) on only two parameters,

r_{2}

and

θ

.

At this point, we can construct an ACM for a polynomial of the kind (63). The polynomial factorization in Equation (56) enables a block diagonal form for the ACM, with the one-dimensional block simply equal to 1. The diagonal elements of the

2 \times 2

block can be set equal [2] and are immediately obtained from Vieta’s formula for the sum of the roots of polynomial (51). Then, simple algebraic considerations lead to the following ACM of polynomial (63):

{\tilde{W}}_{3} = (\begin{matrix} \frac{r_{2}}{2} e^{i \frac{θ}{2}} & \sqrt{1 - {(\frac{r_{2}}{2})}^{2}} e^{i \frac{θ}{2}} & 0 \\ - \sqrt{1 - {(\frac{r_{2}}{2})}^{2}} e^{i \frac{θ}{2}} & \frac{r_{2}}{2} e^{i \frac{θ}{2}} & 0 \\ 0 & 0 & 1 \end{matrix}) .

(64)

It is easy to verify that the characteristic polynomial of (64) coincides with (63) for all the allowed values of the parameters

r_{2}

and

θ

. The three columns of

{\tilde{W}}_{3}

are normalized and mutually orthogonal, and these properties imply the unitarity of matrix

{\tilde{W}}_{3}

. If

r_{2} > 2

, and thus it is out of the range given in Theorem 4, the first two columns of the matrix are not orthogonal for any

θ

, and then

{\tilde{W}}_{3}

is no longer unitary.

It is worth noting that (64) can be seen as the result of a partial diagonalization of another unitary matrix with the same eigenvalues and that all the other ACMs of a polynomial (63) can be obtained by unitary transformation of (64).

Once an ACM is constructed for any polynomial (63), the subset of

[P_{3} (z)]

that contains all and only the polynomials (52) admitting a unitary ACM can be generated by rotating the roots of each polynomial (63) by an angle

ϵ \in (- π, π]

. This amounts to changing the complex variable z to

u = z e^{i ϵ}

in

{\tilde{P}}_{3} (z)

. Then, up to a global phase factor

e^{3 i ϵ}

, the polynomial

{\tilde{P}}_{3} (u) = {\tilde{P}}_{3} (z e^{i ϵ})

is equal to

\begin{matrix} P_{3 ϵ} (z) = z^{3} - (1 + r_{2} e^{i \frac{θ}{2}}) e^{- i ϵ} z^{2} + (1 + r_{2} e^{- i \frac{θ}{2}}) e^{i θ} e^{- 2 i ϵ} z - e^{i (θ - 3 ϵ)} . \end{matrix}

(65)

The root 1 of

{\tilde{P}}_{3} (z)

corresponds to a general complex root of modulus one,

e^{- i ϵ}

, of

P_{3 ϵ} (z)

.

[P_{3 ϵ} (z)]

includes all and only the polynomials

P_{3 ϵ} (z)

, which, by construction, admit a unitary ACM and, therefore, also satisfy the necessary condition expressed by Theorem 3. We have thus proved the following:

Theorem 5.

A complex monic polynomial of the third degree is the characteristic polynomial of a unitary matrix of order 3 if and only if it has the structure (65), with

r_{2} \in [0, 2]

,

θ \in (- π, π]

, and

ϵ \in (- π, π]

.

Next, we accomplish the main objective of this section by constructing an ACM

W_{3}

of

P_{3 ϵ} (z)

. To this end, in analogy with recipe (9), we use the transformation

z = u e^{- i ϵ}

to generate

W_{3}

from the matrix

{\tilde{W}}_{3}

in Equation (64) by proceeding as follows:

\begin{matrix} {\tilde{P}}_{3} (u) & \equiv & det (u I_{3} - {\tilde{W}}_{3}) = det (e^{i ϵ} (z I_{3} - e^{- i ϵ} {\tilde{W}}_{3})) \\ & = & e^{3 i ϵ} det (z I_{3} - W_{3}) \equiv e^{3 i ϵ} P_{3 ϵ} (z), \end{matrix}

(66)

where

W_{3} \equiv e^{- i ϵ} {\tilde{W}}_{3}

. In light of our previous arguments, the characteristic polynomial of the matrix

W_{3} = e^{- i ϵ} {\tilde{W}}_{3} = (\begin{matrix} \frac{r_{2}}{2} e^{i (\frac{θ}{2} - ϵ)} & \sqrt{1 - {(\frac{r_{2}}{2})}^{2}} e^{i (\frac{θ}{2} - ϵ)} & 0 \\ - \sqrt{1 - {(\frac{r_{2}}{2})}^{2}} e^{i (\frac{θ}{2} - ϵ)} & \frac{r_{2}}{2} e^{i (\frac{θ}{2} - ϵ)} & 0 \\ 0 & 0 & e^{- i ϵ} \end{matrix}),

(67)

is polynomial (65). The unitary matrix (67) fully meets our goal. To reach this objective, we first characterized the class

[P_{3 ϵ} (z)]

of all and only the polynomials that admit a unitary ACM, thus removing the difficulties related to the fact that the structure (52) is necessary but not sufficient. Then, exploiting recipe (66), we established the form

W_{3}

of a unitary ACM for any polynomial belonging to

[P_{3 ϵ} (z)]

.
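The construction can be tested numerically by building the matrix (67) for sample parameter values and comparing its characteristic polynomial with the coefficients read off from (65). The sketch below is a plain illustration with hypothetical function names; the characteristic polynomial is obtained from the trace, the principal 2 × 2 minors, and the determinant.

```python
import cmath


def w3(r2, theta, eps):
    """Matrix (67); it reduces to (64) for eps = 0."""
    c = r2 / 2
    s = cmath.sqrt(1 - c * c)                 # becomes imaginary when r2 > 2
    ph = cmath.exp(1j * (theta / 2 - eps))
    return [[c * ph, s * ph, 0],
            [-s * ph, c * ph, 0],
            [0, 0, cmath.exp(-1j * eps)]]


def coefficients_65(r2, theta, eps):
    """Coefficients (c1, c2, c3) of z^3 + c1 z^2 + c2 z + c3 read off from (65)."""
    c1 = -(1 + r2 * cmath.exp(1j * theta / 2)) * cmath.exp(-1j * eps)
    c2 = (1 + r2 * cmath.exp(-1j * theta / 2)) * cmath.exp(1j * (theta - 2 * eps))
    c3 = -cmath.exp(1j * (theta - 3 * eps))
    return c1, c2, c3


def char_poly_3x3(m):
    """Return (c1, c2, c3) such that det(z I - m) = z^3 + c1 z^2 + c2 z + c3."""
    trace = m[0][0] + m[1][1] + m[2][2]
    minors = (m[0][0] * m[1][1] - m[0][1] * m[1][0]
              + m[0][0] * m[2][2] - m[0][2] * m[2][0]
              + m[1][1] * m[2][2] - m[1][2] * m[2][1])
    det = (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
           - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
           + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))
    return -trace, minors, -det


def is_unitary(m, tol=1e-12):
    """Check that the columns of m are orthonormal, i.e. m^dagger m = I."""
    for i in range(3):
        for j in range(3):
            inner = sum(complex(m[k][i]).conjugate() * m[k][j] for k in range(3))
            if abs(inner - (1 if i == j else 0)) > tol:
                return False
    return True


def max_coefficient_gap(r2, theta, eps):
    """Largest deviation between the computed coefficients and those of (65)."""
    got = char_poly_3x3(w3(r2, theta, eps))
    want = coefficients_65(r2, theta, eps)
    return max(abs(g - w) for g, w in zip(got, want))
```

With r_2 = 1.3, θ = 0.7 and ϵ = -0.4 the matrix is unitary and its characteristic polynomial agrees with (65). With r_2 = 3 the characteristic polynomial still agrees with (65), but the unitarity check fails, consistently with the range of r_2 required by Theorem 5.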

In the following, we illustrate our approach through some applications.

5.3. Examples

Given the parameter $σ = \pm 1$ , consider the subclass of polynomials (65) with $ϵ = \frac{(1 - σ) π}{2}$ :

$\begin{matrix} P_{3 σ} (z) & = & z^{3} - (1 + r_{2} e^{i \frac{θ}{2}}) e^{- i \frac{(1 - σ) π}{2}} z^{2} + (1 + r_{2} e^{- i \frac{θ}{2}}) e^{i θ} z - e^{i (θ - \frac{(1 - σ) π}{2})} \\ & = & z^{3} - (1 + r_{2} e^{i \frac{θ}{2}}) σ z^{2} + (1 + r_{2} e^{- i \frac{θ}{2}}) e^{i θ} z - σ e^{i θ} . \end{matrix}$

(68)

In Equation (68) we exploited the identity $e^{- i \frac{(1 - σ) π}{2}} = σ$ . It is easy to verify that $P_{3 σ} (σ) = 0$ , which means that (68) is the most general polynomial with the real root $σ$ that admits a unitary ACM. In view of (67), this ACM can be written on demand as

$\begin{matrix} W_{3 σ} = (\begin{matrix} σ \frac{r_{2}}{2} e^{i \frac{θ}{2}} & σ \sqrt{1 - {(\frac{r_{2}}{2})}^{2}} e^{i \frac{θ}{2}} & 0 \\ - σ \sqrt{1 - {(\frac{r_{2}}{2})}^{2}} e^{i \frac{θ}{2}} & σ \frac{r_{2}}{2} e^{i \frac{θ}{2}} & 0 \\ 0 & 0 & σ \end{matrix}) . \end{matrix}$

(69)
Consider the polynomial

$\begin{matrix} P_{3, r_{2} = 0} (z) & = & z^{3} - e^{i θ_{1}} z^{2} + e^{i θ^{'}} e^{- i θ_{1}} z - e^{i θ^{'}}, \end{matrix}$

(70)

obtained by setting $r_{2} = 0$ , $ϵ = - θ_{1}$ , and $θ = θ^{'} - 3 θ_{1}$ in Equation (65). With these choices, using (67) we immediately find that polynomial (70) admits the ACM

$\begin{matrix} W_{r_{2} = 0} = (\begin{matrix} 0 & e^{i \frac{θ^{'} - θ_{1}}{2}} & 0 \\ - e^{i \frac{θ^{'} - θ_{1}}{2}} & 0 & 0 \\ 0 & 0 & e^{i θ_{1}} \end{matrix}) . \end{matrix}$

(71)

Polynomial (70) coincides with polynomial (52) for $r = 1$ , up to a trivial change of notation ( $θ$ is substituted with $θ^{'}$ ). This polynomial is then the characteristic polynomial of a unitary matrix for any $θ^{'}$ and $θ_{1}$ , as we observed in Section 5.1, with eigenvalues that are now immediately derived from the matrix (71) as $e^{i θ_{1}}$ and $e^{i \frac{θ^{'} - θ_{1} \pm π}{2}}$ .
For $r = 0$ , (52) yields the polynomial $z^{3} - e^{i θ}$ and, whatever the $ϵ$ value, Equations (57) and (62) imply that $r_{2} = 1$ and $θ = 2 π$ in Equation (65). The corresponding ACM, with structure (67), takes the form

$\begin{matrix} W_{r = 0} = (\begin{matrix} \frac{1}{2} e^{i (π - ϵ)} & \sqrt{\frac{3}{4}} e^{i (π - ϵ)} & 0 \\ - \sqrt{\frac{3}{4}} e^{i (π - ϵ)} & \frac{1}{2} e^{i (π - ϵ)} & 0 \\ 0 & 0 & e^{- i ϵ} \end{matrix}) . \end{matrix}$

(72)

Apart from the cases $r = 1$ and $r = 0$ examined above, for a generic r Equation (52) admits a unitary ACM only under r-dependent algebraic constraints on $θ$ and $θ_{1}$ . These constraints are realized in the polynomial form (65) through Equations (57)–(59) and (62), for the ranges of parameter values defined in Theorem 5.
These conditions are not all satisfied by the polynomial

$\begin{matrix} P_{3, r = 2} (z) & = & z^{3} - 2 e^{i π} e^{- i ϵ} z^{2} + 2 e^{i π} e^{- 2 i ϵ} z - e^{- 3 i ϵ}, \end{matrix}$

(73)

which is of the form (52) with $r = 2$ , $θ_{1} = - ϵ + π$ and $θ = 2 π - 3 ϵ$ . It is easy to see that this polynomial coincides with

$\begin{matrix} P_{3, r = 2} (z) & = & z^{3} - (1 + 3 e^{i π}) e^{- i ϵ} z^{2} + (1 + 3 e^{- i π}) e^{- 2 i ϵ} z - e^{- 3 i ϵ}, \end{matrix}$

(74)

which has the form (65) with $θ = 2 π$, except that $r_{2} = 3 \notin [0, 2]$, whereas Theorem 5 requires $r_{2} \in [0, 2]$ because of Theorem 2. As a consequence, the polynomial (74) or, equivalently, (73) cannot be the characteristic polynomial of a unitary $3 \times 3$ matrix.
Polynomial (73) obviously admits an ACM of the form (20), which is obtained using the general protocol in Section 3. Moreover, the insertion of $r_{2} = 3$ and $θ = 2 π$ in matrix (67) leads to another ACM of polynomial (73) of the form

$W_{3 n u} = (\begin{matrix} - \frac{3}{2} e^{- i ϵ} & - \frac{i}{2} \sqrt{5} e^{- i ϵ} & 0 \\ \frac{i}{2} \sqrt{5} e^{- i ϵ} & - \frac{3}{2} e^{- i ϵ} & 0 \\ 0 & 0 & e^{- i ϵ} \end{matrix}),$

(75)

which is manifestly non-unitary. In general, two ACMs of the same polynomial, such as $W_{3 n u}$ and the ACM of the form (20), need not be similar. A sufficient condition for their similarity is that the three complex eigenvalues they share are distinct. In this case, in fact, both matrices are surely diagonalizable [2] and can therefore be brought (in general, not unitarily) to the same diagonal matrix.
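The claim that (75) is an ACM of polynomial (73) can be checked numerically by comparing coefficients. The sketch below computes the characteristic polynomial of $W_{3 n u}$ from its trace, the sum of its principal $2 \times 2$ minors, and its determinant, and compares the result with the coefficients of (73). It also evaluates (73) at $e^{- i ϵ}$ times $(- 3 \pm \sqrt{5}) / 2$ and $1$, which are the roots obtained by factoring the polynomial by hand. The helper names and the sample value of $ϵ$ are ours.

```python
import cmath
import math

def w_3nu(eps):
    """Matrix (75) for a given epsilon (in radians)."""
    ph = cmath.exp(-1j * eps)
    off = 0.5j * math.sqrt(5.0) * ph
    return [[-1.5 * ph, -off, 0j],
            [off, -1.5 * ph, 0j],
            [0j, 0j, ph]]

def char_poly_coeffs(m):
    """Coefficients (c1, c2, c3) of det(z I - m) = z^3 + c1 z^2 + c2 z + c3."""
    trace = m[0][0] + m[1][1] + m[2][2]
    minors = (m[0][0] * m[1][1] - m[0][1] * m[1][0]
              + m[0][0] * m[2][2] - m[0][2] * m[2][0]
              + m[1][1] * m[2][2] - m[1][2] * m[2][1])
    det = (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
           - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
           + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))
    return (-trace, minors, -det)

def poly_73_coeffs(eps):
    """Coefficients of z^2, z and 1 in polynomial (73)."""
    e1 = cmath.exp(-1j * eps)
    minus_one = cmath.exp(1j * math.pi)
    return (-2 * minus_one * e1, 2 * minus_one * e1 ** 2, -e1 ** 3)

eps = 0.9                                   # arbitrary sample value (assumption)
coeffs_w = char_poly_coeffs(w_3nu(eps))
coeffs_73 = poly_73_coeffs(eps)
coeff_error = max(abs(a - b) for a, b in zip(coeffs_w, coeffs_73))

# Roots of (73): exp(-i eps) times (-3 - sqrt 5)/2, (-3 + sqrt 5)/2 and 1.
factors = [(-3 - math.sqrt(5)) / 2, (-3 + math.sqrt(5)) / 2, 1.0]
roots = [f * cmath.exp(-1j * eps) for f in factors]
c1, c2, c3 = coeffs_73
residuals = [abs(z ** 3 + c1 * z ** 2 + c2 * z + c3) for z in roots]
moduli = [abs(z) for z in roots]
print(coeff_error, residuals, moduli)
```

Two of the three roots do not lie on the unit circle, which is consistent with the statement that no unitary matrix can have (73) as its characteristic polynomial. Since the three roots are distinct for every $ϵ$, the sufficient condition for similarity mentioned above is met in this example.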

6. Discussion and Conclusions

Finding a matrix with an assigned characteristic polynomial is a classic inverse problem solved by Frobenius a long time ago. In Section 2 we observed that there are infinitely many other solutions, generally not equivalent to the one found by Frobenius. The exhaustive description of the set $S (P_{n} (z))$ of all complex matrices sharing the same characteristic polynomial is still an open problem. Among the reasons for the missing solution to this problem, it is useful to consider that, even if $S (P_{n} (z))$ is invariant under similarity transformations, it includes non-similar (and in general not even equivalent) matrices. Another problem is that the structure of the non-empty subset of non-sparse CMs belonging to $S (P_{n} (z))$, unlike the set of sparse CMs [35], has not yet been fully characterized.

A related inverse problem, stimulated by applications of current interest to both physicists and mathematicians, is the search for ACMs constrained to possess prescribed structural properties, such as, for example, unitarity, positive semi-definiteness, or Hermiticity.

The first focus of this study is the construction of a new ACM of a generic monic and complex third-degree polynomial $P_{3} (z)$, characterized by versatility for applications. This objective is pursued by parameterizing the elements of the ACM in such a way that they lend themselves to additional constraints dictated by specific problems (of which only the structural properties are exploited). To the best of our knowledge, our investigation of inverse problems of this kind opens a new fruitful chapter in this research area whose central goal is the proposal of new ACMs which, in particular, can be CMs. We address the three above-mentioned constrained inverse problems, providing methodology and results that aim for broad applicability and are potentially transferable to solving analogous problems involving higher-degree polynomials.

The adopted step-by-step approach builds new specific classes of unconstrained or constrained matrices as ACMs of suitably given polynomials, relaxing from the outset any condition on the degree of the minimal polynomial. In particular, the strategy implemented, as well as the mathematical tools used, is not influenced by any FCM-based or FCM-inspired technique.

The elements of the ACM that we construct as a solution to the general inverse problem are single-valued complex functions of the coefficients of the given generic polynomial. Exploiting the structural properties of this ACM, we find the algebraic expressions for the three, generally complex, roots of its complex characteristic polynomial.

It is remarkable that, when the polynomial becomes real, the associated general ACM smoothly becomes Hermitian under easy-to-find necessary and sufficient conditions on the coefficients of the polynomial described by (45). Using simply the fact that the ACM becomes Hermitian if and only if $Δ \leq 0$, we are able to extend the trigonometric representation of the three roots of the real polynomial to all possible cases, that is, even when (29) does not hold. This representation is obtained without resorting to the well-known Cardano-Del Ferro formulas.

We emphasize that the FCM of a characteristic polynomial does not undergo any structural change when the coefficients of the complex polynomial become real, thus providing no additional information on the polynomial roots. For this reason, we claim that the FCM has a lower flexibility than our ACM. We show how to use this flexibility to obtain an ACM of a prescribed characteristic polynomial on demand, applying our procedure to the important problem of finding a density matrix, particularly that of a qutrit.

A second, novel constrained inverse problem addressed here consists in finding a unitary ACM of a generic polynomial with three roots of modulus one. Excluding the trivial case in which the unitary ACM can be directly given in diagonal form, we first reach the intermediate goal (interesting in itself) of parameterizing $c_{1}$, $c_{2}$, and $c_{3}$ in the polynomial (5) so as to set the necessary conditions on the structure of a polynomial for it to admit a unitary ACM. By setting appropriate relations between the parameters through coupled geometric and analytical considerations, we further constrain the polynomial structure in such a way as to identify the set $[P_{3 ϵ} (z)]$ of all and only the third-degree polynomials that admit a unitary ACM. Then, we conclude our analysis by constructing the associated ACM.

The results of this study can be further explored and usefully applied to physical and mathematical contexts, including, at their intersection, the research area of quantum computing. For example, for time-dependent parameters, the prescription of a time-dependent characteristic polynomial of $[P_{3} (z)]$ or $[P_{3 ϵ} (z)]$ leads to a unitary time-dependent matrix, hence to the pertinent time-dependent Hamiltonian that generates the time evolution of a qutrit, whose properties can thus be traced back to the prescription of $P_{3} (z)$. In mathematical contexts, the analysis developed in this study can be extended to polynomials of a higher degree. For example, in the case of a fifth-degree polynomial, our protocol may provide conditions on the coefficients of the polynomial such that its roots are real. The rich analysis enabled by the use of third-degree polynomials in this study sets a clearer basis to conceive the extension of the analysis to higher-degree polynomials.

Author Contributions

Contributions: Conceptualization, resources, and writing—original draft preparation, L.A.M. and A.M. (Antonino Messina); methodology, formal analysis, investigation, and writing—review and editing, L.A.M., A.M. (Agostino Migliore), and A.M. (Antonino Messina); supervision and project administration, A.M. (Antonino Messina). All authors have read and agreed to the published version of the manuscript.

Funding

L.M. acknowledges funding from the NWO Gravitation Program Quantum Software Consortium. L.M. acknowledges partial support by the Roadmap for the Development of Quantum Technologies in Russian Federation, contract No. 868-1.3-15/15-2021. A.M. (Agostino Migliore) acknowledges funding from the European Union—NextGenerationEU, within the National Center for HPC, Big Data, and Quantum Computing (project no. CN00000013, CN1 Spoke 10: “Quantum Computing”).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

All necessary data are contained in the article.

Acknowledgments

L.M. thanks Maria Carmela Lombardo for her invitation to the University of Palermo for scientific collaboration.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Derivation of Equations (34)–(43)

In this appendix, we find the roots of polynomial (5) in the case in which

$\begin{matrix} Δ (p, q) \equiv \frac{p^{3}}{27} + \frac{q^{2}}{4} \geq 0 . \end{matrix}$

(A1)

From Equation (28) it is immediately seen that the inequality (A1) implies $χ^{2} < 1$ or $χ^{2} > 1$ depending on whether $p > 0$ or $p < 0$, respectively. As a consequence, Equations (17), (24), and (27) lead to different polynomial roots depending on the sign of $p$, in agreement with previous analyses [50].

We begin by considering the case $p > 0$, which means $Θ_{p} = 0$, whence $φ_{p} = π$ according to Equation (13), and $χ$ is given by Equation (39), i.e.,

$χ = - i u, \quad u = \frac{3 q}{2 p} \sqrt{\frac{3}{p}} .$

(A2)

In this case, $\operatorname{Arg} (1 - χ^{2}) = 0$ and Equation (17) gives

$\begin{matrix} Φ_{13} & = & \operatorname{Arg} (χ + i \sqrt{1 - χ^{2}}) - i \ln | χ + i \sqrt{1 - χ^{2}} | \\ & = & \frac{π}{2} - i \ln (\sqrt{1 + u^{2}} - u) = \frac{π}{2} + i \ln (\sqrt{1 + u^{2}} + u) . \end{matrix}$

(A3)

Then, using Euler’s formula and rationalization, Equation (24) easily yields

${\tilde{η}}_{1} = \frac{\sqrt{3}}{2} (A + B) + \frac{i}{2} (A - B)$

(A4)

with

$\begin{matrix} A = \sqrt[3]{\sqrt{1 + u^{2}} - u}, \quad B = \sqrt[3]{\sqrt{1 + u^{2}} + u}, \end{matrix}$

(A5)

whence

$\begin{matrix} z_{1} = i \sqrt{\frac{p}{3}} {\tilde{η}}_{1} - \frac{c_{1}}{3} = \frac{\sqrt{p}}{2} [\frac{1}{\sqrt{3}} (B - A) + i (A + B)] - \frac{c_{1}}{3}, \end{matrix}$

(A6)

namely Equation (34), with $u = i χ$ and

$\begin{matrix} X = A + B, \quad Y = \frac{B - A}{\sqrt{3}} . \end{matrix}$

(A7)

Considering that $A B = 1$ and using Equation (27), after some lengthy algebra one obtains

$\begin{matrix} {\tilde{η}}_{k} & = & \frac{\sqrt{3}}{2} [- \frac{A + B}{2} + i \frac{B - A}{2 \sqrt{3}} + {(- 1)}^{k + 1} \sqrt{1 - \frac{{(B - A)}^{2}}{2} + i \frac{\sqrt{3} (B^{2} - A^{2})}{2}}] \\ & = & \frac{\sqrt{3}}{2} [- \frac{X}{2} + i \frac{Y}{2} + \frac{{(- 1)}^{k + 1}}{2} \sqrt{{(X + 3 i Y)}^{2}}], \quad k = 2, 3 . \end{matrix}$

(A8)

Choosing the root in the last expression in accordance with Equation (27), we thus have

${\tilde{η}}_{2} = - \frac{\sqrt{3}}{2} (X + i Y)$

(A9)

and

${\tilde{η}}_{3} = i \sqrt{3} Y,$

(A10)

from which the root expressions (35) and (36) immediately result.

Consider, for example, the equation $z^{3} + 4 z - 7 \sqrt{3} = 0$, which is directly given in canonical form, so that $c_{1} = 0$, $c_{2} = p = 4$, and $c_{3} = q = - 7 \sqrt{3}$. In this case, $u = - \frac{63}{16}$, whence $A = 2$, $B = \frac{1}{2}$ and, therefore, $X = \frac{5}{2}$, $Y = - \frac{\sqrt{3}}{2}$. Equations (34)–(36) thus give $z_{1} = \frac{1}{2} (- \sqrt{3} + 5 i)$, $z_{2} = - \frac{1}{2} (\sqrt{3} + 5 i)$, and $z_{3} = \sqrt{3}$, which are readily verified to be the roots of the above equation.
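The worked example above can be reproduced numerically. The following minimal sketch evaluates (A2), (A5), (A7), (A4), (A9), and (A10) in floating-point arithmetic and then maps each ${\tilde{η}}_{k}$ to a root through $z_{k} = i \sqrt{p / 3} \, {\tilde{η}}_{k} - c_{1} / 3$. This relation is stated in (A6) only for $k = 1$; using it for $k = 2, 3$ is our assumption, checked here by substituting the results back into the cubic. The function name `roots_p_positive` is ours.

```python
import math

def roots_p_positive(p, q, c1=0.0):
    """Roots for the canonical cubic w^3 + p w + q with p > 0, shifted by -c1/3."""
    if p <= 0:
        raise ValueError("this sketch covers p > 0 only")
    u = 3 * q / (2 * p) * math.sqrt(3 / p)        # (A2)
    s = math.sqrt(1 + u * u)                      # s > |u|, so s - u and s + u are positive
    A = (s - u) ** (1 / 3)                        # (A5)
    B = (s + u) ** (1 / 3)
    X = A + B                                     # (A7)
    Y = (B - A) / math.sqrt(3)
    eta = [math.sqrt(3) / 2 * (A + B) + 0.5j * (A - B),   # (A4)
           -math.sqrt(3) / 2 * (X + 1j * Y),              # (A9)
           1j * math.sqrt(3) * Y]                         # (A10)
    # Assumed relation between eta_k and z_k, as written in (A6) for k = 1.
    return [1j * math.sqrt(p / 3) * e - c1 / 3 for e in eta]

p, q = 4.0, -7.0 * math.sqrt(3.0)                 # the example z^3 + 4 z - 7 sqrt(3) = 0
roots = roots_p_positive(p, q)
residuals = [abs(z ** 3 + p * z + q) for z in roots]
print(roots)
print(residuals)
```

Up to round-off, the three values returned agree with $z_{1}$, $z_{2}$, and $z_{3}$ given in the text, and the residuals are negligible.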

For $p < 0$, we have $Θ_{p} = π$, from which $φ_{p} = 2 π$, and

$χ = - \frac{3 q}{2 p} \sqrt{- \frac{3}{p}} .$

(A11)

Therefore, $\operatorname{Arg} (1 - χ^{2}) = π$ and Equation (17) gives

$\begin{matrix} Φ_{13} & = & \operatorname{Arg} (χ - \sqrt{χ^{2} - 1}) - i \ln (χ - \sqrt{χ^{2} - 1}) \\ & = & i \ln (χ + \sqrt{χ^{2} - 1}) \equiv i ν . \end{matrix}$

(A12)

Using Euler’s formula again, we obtain

$\begin{matrix} {\tilde{η}}_{1} = 2 \cos (i \frac{ν}{3}) = 2 \cosh (\frac{ν}{3}) = C \end{matrix}$

(A13)

with $C$ given by Equation (42), from which Equation (40) immediately follows.

Since

$\begin{matrix} \sin^{2} (i \frac{ν}{3}) = - \sinh^{2} (\frac{ν}{3}) = 1 - \cosh^{2} (\frac{ν}{3}) = 1 - \frac{C^{2}}{4} < 0 \quad \forall ν \neq 0, \end{matrix}$

(A14)

Equation (27) gives

$\begin{matrix} {\tilde{η}}_{k} & = & - [\frac{C}{2} + i {(- 1)}^{k} \sqrt{3 (\frac{C^{2}}{4} - 1)}], \quad k = 2, 3, \end{matrix}$

(A15)

whence Equation (41).
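The hyperbolic case can be illustrated in the same way. The sketch below evaluates (A11)–(A13) and (A15) for a canonical cubic with $p < 0$ and $Δ \geq 0$. Two points are assumptions of ours rather than statements taken from this appendix: the roots are recovered as $z_{k} = - \sqrt{- p / 3} \, {\tilde{η}}_{k} - c_{1} / 3$ (our reading of Equations (40) and (41), which are not reproduced here), and the sketch is restricted to $q > 0$, so that $χ \geq 1$ and the logarithm in (A12) is real. The test equation $z^{3} - 3 z + 4 = 0$ is also our choice.

```python
import math

def roots_p_negative(p, q, c1=0.0):
    """Roots for the canonical cubic w^3 + p w + q with p < 0, q > 0, Delta >= 0."""
    if p >= 0 or q <= 0:
        raise ValueError("this sketch covers p < 0 and q > 0 only")
    chi = -3 * q / (2 * p) * math.sqrt(-3 / p)        # (A11)
    if chi < 1:
        raise ValueError("Delta < 0: the hyperbolic form does not apply")
    nu = math.log(chi + math.sqrt(chi * chi - 1))     # (A12)
    C = 2 * math.cosh(nu / 3)                         # (A13)
    w = math.sqrt(max(3 * (C * C / 4 - 1), 0.0))      # square root appearing in (A15)
    eta = [C] + [-(C / 2 + 1j * (-1) ** k * w) for k in (2, 3)]   # (A13), (A15)
    scale = -math.sqrt(-p / 3)                        # ASSUMED prefactor, see lead-in
    return [scale * e - c1 / 3 for e in eta]

p, q = -3.0, 4.0                                      # z^3 - 3 z + 4 = 0, Delta = 3 > 0
roots = roots_p_negative(p, q)
residuals = [abs(z ** 3 + p * z + q) for z in roots]
print(roots)
print(residuals)
```

For this equation the sketch returns one real root near $-2.196$ and a complex-conjugate pair, and substituting them back into the cubic gives residuals at round-off level, which supports the assumed prefactor for $q > 0$.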

Appendix A.2. Comments

Comment A1.

A monic polynomial is a univariate polynomial in which the leading coefficient is equal to 1.

Comment A2.

A square matrix is called derogatory (non-derogatory) when the degree of its minimal polynomial is less than (equal to) the order n of the matrix. The term derogatory was coined by Sylvester in the early 1880s. Etymologically, it probably originates from the Latin verb “derogare”, in its particular meaning of “decrease”, to underline that the degree of the minimal polynomial of the matrix is less than n.

Comment A3.

As reported by Hawkins [3], the German term “Begleitmatrix” was coined by Loewy in 1917 [59]. In 1946, MacDuffee introduced the term “companion matrix” as its English translation.

Comment A4.

We point out that, in this case, the Frobenius matrix is no longer diagonalizable, since, being non-derogatory by construction, it necessarily has at least one eigenvalue with geometric multiplicity smaller than the (algebraic) multiplicity.

Comment A5.

By parametric we mean that the entries of the ACM can be functions of the polynomial coefficients.

Comment A6.

That is, under any transformation of the form A → P⁻¹ AP, where P is a non-singular matrix.

Comment A7.

The Cayley–Hamilton theorem states that every square matrix over a commutative ring (such as the real or complex numbers, or the integers) satisfies its own characteristic equation.

Comment A8.

An equivalent solution was obtained in [49], using an angle offset by −π with respect to Φ₁₃. Solutions in the trigonometric form of the canonical (or depressed) cubic equation equivalent to those presented here and in [49] were obtained by François Viète (1540–1603) for the case in which the polynomial in Equation (4) is real.

Comment A9.

Note that the discriminant can also be defined with the opposite sign [60]. Clearly, in this case, all of the inequalities involving the discriminant of the polynomial in canonical form are to be inverted.

References

1. Frobenius, G. Theorie der linearen Formen mit ganzen Coefficienten. J. Reine Angew. Math. 1879, 86, 146–208.
2. Horn, R.A.; Johnson, C.R. Matrix Analysis; Cambridge University Press: Cambridge, UK, 2013.
3. Hawkins, T. The Mathematics of Frobenius in Context: A Journey through 18th to 20th Century Mathematics, Sources and Studies in the History of Mathematics and Physical Sciences; Springer: New York, NY, USA, 2013.
4. Barnett, S. Congenial Matrices. Linear Algebra Appl. 1981, 41, 277–298.
5. Aurentz, J.L.; Vandebril, R.; Watkins, D.S. Fast Computation of the Zeros of a Polynomial via Factorization of the Companion Matrix. SIAM J. Scien. Comp. 2013, 35, A255–A269.
6. De Teran, F.; Dopico, F.M.; Perez, J. New bounds for roots of polynomials based on Fiedler companion matrices. Linear Algebra Appl. 2014, 451, 197–230.
7. Bini, D.A. Numerical computation of polynomial zeros by means of Aberth’s method. Numer. Algorithms 1996, 13, 179–200.
8. Higham, N.J.; Tisseur, F. Bounds for eigenvalues of matrix polynomials. Linear Algebra Appl. 2003, 358, 5–22.
9. Bueno, M.I.; De Terán, F.; Dopico, F.M. Recovery of eigenvectors and minimal bases of matrix polynomials from generalized Fiedler linearizations. SIAM J. Matrix Anal. Appl. 2011, 32, 463–483.
10. Antoniou, E.; Vologiannidis, S. A new family of companion forms of polynomial matrices. Electron. J. Linear Algebra 2004, 11, 78–87.
11. Brand, L. Applications of the companion matrix. Am. Math. Mon. 1968, 75, 146–152.
12. Wardlaw, W.P. Matrix representation of finite fields. Math. Mag. 1994, 67, 289–293.
13. Szederkényi, G.; Lakner, R.; Gerzson, M. Intelligent Control Systems: An Introduction with Examples; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2006; Volume 60.
14. Lim, A.; Dai, J. On product of companion matrices. Linear Algebra Appl. 2011, 435, 2921–2935.
15. Specht, W. Die Lage der Nullstellen eines Polynoms III. Math. Nachr. 1957, 16, 257–263.
16. Specht, W. Die Lage der Nullstellen eines Polynoms IV. Math. Nachr. 1960, 21, 201–222.
17. Good, I. The colleague matrix, a Chebyshev analogue of the companion matrix. Quart. J. Math. Oxford Ser. 1961, 12, 61–68.
18. Maroulas, J.; Barnett, S. Polynomials with respect to a general basis. J. Math. Anal. Appl 1979, 72, 177–194.
19. Brändén, P. On linear transformations preserving the Polya frequency property. Trans. Am. Math. Soc. 2006, 358, 3697–3716.
20. Haglund, J. Further Investigations Involving Rook Polynomials with Only Real Zeros. Eur. J. Comb. 2000, 21, 1017–1037.
21. Pitman, J. Probabilistic Bounds on the Coefficients of Polynomials with Only Real Zeros. J. Comb. Theory Ser. A 1997, 77, 279–303.
22. Wagner, D.G. The partition polynomial of a finite set system. J. Comb. Theory Ser. A 1991, 56, 138–159.
23. Wagner, D.G. Total positivity of Hadamard products. J. Math. Anal. Appl. 1992, 163, 459–483.
24. Wang, Y.; Yeh, Y.N. Polynomials with real zeros and Polya frequency sequences. J. Comb. Theory Ser. A 2005, 109, 63–74.
25. Sun, G.; Su, S.; Xu, M. Quantum Algorithm for Polynomial Root Finding Problem. In Proceedings of the 2014 Tenth International Conference on Computational Intelligence and Security, Kunming, China, 5–16 November 2014; pp. 469–473.
26. Nagata, K.; Nakamura, T.; Geurdes, H.; Batle, J.; Farouk, A.; Diep, D.; Patro, S.K. Efficient Quantum Algorithms of Finding the Roots of a Polynomial Function. Int. J. Theor. Phys. 2018, 57, 2546–2555.
27. Nagata, K.; Nakamura, T. Quantum algorithm for the root-finding problem. Quant. Stud. Math. Found. 2019, 1, 2196–5617.
28. Tansuwannont, T.; Limkumnerd, S.; Suwanna, S.; Kalasuwan, P. Quantum Phase Estimation Algorithm for Finding Polynomial Roots. Open Phys. J. 2019, 17, 839–849.
29. Tan, L.; Pugh, A. Spectral structures of the generalized companion form and applications. Syst. Control. Lett. 2002, 46, 75–84.
30. Weigert, S. A quantum search for zeros of polynomials. J. Opt. B Quantum Semiclassical Opt. 2003, 5, S586–S588.
31. Spengler, C.; Kraus, B. Graph-state formalism for mutually unbiased bases. Phys. Rev. A 2013, 88, 052323.
32. Schmeisser, G. A real symmetric tridiagonal matrix with a given characteristic polynomial. Linear Algebra Appl. 1993, 193, 11–18.
33. Fiedler, M. Expressing a polynomial as the characteristic polynomial of a symmetric matrix. Linear Algebra Appl. 1990, 141, 265–270.
34. Eastman, B.; Kim, I.J.; Shader, B.; Vander Meulen, K. Companion matrix patterns. Linear Algebra Appl. 2014, 463, 255–272.
35. Deaett, L.; Fischer, J.; Garnett, C.; Vander Meulen, K. Non-sparse companion matrices. Electron. J. Linear Algebra 2019, 35, 223–247.
36. Borisenko, A.; Tarapov, I.E. Vector and Tensor Analysis with Applications; Courier Corporation: Chelmsford, MA, USA, 1968.
37. Kalman, D. A matrix proof of Newton’s identities. Math. Mag. 2000, 73, 313–315.
38. Prasolov, V.V. Problems and Theorems in Linear Algebra; American Math. Society: Providence, RI, USA, 1994; Volume 134.
39. Boas, M.L. Mathematical Methods in the Physical Sciences; John Wiley & Sons: Hoboken, NJ, USA, 2006.
40. Hadamard, J. Sur les problèmes aux dérivées partielles et leur signification physique. Princet. Univ. Bull. 1902, 49–52.
41. Kabanikhin, S. Definitions and examples of inverse and ill-posed problems. J. Inv. Ill-Posed Probl. 2008, 16, 317–357.
42. von Würtemberg, I. Ill-Posed Problems and Their Applications to Climate Research, U.U.D.M. Project Report; Technical Report; Department of Mathematics, Uppsala University: Uppsala, Sweden, 2011.
43. Barnett, S. Matrices: Methods and Applications; Oxford University Press: Oxford, UK, 1990.
44. Funkhouser, H.G. A Short Account of the History of Symmetric Functions of Roots of Equations. Am. Math. Mon. 1930, 37, 357–365.
45. Abramowitz, M.; Stegun, I. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables; US Government Printing Office: Washington, DC, USA, 1964; Volume 55.
46. Haber, H.E. The Complex Inverse Trigonometric and Hyperbolic Functions; University of California: Berkeley, CA, USA, 2022; Available online: http://scipp.ucsc.edu/~haber/webpage/arc3.pdf (accessed on 16 January 2023).
47. Kahan, W. Branch cuts for complex elementary functions. In The State of the Art in Numerical Analysis; Powell, M.J.D., Iserles, A., Eds.; Oxford University Press: New York, NY, USA, 1987.
48. Milovanovic, G.V.; Mitrinovic, D.S.; Rassias, T.M. Topics in Polynomials; World Scientific: Singapore, 1994.
49. Lambert, W.D. A Generalized Trigonometric Solution of the Cubic Equation. Am. Math. Mon. 1906, 13, 73–76.
50. Tricomi, F.G. Lezioni di Analisi Matematica; Cedam: Lansing, MI, USA, 1956; Volume 1.
51. Kurtz, D.C. A Sufficient Condition for All the Roots of a Polynomial To Be Real. Am. Math. Mon. 1992, 99, 259–263.
52. Carmichael, H.J.; Glauber, R.J.; Scully, M.O. Directions in Quantum Optics: A Collection of Papers Dedicated to the Memory of Dan Walls Including Papers Presented at the TAMU-ONR Workshop Held at Jackson, Wyoming, USA, 26–30 July 1999; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2001; Volume 561.
53. Fano, U. Description of states in quantum mechanics by density matrix and operator techniques. Rev. Mod. Phys. 1957, 29, 74.
54. Blanchard, P.; Brüning, E. Mathematical Methods in Physics: Distributions, Hilbert Space Operators, Variational Methods, and Applications in Quantum Physics; Birkhäuser: Basel, Switzerland, 2015; Volume 69.
55. Connor, M. A historical Survey of Methods of Solving Cubic Equations. Master’s Thesis, University of Richmond, Richmond, VA, USA, 1956.
56. Descartes, R. The Geometry of Rene Descartes: With a Facsimile of the First Edition; Courier Corporation: Chelmsford, MA, USA, 2012.
57. Brüning, E.; Mäkelä, H.; Messina, A.; Petruccione, F. Parametrizations of density matrices. J. Mod. Opt. 2012, 59, 1–20.
58. Armitage, J.; Eberlein, W.F. Elliptic Functions; Cambridge University Press: Cambridge, UK, 2006; Volume 67.
59. Loewy, A. Die Begleitmatrix eines linearen homogenen Differentialausdruckes. Nachr. Ges. Der Wiss. Göttingen Math. Phys. Kl. 1917, 1917, 255–263.
60. Dickson, L.E. First Course in the Theory of Equations; J. Wiley & Sons, Incorporated: Hoboken, NJ, USA, 1922.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

